Dopamine signals as temporal difference errors: recent advances.

Starkweather, Clara Kwon; Uchida, Naoshige

Starkweather, Clara Kwon; Uchida, Naoshige.

Afiliación

Starkweather CK; Center for Brain Science, Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02138, USA.
Uchida N; Center for Brain Science, Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02138, USA. Electronic address: uchida@mcb.harvard.edu.

Curr Opin Neurobiol ; 67: 95-105, 2021 04.

Article en En | MEDLINE | ID: mdl-33186815

RESUMEN

In the brain, dopamine is thought to drive reward-based learning by signaling temporal difference reward prediction errors (TD errors), a 'teaching signal' used to train computers. Recent studies using optogenetic manipulations have provided multiple pieces of evidence supporting that phasic dopamine signals function as TD errors. Furthermore, novel experimental results have indicated that when the current state of the environment is uncertain, dopamine neurons compute TD errors using 'belief states' or a probability distribution over potential states. It remains unclear how belief states are computed but emerging evidence suggests involvement of the prefrontal cortex and the hippocampus. These results refine our understanding of the role of dopamine in learning and the algorithms by which dopamine functions in the brain.

Asunto(s)

Dopamina; Recompensa; Encéfalo; Neuronas Dopaminérgicas; Aprendizaje

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Recompensa / Dopamina Idioma: En Revista: Curr Opin Neurobiol Asunto de la revista: BIOLOGIA / NEUROLOGIA Año: 2021 Tipo del documento: Article País de afiliación: Estados Unidos Pais de publicación: Reino Unido

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google