Dopamine signals as temporal difference errors: recent advances.
Curr Opin Neurobiol; 67: 95-105, 2021 Apr.
Article | En | MEDLINE | ID: mdl-33186815
In the brain, dopamine is thought to drive reward-based learning by signaling temporal difference reward prediction errors (TD errors), a 'teaching signal' used to train computers. Recent studies using optogenetic manipulations have provided multiple lines of evidence that phasic dopamine signals function as TD errors. Furthermore, novel experimental results indicate that when the current state of the environment is uncertain, dopamine neurons compute TD errors using 'belief states', that is, probability distributions over potential states. It remains unclear how belief states are computed, but emerging evidence suggests involvement of the prefrontal cortex and the hippocampus. These results refine our understanding of the role of dopamine in learning and of the algorithms by which dopamine functions in the brain.
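To make the two computational ideas in the abstract concrete, the following is a minimal sketch of a TD error and of a TD update computed over belief states. It uses the standard TD(0) form, delta = r + gamma * V(s') - V(s). The linear value function over beliefs, the function names, and the parameter values (`alpha`, `gamma`, the example beliefs) are illustrative assumptions and are not taken from the article.

```python
# Illustrative sketch only: all names and parameter values are assumptions,
# not taken from the article.

def td_error(reward, value_now, value_next, gamma=0.9):
    """Standard TD(0) error: delta = r + gamma * V(s') - V(s)."""
    return reward + gamma * value_next - value_now


def belief_value(belief, weights):
    """Value of a belief state, assumed linear in the belief:
    V(b) = sum over hidden states s of b(s) * w(s)."""
    return sum(b * w for b, w in zip(belief, weights))


def belief_td_update(belief, next_belief, reward, weights, alpha=0.1, gamma=0.9):
    """One TD(0) learning step over belief states.

    Returns the TD error and the updated per-state weights. Each weight
    moves in proportion to the probability assigned to its hidden state.
    """
    delta = td_error(
        reward,
        belief_value(belief, weights),
        belief_value(next_belief, weights),
        gamma,
    )
    new_weights = [w + alpha * delta * b for w, b in zip(weights, belief)]
    return delta, new_weights


if __name__ == "__main__":
    # Two hypothetical hidden states; the agent is unsure which one it is in.
    weights = [0.0, 0.0]
    belief = [0.7, 0.3]
    next_belief = [0.2, 0.8]
    delta, weights = belief_td_update(belief, next_belief, reward=1.0, weights=weights)
    print("TD error:", delta)
    print("updated weights:", weights)
```

When the belief places all probability on a single state, this reduces to the ordinary TD update on fully observed states, which is one way to see why belief states are a natural extension under uncertainty.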
Full text: 1
Collection: 01-internacional
Database: MEDLINE
Main subject: Reward / Dopamine
Language: En
Journal: Curr Opin Neurobiol
Journal subject: Biology / Neurology
Year: 2021
Document type: Article
Country of affiliation: United States
Country of publication: United Kingdom