Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 18 de 18
Filtrar
Más filtros











Base de datos
Intervalo de año de publicación
1.
bioRxiv ; 2024 May 04.
Artículo en Inglés | MEDLINE | ID: mdl-38746300

RESUMEN

Midbrain dopamine neurons promote reinforcement learning and movement vigor. A major outstanding question is how dopamine-recipient neurons in the striatum parse these heterogeneous signals. Here we characterized dopamine and acetylcholine release in the dorsomedial striatum (DMS) of rats performing a decision-making task. We found that dopamine acted as a reward prediction error (RPE), modulating behavior and DMS spiking on subsequent trials when coincident with pauses in cholinergic release. In contrast, at task events that elicited coincident bursts of acetylcholine and dopamine, dopamine preceded contralateral movements and predicted movement vigor without inducing plastic changes in DMS firing rates. Our findings provide a circuit-level mechanism by which cholinergic modulation allows the same dopamine signals to be used for either movement or learning depending on instantaneous behavioral context.

2.
bioRxiv ; 2024 Apr 18.
Artículo en Inglés | MEDLINE | ID: mdl-38659861

RESUMEN

Biological accounts of reinforcement learning posit that dopamine encodes reward prediction errors (RPEs), which are multiplied by a learning rate to update state or action values. These values are thought to be represented in synaptic weights in the striatum, and updated by dopamine-dependent plasticity, suggesting that dopamine release might reflect the product of the learning rate and RPE. Here, we leveraged the fact that animals learn faster in volatile environments to characterize dopamine encoding of learning rates. We trained rats on a task with semi-observable states offering different rewards, and rats adjusted how quickly they initiated trials across states using RPEs. Computational modeling and behavioral analyses showed that learning rates were higher following state transitions, and scaled with trial-by-trial changes in beliefs about hidden states, approximating normative Bayesian strategies. Notably, dopamine release in the nucleus accumbens encoded RPEs independent of learning rates, suggesting that dopamine-independent mechanisms instantiate dynamic learning rates.

3.
bioRxiv ; 2024 Feb 29.
Artículo en Inglés | MEDLINE | ID: mdl-38318205

RESUMEN

Recurrent neural networks (RNN) are ubiquitously used in neuroscience to capture both neural dynamics and behaviors of living systems. However, when it comes to complex cognitive tasks, traditional methods for training RNNs can fall short in capturing crucial aspects of animal behavior. To address this challenge, we take inspiration from a commonly used (though rarely appreciated) approach from the experimental neuroscientist's toolkit: behavioral shaping. Our solution leverages task compositionality and models the animal's relevant learning experiences prior to the task. Taking as target a temporal wagering task previously studied in rats, we designed a pretraining curriculum of simpler cognitive tasks that are prerequisites for performing it well. These pretraining tasks are not just simplified versions of the temporal wagering task, but reflect relevant sub-computations. We show that this approach is required for RNNs to adopt similar strategies as rats, including long-timescale inference of latent states, which conventional pretraining approaches fail to capture. Mechanistically, our pretraining supports the development of key dynamical systems features needed for implementing both inference and value-based decision making. Overall, our approach addresses a gap in neural network model training by incorporating inductive biases of animals, which is important when modeling complex behaviors that rely on computational abilities acquired from past experiences.

4.
bioRxiv ; 2023 Dec 10.
Artículo en Inglés | MEDLINE | ID: mdl-38105956

RESUMEN

Gonadal hormones act throughout the brain 1 , and nearly all neuropsychiatric disorders vary in symptom severity with hormonal fluctuations over the reproductive cycle, gestation, and perimenopause 2-4 . Yet the mechanisms by which hormones influence mental and cognitive processes are unclear. Exogenous estrogenic hormones modulate dopamine signaling in the nucleus accumbens core (NAcc) 5,6 , which instantiates reward prediction errors (RPEs) for reinforcement learning 7-16 . Here we show that endogenous estrogenic hormones enhance RPEs and sensitivity to previous rewards by regulating expression of dopamine reuptake proteins in the NAcc. We trained rats to perform a temporal wagering task with different reward states; rats adjusted how quickly they initiated trials across states, balancing effort against expected rewards. Dopamine release in the NAcc reflected RPEs that predicted and causally in-fluenced subsequent initiation times. When fertile, females more quickly adjusted their initiation times to match reward states due to enhanced dopaminergic RPEs in the NAcc. Proteomics revealed reduced expression of dopamine transporters in fertile stages of the reproductive cycle. Finally, genetic suppression of midbrain estrogen receptors eliminated hormonal modulation of behavior. Estrogenic hormones therefore control the rate of reinforcement learning by regulating RPEs via dopamine reuptake, providing a mechanism by which hormones influence neural dynamics for motivation and learning.

5.
Nat Commun ; 14(1): 7573, 2023 Nov 21.
Artículo en Inglés | MEDLINE | ID: mdl-37989741

RESUMEN

The value of the environment determines animals' motivational states and sets expectations for error-based learning1-3. How are values computed? Reinforcement learning systems can store or cache values of states or actions that are learned from experience, or they can compute values using a model of the environment to simulate possible futures3. These value computations have distinct trade-offs, and a central question is how neural systems decide which computations to use or whether/how to combine them4-8. Here we show that rats use distinct value computations for sequential decisions within single trials. We used high-throughput training to collect statistically powerful datasets from 291 rats performing a temporal wagering task with hidden reward states. Rats adjusted how quickly they initiated trials and how long they waited for rewards across states, balancing effort and time costs against expected rewards. Statistical modeling revealed that animals computed the value of the environment differently when initiating trials versus when deciding how long to wait for rewards, even though these decisions were only seconds apart. Moreover, value estimates interacted via a dynamic learning rate. Our results reveal how distinct value computations interact on rapid timescales, and demonstrate the power of using high-throughput training to understand rich, cognitive behaviors.


Asunto(s)
Aprendizaje , Refuerzo en Psicología , Ratas , Animales , Recompensa , Motivación
6.
bioRxiv ; 2023 Aug 02.
Artículo en Inglés | MEDLINE | ID: mdl-36993514

RESUMEN

The value of the environment determines animals' motivational states and sets expectations for error-based learning1-3. How are values computed? Reinforcement learning systems can store or "cache" values of states or actions that are learned from experience, or they can compute values using a model of the environment to simulate possible futures3. These value computations have distinct trade-offs, and a central question is how neural systems decide which computations to use or whether/how to combine them4-8. Here we show that rats use distinct value computations for sequential decisions within single trials. We used high-throughput training to collect statistically powerful datasets from 291 rats performing a temporal wagering task with hidden reward states. Rats adjusted how quickly they initiated trials and how long they waited for rewards across states, balancing effort and time costs against expected rewards. Statistical modeling revealed that animals computed the value of the environment differently when initiating trials versus when deciding how long to wait for rewards, even though these decisions were only seconds apart. Moreover, value estimates interacted via a dynamic learning rate. Our results reveal how distinct value computations interact on rapid timescales, and demonstrate the power of using high-throughput training to understand rich, cognitive behaviors.

7.
Proc Natl Acad Sci U S A ; 120(2): e2212120120, 2023 01 10.
Artículo en Inglés | MEDLINE | ID: mdl-36598952

RESUMEN

The process by which sensory evidence contributes to perceptual choices requires an understanding of its transformation into decision variables. Here, we address this issue by evaluating the neural representation of acoustic information in the auditory cortex-recipient parietal cortex, while gerbils either performed a two-alternative forced-choice auditory discrimination task or while they passively listened to identical acoustic stimuli. During task engagement, stimulus identity decoding performance from simultaneously recorded parietal neurons significantly correlated with psychometric sensitivity. In contrast, decoding performance during passive listening was significantly reduced. Principal component and geometric analyses revealed the emergence of low-dimensional encoding of linearly separable manifolds with respect to stimulus identity and decision, but only during task engagement. These findings confirm that the parietal cortex mediates a transition of acoustic representations into decision-related variables. Finally, using a clustering analysis, we identified three functionally distinct subpopulations of neurons that each encoded task-relevant information during separate temporal segments of a trial. Taken together, our findings demonstrate how parietal cortex neurons integrate and transform encoded auditory information to guide sound-driven perceptual decisions.


Asunto(s)
Corteza Auditiva , Lóbulo Parietal , Animales , Lóbulo Parietal/fisiología , Percepción Auditiva/fisiología , Corteza Auditiva/fisiología , Estimulación Acústica , Acústica , Gerbillinae
8.
Elife ; 102021 10 25.
Artículo en Inglés | MEDLINE | ID: mdl-34693908

RESUMEN

Studies of neural dynamics in lateral orbitofrontal cortex (lOFC) have shown that subsets of neurons that encode distinct aspects of behavior, such as value, may project to common downstream targets. However, it is unclear whether reward history, which may subserve lOFC's well-documented role in learning, is represented by functional subpopulations in lOFC. Previously, we analyzed neural recordings from rats performing a value-based decision-making task, and we documented trial-by-trial learning that required lOFC (Constantinople et al., 2019). Here, we characterize functional subpopulations of lOFC neurons during behavior, including their encoding of task variables. We found five distinct clusters of lOFC neurons, either based on clustering of their trial-averaged peristimulus time histograms (PSTHs), or a feature space defined by their average conditional firing rates aligned to different task variables. We observed weak encoding of reward attributes, but stronger encoding of reward history, the animal's left or right choice, and reward receipt across all clusters. Only one cluster, however, encoded the animal's reward history at the time shortly preceding the choice, suggesting a possible role in integrating previous and current trial outcomes at the time of choice. This cluster also exhibits qualitatively similar responses to identified corticostriatal projection neurons in a recent study (Hirokawa et al., 2019), and suggests a possible role for subpopulations of lOFC neurons in mediating trial-by-trial learning.


Asunto(s)
Conducta de Elección/fisiología , Aprendizaje/fisiología , Neuronas/fisiología , Corteza Prefrontal/fisiología , Recompensa , Animales , Masculino , Ratas , Ratas Long-Evans
9.
Cell ; 184(10): 2534-2536, 2021 05 13.
Artículo en Inglés | MEDLINE | ID: mdl-33989547

RESUMEN

In this issue of Cell, Spellman and colleagues record and manipulate the activity of neurons in the medial prefrontal cortex of mice performing a task in which they must pay attention to different stimuli. They show that this brain region is important for monitoring the animals' performance, and neurons that appear to contribute to behavior reside in deep cortical layers.


Asunto(s)
Neuronas , Corteza Prefrontal , Animales , Ratones
10.
Curr Biol ; 30(17): 3293-3303.e4, 2020 09 07.
Artículo en Inglés | MEDLINE | ID: mdl-32619478

RESUMEN

Sensory-driven decisions are formed by accumulating information over time. Although parietal cortex activity is thought to represent accumulated evidence for sensory-based decisions, recent perturbation studies in rodents and non-human primates have challenged the hypothesis that these representations actually influence behavior. Here, we asked whether the parietal cortex integrates acoustic features from auditory cortical inputs during a perceptual decision-making task. If so, we predicted that selective inactivation of this projection should impair subjects' ability to accumulate sensory evidence. We trained gerbils to perform an auditory discrimination task and obtained measures of integration time as a readout of evidence accumulation capability. Minimum integration time was calculated behaviorally as the shortest stimulus duration for which subjects could discriminate the acoustic signals. Direct pharmacological inactivation of parietal cortex increased minimum integration times, suggesting its role in the behavior. To determine the specific impact of sensory evidence, we chemogenetically inactivated the excitatory projections from auditory cortex to parietal cortex and found this was sufficient to increase minimum behavioral integration times. Our signal-detection-theory-based model accurately replicated behavioral outcomes and indicated that the deficits in task performance were plausibly explained by elevated sensory noise. Together, our findings provide causal evidence that parietal cortex plays a role in the network that integrates auditory features for perceptual judgments.


Asunto(s)
Corteza Auditiva/fisiología , Percepción Auditiva/fisiología , Conducta Animal/fisiología , Discriminación en Psicología/fisiología , Gerbillinae/fisiología , Lóbulo Parietal/fisiología , Estimulación Acústica , Animales , Femenino , Masculino
11.
Elife ; 82019 11 06.
Artículo en Inglés | MEDLINE | ID: mdl-31692447

RESUMEN

Individual choices are not made in isolation but are embedded in a series of past experiences, decisions, and outcomes. The effects of past experiences on choices, often called sequential biases, are ubiquitous in perceptual and value-based decision-making, but their neural substrates are unclear. We trained rats to choose between cued guaranteed and probabilistic rewards in a task in which outcomes on each trial were independent. Behavioral variability often reflected sequential effects, including increased willingness to take risks following risky wins, and spatial 'win-stay/lose-shift' biases. Recordings from lateral orbitofrontal cortex (lOFC) revealed encoding of reward history and receipt, and optogenetic inhibition of lOFC eliminated rats' increased preference for risk following risky wins, but spared other sequential effects. Our data show that different sequential biases are neurally dissociable, and the lOFC's role in adaptive behavior promotes learning of more abstract biases (here, biases for the risky option), but not spatial ones.


Asunto(s)
Conducta de Elección , Toma de Decisiones , Aprendizaje , Corteza Prefrontal/fisiología , Animales , Masculino , Ratas , Recompensa
12.
Curr Biol ; 29(12): 2066-2074.e5, 2019 06 17.
Artículo en Inglés | MEDLINE | ID: mdl-31155352

RESUMEN

In 1979, Daniel Kahneman and Amos Tversky published a ground-breaking paper titled "Prospect Theory: An Analysis of Decision under Risk," which presented a behavioral economic theory that accounted for the ways in which humans deviate from economists' normative workhorse model, Expected Utility Theory [1, 2]. For example, people exhibit probability distortion (they overweight low probabilities), loss aversion (losses loom larger than gains), and reference dependence (outcomes are evaluated as gains or losses relative to an internal reference point). We found that rats exhibited many of these same biases, using a task in which rats chose between guaranteed and probabilistic rewards. However, prospect theory assumes stable preferences in the absence of learning, an assumption at odds with alternative frameworks such as animal learning theory and reinforcement learning [3-7]. Rats also exhibited trial history effects, consistent with ongoing learning. A reinforcement learning model in which state-action values were updated by the subjective value of outcomes according to prospect theory reproduced rats' nonlinear utility and probability weighting functions and also captured trial-by-trial learning dynamics.


Asunto(s)
Toma de Decisiones , Ratas/psicología , Recompensa , Asunción de Riesgos , Animales , Aprendizaje , Masculino , Probabilidad , Ratas Long-Evans , Ratas Wistar , Refuerzo en Psicología
13.
Neuron ; 95(2): 385-398.e5, 2017 Jul 19.
Artículo en Inglés | MEDLINE | ID: mdl-28669543

RESUMEN

Decision-making in dynamic environments often involves accumulation of evidence, in which new information is used to update beliefs and select future actions. Using in vivo cellular resolution imaging in voluntarily head-restrained rats, we examined the responses of neurons in frontal and parietal cortices during a pulse-based accumulation of evidence task. Neurons exhibited activity that predicted the animal's upcoming choice, previous choice, and graded responses that reflected the strength of the accumulated evidence. The pulsatile nature of the stimuli enabled characterization of the responses of neurons to a single quantum (pulse) of evidence. Across the population, individual neurons displayed extensive heterogeneity in the dynamics of responses to pulses. The diversity of responses was sufficiently rich to form a temporal basis for accumulated evidence estimated from a latent variable model. These results suggest that heterogeneous, often transient sensory responses distributed across the fronto-parietal cortex may support working memory on behavioral timescales. VIDEO ABSTRACT.


Asunto(s)
Potenciales de Acción/fisiología , Conducta de Elección/fisiología , Toma de Decisiones/fisiología , Lóbulo Frontal/fisiología , Memoria a Corto Plazo/fisiología , Neuronas/fisiología , Lóbulo Parietal/fisiología , Animales , Conducta Animal/fisiología , Ratas , Factores de Tiempo
14.
Elife ; 4: e11308, 2015 Dec 17.
Artículo en Inglés | MEDLINE | ID: mdl-26673896

RESUMEN

Decision-making behavior is often characterized by substantial variability, but its source remains unclear. We developed a visual accumulation of evidence task designed to quantify sources of noise and to be performed during voluntary head restraint, enabling cellular resolution imaging in future studies. Rats accumulated discrete numbers of flashes presented to the left and right visual hemifields and indicated the side that had the greater number of flashes. Using a signal-detection theory-based model, we found that the standard deviation in their internal estimate of flash number scaled linearly with the number of flashes. This indicates a major source of noise that, surprisingly, is not consistent with the widely used 'drift-diffusion modeling' (DDM) approach but is instead closely related to proposed models of numerical cognition and counting. We speculate that this form of noise could be important in accumulation of evidence tasks generally.


Asunto(s)
Cognición , Toma de Decisiones , Animales , Modelos Neurológicos , Ratas
15.
J Neurosci ; 33(34): 13583-99, 2013 Aug 21.
Artículo en Inglés | MEDLINE | ID: mdl-23966682

RESUMEN

In many cortical neurons, HCN1 channels are the major contributors to Ih, the hyperpolarization-activated current, which regulates the intrinsic properties of neurons and shapes their integration of synaptic inputs, paces rhythmic activity, and regulates synaptic plasticity. Here, we examine the physiological role of Ih in deep layer pyramidal neurons in mouse prefrontal cortex (PFC), focusing on persistent activity, a form of sustained firing thought to be important for the behavioral function of the PFC during working memory tasks. We find that HCN1 contributes to the intrinsic persistent firing that is induced by a brief depolarizing current stimulus in the presence of muscarinic agonists. Deletion of HCN1 or acute pharmacological blockade of Ih decreases the fraction of neurons capable of generating persistent firing. The reduction in persistent firing is caused by the membrane hyperpolarization that results from the deletion of HCN1 or Ih blockade, rather than a specific role of the hyperpolarization-activated current in generating persistent activity. In vivo recordings show that deletion of HCN1 has no effect on up states, periods of enhanced synaptic network activity. Parallel behavioral studies demonstrate that HCN1 contributes to the PFC-dependent resolution of proactive interference during working memory. These results thus provide genetic evidence demonstrating the importance of HCN1 to intrinsic persistent firing and the behavioral output of the PFC. The causal role of intrinsic persistent firing in PFC-mediated behavior remains an open question.


Asunto(s)
Potenciales de Acción/genética , Canales Catiónicos Regulados por Nucleótidos Cíclicos/metabolismo , Función Ejecutiva/fisiología , Memoria/fisiología , Neuronas/fisiología , Canales de Potasio/metabolismo , Corteza Prefrontal/citología , Potenciales de Acción/efectos de los fármacos , Animales , Conducta de Elección/efectos de los fármacos , Canales Catiónicos Regulados por Nucleótidos Cíclicos/genética , Regulación de la Expresión Génica/efectos de los fármacos , Regulación de la Expresión Génica/fisiología , Proteínas Fluorescentes Verdes/genética , Canales Regulados por Nucleótidos Cíclicos Activados por Hiperpolarización , Técnicas In Vitro , Aprendizaje por Laberinto/efectos de los fármacos , Memoria/efectos de los fármacos , Ratones , Ratones Endogámicos C57BL , Ratones Transgénicos , Neuronas/efectos de los fármacos , Neurotransmisores/farmacología , Técnicas de Placa-Clamp , Canales de Potasio/genética , Aprendizaje Seriado/efectos de los fármacos , Aprendizaje Seriado/fisiología , Potenciales Sinápticos/efectos de los fármacos , Potenciales Sinápticos/genética
16.
Science ; 340(6140): 1591-4, 2013 Jun 28.
Artículo en Inglés | MEDLINE | ID: mdl-23812718

RESUMEN

The thalamocortical (TC) projection to layer 4 (L4) is thought to be the main route by which sensory organs communicate with cortex. Sensory information is believed to then propagate through the cortical column along the L4→L2/3→L5/6 pathway. Here, we show that sensory-evoked responses of L5/6 neurons in rats derive instead from direct TC synapses. Many L5/6 neurons exhibited sensory-evoked postsynaptic potentials with the same latencies as L4. Paired in vivo recordings from L5/6 neurons and thalamic neurons revealed substantial convergence of direct TC synapses onto diverse types of infragranular neurons, particularly in L5B. Pharmacological inactivation of L4 had no effect on sensory-evoked synaptic input to L5/6 neurons. L4 is thus not an obligatory distribution hub for cortical activity, and thalamus activates two separate, independent "strata" of cortex in parallel.


Asunto(s)
Potenciales Evocados Somatosensoriales , Neocórtex/fisiología , Tálamo/fisiología , Animales , Neocórtex/citología , Neocórtex/efectos de los fármacos , Neuronas/efectos de los fármacos , Neuronas/fisiología , Ratas , Ratas Wistar , Sinapsis/efectos de los fármacos , Sinapsis/fisiología , Tálamo/citología , Tálamo/efectos de los fármacos
17.
Neuron ; 69(6): 1061-8, 2011 Mar 24.
Artículo en Inglés | MEDLINE | ID: mdl-21435553

RESUMEN

Mammalian brains generate internal activity independent of environmental stimuli. Internally generated states may bring about distinct cortical processing modes. To investigate how brain state impacts cortical circuitry, we recorded intracellularly from the same neurons, under anesthesia and subsequent wakefulness, in rat barrel cortex. In every cell examined throughout layers 2-6, wakefulness produced a temporal pattern of synaptic inputs differing markedly from those under anesthesia. Recurring periods of synaptic quiescence, prominent under anesthesia, were abolished by wakefulness, which produced instead a persistently depolarized state. This switch in dynamics was unaffected by elimination of afferent synaptic input from thalamus, suggesting that arousal alters cortical dynamics by neuromodulators acting directly on cortex. Indeed, blockade of noradrenergic, but not cholinergic, pathways induced synaptic quiescence during wakefulness. We conclude that global brain states can switch local recurrent networks into different regimes via direct neuromodulation.


Asunto(s)
Corteza Cerebral/fisiología , Red Nerviosa/fisiología , Neuronas/fisiología , Vigilia/fisiología , Acetilcolina/metabolismo , Animales , Nivel de Alerta/fisiología , Femenino , Vías Nerviosas/fisiología , Norepinefrina/metabolismo , Técnicas de Placa-Clamp , Ratas , Ratas Wistar , Tálamo/fisiología , Vibrisas/fisiología
18.
J Comp Neurol ; 516(4): 291-311, 2009 Oct 01.
Artículo en Inglés | MEDLINE | ID: mdl-19634181

RESUMEN

Voltage-gated potassium channels that are composed of Kv3 subunits exhibit distinct electrophysiological properties: activation at more depolarized potentials than other voltage-gated K+ channels and fast kinetics. These channels have been shown to contribute to the high-frequency firing of fast-spiking (FS) GABAergic interneurons in the rat and mouse brain. In the rodent neocortex there are distinct patterns of expression for the Kv3.1b and Kv3.2 channel subunits and of coexpression of these subunits with neurochemical markers, such as the calcium-binding proteins parvalbumin (PV) and calbindin D-28K (CB). The distribution of Kv3 channels and interrelationship with calcium-binding protein expression has not been investigated in primate cortex. We used immunoperoxidase and immunofluorescent labeling and stereological counting techniques to characterize the laminar and cell-type distributions of Kv3-immunoreactive (ir) neurons in macaque V1. We found that across the cortical layers approximately 25% of both Kv3.1b- and Kv3.2-ir neurons are non-GABAergic. In contrast, all Kv3-ir neurons in rodent cortex are GABAergic (Chow et al. [1999] J Neurosci. 19:9332-9345). The putatively excitatory Kv3-ir neurons were mostly located in layers 2, 3, and 4b. Further, the proportion of Kv3-ir neurons that express PV or CB also differs between macaque V1 and rodent cortex. These data indicate that, within the population of cortical neurons, a broader population of neurons, encompassing cells of a wider range of morphological classes may be capable of sustaining high-frequency firing in macaque V1.


Asunto(s)
Macaca/anatomía & histología , Neuronas/metabolismo , Canales de Potasio Shaw/metabolismo , Corteza Visual/metabolismo , Ácido gamma-Aminobutírico/metabolismo , Animales , Recuento de Células , Inmunohistoquímica , Macaca/metabolismo , Neuronas/citología , Subunidades de Proteína/metabolismo , Distribución Tisular , Corteza Visual/citología
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA