keyword
MENU ▼
Read by QxMD icon Read
search

Reinforcement learning

keyword
https://www.readbyqxmd.com/read/28732231/kernel-dynamic-policy-programming-applicable-reinforcement-learning-to-robot-systems-with-high-dimensional-states
#1
Yunduan Cui, Takamitsu Matsubara, Kenji Sugimoto
We propose a new value function approach for model-free reinforcement learning in Markov decision processes involving high dimensional states that addresses the issues of brittleness and intractable computational complexity, therefore rendering the value function approach based reinforcement learning algorithms applicable to high dimensional systems. Our new algorithm, Kernel Dynamic Policy Programming (KDPP) smoothly updates the value function in accordance to the Kullback-Leibler divergence between current and updated policies...
June 29, 2017: Neural Networks: the Official Journal of the International Neural Network Society
https://www.readbyqxmd.com/read/28731839/cost-benefit-arbitration-between-multiple-reinforcement-learning-systems
#2
Wouter Kool, Samuel J Gershman, Fiery A Cushman
Human behavior is sometimes determined by habit and other times by goal-directed planning. Modern reinforcement-learning theories formalize this distinction as a competition between a computationally cheap but inaccurate model-free system that gives rise to habits and a computationally expensive but accurate model-based system that implements planning. It is unclear, however, how people choose to allocate control between these systems. Here, we propose that arbitration occurs by comparing each system's task-specific costs and benefits...
July 1, 2017: Psychological Science
https://www.readbyqxmd.com/read/28729439/striatal-gpr88-modulates-foraging-efficiency
#3
Aundrea Rainwater, Elisenda Sanz, Richard D Palmiter, Albert Quintana
The striatum is anatomically and behaviorally implicated in behaviors that promote efficient foraging. To investigate this function, we studied instrumental choice behavior in mice lacking GPR88, a striatum-enriched orphan G-protein-coupled receptor that modulates striatal medium spiny neuron (MSN) excitability. Our results reveal that hungry mice lacking GPR88 (KO mice) were slow to acquire food-reinforced lever-press, but could lever press similar to controls on a progressive-ratio schedule. Both WT and KO mice discriminated between reward and no-reward levers; however, KO mice failed to discriminate based on relative quantity- reward (1 versus 3 food pellets) or effort (3 versus 9 lever presses)...
July 20, 2017: Journal of Neuroscience: the Official Journal of the Society for Neuroscience
https://www.readbyqxmd.com/read/28727514/similarity-in-romantic-couples-drinking-motivations-and-drinking-behaviours
#4
Ivy-Lee L Kehayes, Sean P Mackinnon, Simon B Sherry, Kenneth E Leonard, Sherry H Stewart
BACKGROUND: Research suggests enhancement, conformity, social, coping-with-anxiety, and coping-with-depression drinking motives are linked to specific drinking outcomes in a theoretically-expected manner. Social learning theory suggests people who spend more time together emulate each other's behaviour to acquire reinforcing outcomes. The present study sought to integrate drinking motives theory and social learning theory to investigate similarity in drinking behaviours and drinking motives in romantic couples...
July 20, 2017: Substance Abuse
https://www.readbyqxmd.com/read/28726560/bayesian-methods-for-addressing-long-standing-problems-in-associative-learning-the-case-of-pree
#5
Fernando Blanco, Joaquín Moris
Most associative models typically assume that learning can be understood as a gradual change in associative strength that captures the situation into one single parameter, or representational state. We will call this view single-state learning. However, there is ample evidence showing that under many circumstances different relationships that share features can be learned independently, and animals can quickly switch between expressing one or another. We will call this multiple-state learning. Theoretically, it is understudied because it needs a different data analysis approach from those usually employed...
July 20, 2017: Quarterly Journal of Experimental Psychology: QJEP
https://www.readbyqxmd.com/read/28723943/stress-enhances-model-free-reinforcement-learning-only-after-negative-outcome
#6
Heyeon Park, Daeyeol Lee, Jeanyung Chey
Previous studies found that stress shifts behavioral control by promoting habits while decreasing goal-directed behaviors during reward-based decision-making. It is, however, unclear how stress disrupts the relative contribution of the two systems controlling reward-seeking behavior, i.e. model-free (or habit) and model-based (or goal-directed). Here, we investigated whether stress biases the contribution of model-free and model-based reinforcement learning processes differently depending on the valence of outcome, and whether stress alters the learning rate, i...
2017: PloS One
https://www.readbyqxmd.com/read/28721169/investigating-an-outbreak-of-measles-in-margibi-county-liberia-october-2015
#7
Joseph Asamoah Frimpong, Maame Pokuah Amo-Addae, Peter Adebayo Adewuyi, Meeyoung Mattie Park, Casey Daniel Hall, Thomas Knue Nagbe
The emergence and re-emergence of infectious diseases highlights the need to have well-trained field epidemiologists who will be at the forefront in the fight against these diseases, especially during an outbreak. Training for outbreak investigation is most effective when participants can develop their competencies in a practical exercise. To that end, this case study was based on a measles outbreak investigation conducted in Liberia during October 2015 by Liberia Frontline Field Epidemiology Training Program (FETP) residents, simulating steps to perform outbreak investigation in a real-life situation as a field epidemiologist...
2017: Pan African Medical Journal
https://www.readbyqxmd.com/read/28720405/persistent-cognitive-and-morphological-alterations-induced-by-repeated-exposure-of-adolescent-rats-to-the-abused-inhalant-toluene
#8
K M Braunscheidel, J T Gass, P J Mulholland, S B Floresco, J J Woodward
While thepsychoactive inhalant toluene causes behavioral effects similarto those produced by other drugs of abuse, the persistent behavioral and anatomical abnormalities induced by toluene exposure are not well known. To mimic human "binge-like" inhalant intoxication, adolescent, male Sprague-Dawley rats were exposed to toluene vapor (57000 ppm) twice daily for five consecutive days. These rats remained in their home cages until adulthood (P60), when they were trained in operant boxes to respond to a palatable food reward and then challenged with several different cognitive tasks...
July 15, 2017: Neurobiology of Learning and Memory
https://www.readbyqxmd.com/read/28719661/minimizing-endpoint-variability-through-reinforcement-learning-during-reaching-movements-involving-shoulder-elbow-and-wrist
#9
David Marc Anton Mehler, Alexandra Reichenbach, Julius Klein, Jörn Diedrichsen
Reaching movements are comprised of the coordinated action across multiple joints. The human skeleton is redundant for this task because different joint configurations can lead to the same endpoint in space. How do people learn to use combinations of joints that maximize success in goal-directed motor tasks? To answer this question, we used a 3-degree-of-freedom manipulandum to measure shoulder, elbow and wrist joint movements during reaching in a plane. We tested whether a shift in the relative contribution of the wrist and elbow joints to a reaching movement could be learned by an implicit reinforcement regime...
2017: PloS One
https://www.readbyqxmd.com/read/28719247/telehealth-in-schools-using-a-systematic-educational-model-based-on-fiction-screenplays-interactive-documentaries-and-three-dimensional-computer-graphics
#10
Diogo Julien Miranda, Chao Lung Wen
BACKGROUND: Preliminary studies suggest the need of a global vision in academic reform, leading to education re-invention. This would include problem-based education using transversal topics, developing of thinking skills, social interaction, and information-processing skills. We aimed to develop a new educational model in health with modular components to be broadcast and applied as a tele-education course. MATERIALS AND METHODS: We developed a systematic model based on a "Skills and Goals Matrix" to adapt scientific contents on fictional screenplays, three-dimensional (3D) computer graphics of the human body, and interactive documentaries...
July 18, 2017: Telemedicine Journal and E-health: the Official Journal of the American Telemedicine Association
https://www.readbyqxmd.com/read/28716096/optimizing-reproducibility-of-operant-testing-through-reinforcer-standardization-identification-of-key-nutritional-constituents-determining-reward-strength-in-touchscreens
#11
Eun Woo Kim, Benjamin U Phillips, Christopher J Heath, So Yeon Cho, Hyunjeong Kim, Jemeen Sreedharan, Ho-Taek Song, Jong Eun Lee, Timothy J Bussey, Chul Hoon Kim, Eosu Kim, Lisa M Saksida
Reliable and reproducible assessment of animal learning and behavior is a central aim of basic and translational neuroscience research. Recent developments in automated operant chamber technology have led to the possibility of universal standard protocols, in addition to increased translational potential, reliability and accuracy. However, the impact of regional and national differences in the supplies of available reinforcers in this system on behavioural performance and inter-laboratory variability is an unknown and at present uncontrolled variable...
July 17, 2017: Molecular Brain
https://www.readbyqxmd.com/read/28710060/values-affirmation-intervention-reduces-achievement-gap-between-underrepresented-minority-and-white-students-in-introductory-biology-classes
#12
Hannah Jordt, Sarah L Eddy, Riley Brazil, Ignatius Lau, Chelsea Mann, Sara E Brownell, Katherine King, Scott Freeman
Achievement gaps between underrepresented minority (URM) students and their white peers in college science, technology, engineering, and mathematics classrooms are persistent across many white-majority institutions of higher education. Attempts to reduce this phenomenon of underperformance through increasing classroom structure via active learning have been partially successful. In this study, we address the hypothesis that the achievement gap between white and URM students in an undergraduate biology course has a psychological and emotional component arising from stereotype threat...
2017: CBE Life Sciences Education
https://www.readbyqxmd.com/read/28707569/imaginative-reinforcement-learning-computational-principles-and-neural-mechanisms
#13
Samuel J Gershman, Jimmy Zhou, Cody Kommers
Imagination enables us not only to transcend reality but also to learn about it. In the context of reinforcement learning, an agent can rationally update its value estimates by simulating an internal model of the environment, provided that the model is accurate. In a series of sequential decision-making experiments, we investigated the impact of imaginative simulation on subsequent decisions. We found that imagination can cause people to pursue imagined paths, even when these paths are suboptimal. This bias is systematically related to participants' optimism about how much reward they expect to receive along imagined paths; providing feedback strongly attenuates the effect...
July 14, 2017: Journal of Cognitive Neuroscience
https://www.readbyqxmd.com/read/28706499/valence-dependent-belief-updating-computational-validation
#14
Bojana Kuzmanovic, Lionel Rigoux
People tend to update beliefs about their future outcomes in a valence-dependent way: they are likely to incorporate good news and to neglect bad news. However, belief formation is a complex process which depends not only on motivational factors such as the desire for favorable conclusions, but also on multiple cognitive variables such as prior beliefs, knowledge about personal vulnerabilities and resources, and the size of the probabilities and estimation errors. Thus, we applied computational modeling in order to test for valence-induced biases in updating while formally controlling for relevant cognitive factors...
2017: Frontiers in Psychology
https://www.readbyqxmd.com/read/28704216/low-back-pain-patients-learn-to-adapt-motor-behavior-with-adverse-secondary-consequences
#15
Jaap H van Dieën, Herta Flor, Paul W Hodges
We hypothesize that changes in motor behavior in individuals with low-back pain are adaptations aimed at minimizing the real or perceived risk of further pain. Through reinforcement learning, pain and subsequent adaptions result in less dynamic motor behavior, leading to increased loading and impoverished sensory feedback, which contributes to cortical reorganization and proprioceptive impairments that reduce the ability to control lumbar movement in a robust manner.
July 12, 2017: Exercise and Sport Sciences Reviews
https://www.readbyqxmd.com/read/28700627/network-analysis-of-exploratory-behaviors-of-mice-in-a-spatial-learning-and-memory-task
#16
Yusuke Suzuki, Itaru Imayoshi
The Barnes maze is one of the main behavioral tasks used to study spatial learning and memory. The Barnes maze is a task conducted on "dry land" in which animals try to escape from a brightly lit exposed circular open arena to a small dark escape box located under one of several holes at the periphery of the arena. In comparison with another classical spatial learning and memory task, the Morris water maze, the negative reinforcements that motivate animals in the Barnes maze are less severe and less stressful...
2017: PloS One
https://www.readbyqxmd.com/read/28697690/try-and-try-again-post-error-boost-of-an-implicit-measure-of-agency
#17
S Di Costa, H Théro, V Chambon, P Haggard
The sense of agency refers to the feeling that we control our actions and, through them, effects in the outside world. Reinforcement learning provides an important theoretical framework for understanding why people choose to make particular actions. Few previous studies have considered how reinforcement and learning might influence the subjective experience of agency over actions and outcomes. In two experiments, participants chose between two action alternatives, which differed in reward probability. Occasional reversals of action-reward mapping required participants to monitor outcomes and adjust action selection processing accordingly...
July 12, 2017: Quarterly Journal of Experimental Psychology: QJEP
https://www.readbyqxmd.com/read/28696337/memristive-device-based-learning-for-navigation-in-robots
#18
Mohammad Sarim, Manish Kumar, Rashmi Jha, Ali A Minai
Biomimetic robots have gained attention recently for various applications ranging from resource hunting to search and rescue operations during disasters. Biological species are known to intuitively learn from the environment, gather and process data, and make appropriate decisions. Such sophisticated computing capabilities in robots are difficult to achieve, especially if done in real-time with ultra- low energy consumption. Here, we present a novel memristive device based learning architecture for robots. Two terminal memristive devices with resistive switching of oxide layer are modeled in a crossbar array to develop a neuromorphic platform that can impart active real-time learning capabilities in a robot...
July 11, 2017: Bioinspiration & Biomimetics
https://www.readbyqxmd.com/read/28691905/effects-of-dopamine-on-reinforcement-learning-and-consolidation-in-parkinson-s-disease
#19
John P Grogan, Demitra Tsivos, Laura Smith, Brogan E Knight, Rafal Bogacz, Alan Whone, Elizabeth J Coulthard
Emerging evidence suggests that dopamine may modulate learning and memory with important implications for understanding the neurobiology of memory and future therapeutic targeting. An influential hypothesis posits that dopamine biases reinforcement learning. More recent data also suggest an influence during both consolidation and retrieval. Eighteen Parkinson's disease patients learned through feedback ON or OFF medication with memory tested 24 hours later ON or OFF medication (4 conditions, within-subjects design with matched healthy control group)...
July 10, 2017: ELife
https://www.readbyqxmd.com/read/28690434/don-t-believe-the-gripe-increasing-course-structure-in-a-large-non-majors-neuroscience-course
#20
Anastasia Nagel, Andrea Nicholas
Active teaching is increasingly accepted as a better option for higher education STEM courses than traditional lecture-based instruction. However, concerns remain regarding student preferences and the impact of increased course structure on teaching evaluations. Undergraduates in a non-majors neuropharmacology course were enrolled in an enriched blended course format, providing online case-based learning opportunities in a large lecture hall setting. Students working in small assigned groups solved weekly case studies developed to teach basic neuropharmacology concepts...
2017: Journal of Undergraduate Neuroscience Education: JUNE: a Publication of FUN, Faculty for Undergraduate Neuroscience
keyword
keyword
23454
1
2
Fetch more papers »
Fetching more papers... Fetching...
Read by QxMD. Sign in or create an account to discover new knowledge that matter to you.
Remove bar
Read by QxMD icon Read
×

Search Tips

Use Boolean operators: AND/OR

diabetic AND foot
diabetes OR diabetic

Exclude a word using the 'minus' sign

Virchow -triad

Use Parentheses

water AND (cup OR glass)

Add an asterisk (*) at end of a word to include word stems

Neuro* will search for Neurology, Neuroscientist, Neurological, and so on

Use quotes to search for an exact phrase

"primary prevention of cancer"
(heart or cardiac or cardio*) AND arrest -"American Heart Association"