Topics in Cognitive Science 7 (2):351-367 (2015)
Abstract | Decision making in noisy and changing environments requires a fine balance between exploiting knowledge about good courses of action and exploring the environment in order to improve upon this knowledge. We present an experiment on a restless bandit task in which participants made repeated choices between options for which the average rewards changed over time. Comparing a number of computational models of participants’ behavior in this task, we find evidence that a substantial number of them balanced exploration and exploitation by considering the probability that an option offers the maximum reward out of all the available options. |
Keywords | Dynamic decision making; Volatility; Uncertainty; Exploration‐exploitation trade‐off; Restless multi‐armed bandit task |
DOI | 10.1111/tops.12145 |
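The choice rule named in the abstract, choosing with regard to the probability that an option offers the maximum reward, can be illustrated with a rough sketch. This is not the authors' model: it assumes, purely for illustration, that beliefs about each option's average reward are independent Gaussians, and it estimates the probabilities by Monte Carlo sampling. The function name `prob_max` and its parameters are hypothetical.

```python
import random


def prob_max(means, sds, n_samples=10000, seed=0):
    """Estimate, for each option, the probability that it has the highest reward.

    Assumption (not from the paper): beliefs about each option's average
    reward are independent Gaussians with the given means and standard
    deviations. The estimate is a simple Monte Carlo frequency count.
    """
    rng = random.Random(seed)
    counts = [0] * len(means)
    for _ in range(n_samples):
        # Draw one plausible reward per option from the current beliefs.
        draws = [rng.gauss(m, s) for m, s in zip(means, sds)]
        # Record which option came out on top in this draw.
        best = max(range(len(draws)), key=draws.__getitem__)
        counts[best] += 1
    return [c / n_samples for c in counts]


if __name__ == "__main__":
    # Option 0 has the higher believed mean; option 1 is more uncertain.
    probs = prob_max(means=[5.0, 4.0], sds=[0.5, 3.0])
    print(probs)
```

Under these assumptions, an option with a lower believed mean but higher uncertainty still receives a non-trivial probability of being the best, so choosing in proportion to these probabilities produces exploration without a separate exploration parameter.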
Analytics
Added to PP index
2015-04-21
Total views
39 ( #292,926 of 2,518,239 )
Recent downloads (6 months)
1 ( #408,577 of 2,518,239 )