Looking for the Right Time to Shift Strategy in the Exploration-exploitation Dilemma
cytuj
pobierz pliki
RIS BIB ENDNOTEChoose format
RIS BIB ENDNOTELooking for the Right Time to Shift Strategy in the Exploration-exploitation Dilemma
Publication date: 11.04.2016
Schedae Informaticae, 2015, Volume 24, pp. 73 - 82
https://doi.org/10.4467/20838476SI.15.007.3029Authors
Looking for the Right Time to Shift Strategy in the Exploration-exploitation Dilemma
Balancing exploratory and exploitative behavior is an essential dilemma faced by adaptive agents. The challenge of finding a good trade-off between exploration (learn new things) and exploitation (act optimally based on what is already known) has been largely studied for decision-making problems where the agent must learn a policy of actions. In this paper we propose the engaged climber method, designed for solving the exploration-exploitation dilemma. The solution consists in explicitly creating two different policies (for exploring or for exploiting), and to determine the good moments to shift from the one to the other by the use of notions like engagement and curiosity.
Information: Schedae Informaticae, 2015, Volume 24, pp. 73 - 82
Article type: Original article
Titles:
Looking for the Right Time to Shift Strategy in the Exploration-exploitation Dilemma
Looking for the Right Time to Shift Strategy in the Exploration-exploitation Dilemma
IRIT University of Toulouse 1 Capitole
Published at: 11.04.2016
Article status: Open
Licence: None
Percentage share of authors:
Article corrections:
-Publication languages:
EnglishView count: 2050
Number of downloads: 1517