Reinforcement Learning: Exploration - Exploitation Dilema in Multi Agent Foraging Task (Record no. 90357)
[ view plain ]
000 -LEADER | |
---|---|
fixed length control field | 01697naa a2200193 4500 |
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION | |
fixed length control field | 160926b xxu||||| |||| 00| 0 eng d |
100 ## - MAIN ENTRY--PERSONAL NAME | |
Personal name | Yogeswaran, Mohan |
245 ## - TITLE STATEMENT | |
Title | Reinforcement Learning: Exploration - Exploitation Dilema in Multi Agent Foraging Task |
260 ## - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT) | |
Place of publication, distribution, etc | |
Name of publisher, distributor, etc | |
Date of publication, distribution, etc | |
300 ## - PHYSICAL DESCRIPTION | |
Extent | 49 (3) Jul-Sep 2012, 223-236p. |
520 ## - Remark | |
Summary, etc | The exploration–exploitation dilemma has been an unresolved issue within the framework of multi-agent reinforcement learning. The agents have to explore in order to improve the state which potentially yields higher rewards in the future or exploit the state that yields the highest reward based on the existing knowledge. Pure exploration degrades the agent’s learning but increases the flexibility of the agent to adapt in a dynamic environment. On the other hand pure exploitation drives the agent’s learning process to locally optimal solutions. Various learning policies have been studied to address this issue. This paper presents critical experimental results on a number of learning policies reported in the open literatures. Learning policies namely greedy, ξ-greedy, Boltzmann Distribution (BD), Simulated Annealing (SA), Probability Matching (PM) and Optimistic Initial Values (OIV) are implemented to study on their performances on a multi-agent foraging-task modelled. Based on the numerical results that were obtained, the performances of the learning policies are discussed.<br/><br/> |
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name as entry element | Reinforcement |
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name as entry element | Q Learnign |
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name as entry element | Learning Policies |
700 ## - ADDED ENTRY--PERSONAL NAME | |
Personal name | Exploration - Exploitation Dilema |
773 0# - HOST ITEM ENTRY | |
Place, publisher, and date of publication | |
Other item identifier | B-2508 |
Title | BV- Opsearch (Jan - Dec 2012) |
906 ## - LOCAL DATA ELEMENT F, LDF (RLIN) | |
a | General Management |
942 ## - ADDED ENTRY ELEMENTS (KOHA) | |
Source of classification or shelving scheme | Dewey Decimal Classification |
Item type | Articles |
Withdrawn status | Lost status | Source of classification or shelving scheme | Damaged status | Not for loan | Home library | Current library | Date acquired | Programme | Barcode | Date last seen | Koha item type | Collection Type | Subject type |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Dewey Decimal Classification | Main Library | Main Library | 26/09/2016 | AR16014 | 26/09/2016 | Articles | Indian Book | General Management |