SluitenHelpPrint
Switch to English
Cursus: INFOMAA
INFOMAA
Multi-agent learning
Cursus informatie
CursuscodeINFOMAA
Studiepunten (EC)7,5
Cursusdoelen
Upon successful completion of this course, the student:
  • has knowledge of  the main algorithms for multi-agent learning;
  • is able to apply these algorithms in formal settings such as strategic form games;
  • has insight into the meta-theoretic properties of the studied algorithms and can verify simple meta-theoretic algorithms;
  • has knowledge of the relations between different algorithms for multi-agent learning, and can verify simple relations;
  • is able to evaluate the suitability of the studied algorithms for modeling different types of multi-agent learning.
Inhoud
Multi-agent learning (MAL) studies software agents that learn and adapt to the behaviour of other software agents, that themselves adapt to the behaviour of other software agents, and so on. The presence of other learning agents complicates learning, which makes the environment non-stationary (moving target) and non-Markovian (not only experiences from the immediate past but also earlier experiences are relevant). With adaptive agents it also becomes less beneficent to only adapt to the behaviour of other agents, on the pain of being exploited by more steadfast agents that do not follow but instead impose their strategy on others. Important topics of adaptive agents include statistical learning and single-agent reinforcement learning. Important topics of MAL include (evolutionary) game theory, fictitious play, gradient dynamics, no-regret learning, multi-agent reinforcement learning (MinMax-Q, Nash-Q), leader (teacher) vs. follower (learner) adaptation, and the emergence of social conventions. Examples of domains that need robust MAL algorithms include manufacturing systems (managers of a factory coordinate to maximise their profit), distributed sensor networks (multiple sensors collaborate to perform a large-scale sensing task under strict power constraints), robo-soccer, disaster rescue (robots must safely find victims as fast as possible after an earthquake) and recreational games of imperfect information such as poker. Indeed, poker and simplified forms of poker are an important topics of research in multi-agent learning.
SluitenHelpPrint
Switch to English