Английская Википедия:Intrinsic motivation (artificial intelligence)

Шаблон:Short description Intrinsic motivation in the study of artificial intelligence and robotics is a mechanism for enabling artificial agents (including robots) to exhibit inherently rewarding behaviours such as exploration and curiosity, grouped under the same term in the study of psychology. Psychologists consider intrinsic motivation in humans to be the drive to perform an activity for inherent satisfaction – just for the fun or challenge of it.^[1]

Definition

An intelligent agent is intrinsically motivated to act if the information content alone, or the experience resulting from the action, is the motivating factor.

Information content in this context is measured in the information-theoretic sense of quantifying uncertainty. A typical intrinsic motivation is to search for unusual, surprising situations (exploration), in contrast to a typical extrinsic motivation such as the search for food (homeostasis).^[2] Extrinsic motivations are typically described in artificial intelligence as task-dependent or goal-directed.

Origins in psychology

The study of intrinsic motivation in psychology and neuroscience began in the 1950s with some psychologists explaining exploration through drives to manipulate and explore, however, this homeostatic view was criticised by White.^[3] An alternative explanation from Berlyne in 1960 was the pursuit of an optimal balance between novelty and familiarity.^[4] Festinger described the difference between internal and external view of the world as dissonance that organisms are motivated to reduce.^[5] A similar view was expressed in the '70s by Kagan as the desire to reduce the incompatibility between cognitive structure and experience.^[6] In contrast to the idea of optimal incongruity, Deci and Ryan identified in the mid 80's an intrinsic motivation based on competence and self-determination.^[7]

Computational models

An influential early computational approach to implement artificial curiosity in the early 1990s by Schmidhuber, has since been developed into a "Formal theory of creativity, fun, and intrinsic motivation”.^[8]

Intrinsic motivation is often studied in the framework of computational reinforcement learning^[9]^[10] (introduced by Sutton and Barto), where the rewards that drive agent behaviour are intrinsically derived rather than externally imposed and must be learnt from the environment.^[11] Reinforcement learning is agnostic to how the reward is generated - an agent will learn a policy (action strategy) from the distribution of rewards afforded by actions and the environment. Each approach to intrinsic motivation in this scheme is essentially a different way of generating the reward function for the agent.

Curiosity vs. exploration

Intrinsically motivated artificial agents exhibit behaviour that resembles curiosity or exploration. Exploration in artificial intelligence and robotics has been extensively studied in reinforcement learning models,^[12] usually by encouraging the agent to explore as much of the environment as possible, to reduce uncertainty about the dynamics of the environment (learning the transition function) and how best to achieve its goals (learning the reward function). Intrinsic motivation, in contrast, encourages the agent to first explore aspects of the environment that confer more information, to seek out novelty. Recent work unifying state visit count exploration and intrinsic motivation has shown faster learning in a video game setting.^[13]

Types of models

Ouedeyer and Kaplan have made a substantial contribution to the study of intrinsic motivation.^[14]^[2]^[15] They define intrinsic motivation based on Berlyne's theory,^[4] and divide approaches to the implementation of intrinsic motivation into three categories that broadly follow the roots in psychology: "knowledge-based models", "competence-based models" and "morphological models".^[2] Knowledge-based models are further subdivided into "information-theoretic" and "predictive".^[15] Baldassare and Mirolli present a similar typology, differentiating knowledge-based models between prediction-based and novelty-based.^[16]

Information-theoretic intrinsic motivation

The quantification of prediction and novelty to drive behaviour is generally enabled through the application of information-theoretic models, where agent state and strategy (policy) over time are represented by probability distributions describing a markov decision process and the cycle of perception and action treated as an information channel.^[17]^[18] These approaches claim biological feasibility as part of a family of bayesian approaches to brain function. The main criticism and difficulty of these models is the intractability of computing probability distributions over large discrete or continuous state spaces.^[2] Nonetheless, a considerable body of work has built up modelling the flow of information around the sensorimotor cycle, leading to de facto reward functions derived from the reduction of uncertainty, including most notably active inference,^[19] but also infotaxis,^[20] predictive information,^[21]^[22] and empowerment.^[23]

Competence-based models

Steels' autotelic principle ^[24] is an attempt to formalise flow (psychology).^[25]

Achievement, affiliation and power models

Other intrinsic motives that have been modelled computationally include achievement, affiliation and power motivation.^[26] These motives can be implemented as functions of probability of success or incentive. Populations of agents can include individuals with different profiles of achievement, affiliation and power motivation, modelling population diversity and explaining why different individuals take different actions when faced with the same situation.

Beyond achievement, affiliation and power

A more recent computational theory of intrinsic motivation attempts to explain a large variety of psychological findings based on such motives. Notably this model of intrinsic motivation goes beyond just achievement, affiliation and power, by taking into consideration other important human motives. Empirical data from psychology were computationally simulated and accounted for using this model.^[27]

Intrinsically Motivated Learning

Intrinsically motivated (or curiosity-driven) learning is an emerging research topic in artificial intelligence and developmental robotics^[28] that aims to develop agents that can learn general skills or behaviours, that can be deployed to improve performance in extrinsic tasks, such as acquiring resources.^[29] Intrinsically motivated learning has been studied as an approach to autonomous lifelong learning in machines^[30]^[31] and open-ended learning in computer game characters.^[32] In particular, when the agent learns a meaningful abstract representation, a notion of distance between two representations can be used to gauge novelty, hence allowing for an efficient exploration of its environment.^[33] Despite the impressive success of deep learning in specific domains (e.g. AlphaGo), many in the field (e.g. Gary Marcus) have pointed out that the ability to generalise remains a fundamental challenge in artificial intelligence. Intrinsically motivated learning, although promising in terms of being able to generate goals from the structure of the environment without externally imposed tasks, faces the same challenge of generalisation – how to reuse policies or action sequences, how to compress and represent continuous or complex state spaces and retain and reuse the salient features that have been learnt.^[29]

References

Шаблон:Reflist

↑ Ошибка цитирования Неверный тег <ref>; для сносок ryan2000 не указан текст
↑ ^2,0 ^2,1 ^2,2 ^2,3 Ошибка цитирования Неверный тег <ref>; для сносок oudeyer2008 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок white1959 не указан текст
↑ ^4,0 ^4,1 Ошибка цитирования Неверный тег <ref>; для сносок Berlyne1960 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок festinger1957 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок kagan1972 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок deci1985 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок schmidhuber2010 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок barto2004 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок singh2005 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок barto2012 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок thrun1992 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок bellemare2016 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок kaplan2004 не указан текст
↑ ^15,0 ^15,1 Ошибка цитирования Неверный тег <ref>; для сносок oudeyer2009 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок baldassarre2013 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок klyubin2008 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок biehl2018 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок friston2006 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок vergassola не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок ay2008 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок martius2013 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок salge2014 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок steels2004 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок csik2000 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок merrick2016 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок sbd2022 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок lungarella2003 не указан текст
↑ ^29,0 ^29,1 Ошибка цитирования Неверный тег <ref>; для сносок santucci2020 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок barto2013 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок mirolli2013 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок merrick2009 не указан текст
↑ Ошибка цитирования Неверный тег <ref>; для сносок tao2020novelty не указан текст

[ryan2000-1] Ошибка цитирования Неверный тег <ref>; для сносок ryan2000 не указан текст

[oudeyer2008-2] 2,0 ^2,1 ^2,2 ^2,3 Ошибка цитирования Неверный тег <ref>; для сносок oudeyer2008 не указан текст

[white1959-3] Ошибка цитирования Неверный тег <ref>; для сносок white1959 не указан текст

[Berlyne1960-4] 4,0 ^4,1 Ошибка цитирования Неверный тег <ref>; для сносок Berlyne1960 не указан текст

[festinger1957-5] Ошибка цитирования Неверный тег <ref>; для сносок festinger1957 не указан текст

[kagan1972-6] Ошибка цитирования Неверный тег <ref>; для сносок kagan1972 не указан текст

[deci1985-7] Ошибка цитирования Неверный тег <ref>; для сносок deci1985 не указан текст

[schmidhuber2010-8] Ошибка цитирования Неверный тег <ref>; для сносок schmidhuber2010 не указан текст

[barto2004-9] Ошибка цитирования Неверный тег <ref>; для сносок barto2004 не указан текст

[singh2005-10] Ошибка цитирования Неверный тег <ref>; для сносок singh2005 не указан текст

[barto2012-11] Ошибка цитирования Неверный тег <ref>; для сносок barto2012 не указан текст

[thrun1992-12] Ошибка цитирования Неверный тег <ref>; для сносок thrun1992 не указан текст

[bellemare2016-13] Ошибка цитирования Неверный тег <ref>; для сносок bellemare2016 не указан текст

[kaplan2004-14] Ошибка цитирования Неверный тег <ref>; для сносок kaplan2004 не указан текст

[oudeyer2009-15] 15,0 ^15,1 Ошибка цитирования Неверный тег <ref>; для сносок oudeyer2009 не указан текст

[baldassarre2013-16] Ошибка цитирования Неверный тег <ref>; для сносок baldassarre2013 не указан текст

[klyubin2008-17] Ошибка цитирования Неверный тег <ref>; для сносок klyubin2008 не указан текст

[biehl2018-18] Ошибка цитирования Неверный тег <ref>; для сносок biehl2018 не указан текст

[friston2006-19] Ошибка цитирования Неверный тег <ref>; для сносок friston2006 не указан текст

[vergassola-20] Ошибка цитирования Неверный тег <ref>; для сносок vergassola не указан текст

[ay2008-21] Ошибка цитирования Неверный тег <ref>; для сносок ay2008 не указан текст

[martius2013-22] Ошибка цитирования Неверный тег <ref>; для сносок martius2013 не указан текст

[salge2014-23] Ошибка цитирования Неверный тег <ref>; для сносок salge2014 не указан текст

[steels2004-24] Ошибка цитирования Неверный тег <ref>; для сносок steels2004 не указан текст

[csik2000-25] Ошибка цитирования Неверный тег <ref>; для сносок csik2000 не указан текст

[merrick2016-26] Ошибка цитирования Неверный тег <ref>; для сносок merrick2016 не указан текст

[sbd2022-27] Ошибка цитирования Неверный тег <ref>; для сносок sbd2022 не указан текст

[lungarella2003-28] Ошибка цитирования Неверный тег <ref>; для сносок lungarella2003 не указан текст

[santucci2020-29] 29,0 ^29,1 Ошибка цитирования Неверный тег <ref>; для сносок santucci2020 не указан текст

[barto2013-30] Ошибка цитирования Неверный тег <ref>; для сносок barto2013 не указан текст

[mirolli2013-31] Ошибка цитирования Неверный тег <ref>; для сносок mirolli2013 не указан текст

[merrick2009-32] Ошибка цитирования Неверный тег <ref>; для сносок merrick2009 не указан текст

[tao2020novelty-33] Ошибка цитирования Неверный тег <ref>; для сносок tao2020novelty не указан текст

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

[33]

Партнерские ресурсы
Криптовалюты	Обмен криптовалют - www.bestchange.ru Криптовалютная биржа CoinEx Криптовалютная биржа Binance HIVE OS - операционная система для майнинга e4pool - Мультивалютный пул для майнинга.
Магазины	AliExpress — глобальная виртуальная (в Интернете) торговая площадка, предоставляющая возможность покупать товары производителей из КНР; computeruniverse.net - Интернет-магазин компьютеров(Промо код 5 Евро на первую покупку:FWWC3ZKQ);
Хостинг	DigitalOcean - американский провайдер облачных инфраструктур, с главным офисом в Нью-Йорке и с центрами обработки данных по всему миру;
Разное	Викиум - Онлайн-тренажер для мозга Like Центр - Центр поддержки и развития предпринимательства. Gamersbay - лучший магазин по бустингу для World of Warcraft. Ноотропы OmniMind N°1 - Усиливает мозговую активность. Повышает мотивацию. Улучшает память. Санкт-Петербургская школа телевидения - это федеральная сеть образовательных центров, которая имеет филиалы в 37 городах России. Lingualeo.com — интерактивный онлайн-сервис для изучения и практики английского языка в увлекательной игровой форме. Junyschool (Джунискул) – международная школа программирования и дизайна для детей и подростков от 5 до 17 лет, где ученики осваивают компьютерную грамотность, развивают алгоритмическое и креативное мышление, изучают основы программирования и компьютерной графики, создают собственные проекты: игры, сайты, программы, приложения, анимации, 3D-модели, монтируют видео. Умназия - Интерактивные онлайн-курсы и тренажеры для развития мышления детей 6-13 лет SkillBox - это один из лидеров российского рынка онлайн-образования. Среди партнеров Skillbox ведущий разработчик сервисного дизайна AIC, медиа-компания Yoola, первое и самое крупное русскоязычное аналитическое агентство Tagline, онлайн-школа дизайна и иллюстрации Bang! Bang! Education, оператор PR-рынка PACO, студия рисования Draw&Go, агентство performance-маркетинга Ingate, scrum-студия Sibirix, имидж-лаборатория Персона. «Нетология» — это университет по подготовке и дополнительному обучению специалистов в области интернет-маркетинга, управления проектами и продуктами, дизайна, Data Science и разработки. В рамках Нетологии студенты получают ценные теоретические знания от лучших экспертов Рунета, выполняют практические задания на отработку полученных навыков, общаются с экспертами и единомышленниками. Познакомиться со всеми продуктами подробнее можно на сайте https://netology.ru, линейка курсов и профессий постоянно обновляется. StudyBay Brazil – это онлайн биржа для португалоговорящих студентов и авторов! Студент получает уникальную работу любого уровня сложности и больше свободного времени, в то время как у автора появляется дополнительный заработок и бесценный опыт. Автор24 — самая большая в России площадка по написанию учебных работ: контрольные и курсовые работы, дипломы, рефераты, решение задач, отчеты по практике, а так же любой другой вид работы. Сервис сотрудничает с более 70 000 авторов. Более 1 000 000 работ уже выполнено. StudyBay – это онлайн биржа для англоязычных студентов и авторов! Студент получает уникальную работу любого уровня сложности и больше свободного времени, в то время как у автора появляется дополнительный заработок и бесценный опыт.

Английская Википедия:Intrinsic motivation (artificial intelligence)

Содержание

Definition

Origins in psychology

Computational models

Curiosity vs. exploration

Types of models

Information-theoretic intrinsic motivation

Competence-based models

Achievement, affiliation and power models

Beyond achievement, affiliation and power

Intrinsically Motivated Learning

See also

References

Навигация

Действия на странице

Действия на странице

Персональные инструменты

Навигация

Поиск

Инструменты