Reinforcement Learning

Sutton, Richard S.

Hardcover / 1992 / English

Product details

ISBN: 9780792392347

Published: 1992-05-31

Publisher: Kluwer Academic Publishers

Height: 235 mm

Width: 155 mm

Audience level: Research, UP, P, 05, 06

Language: English

Format: Hardcover

Number of pages: 172

Editor: Sutton, Richard S.


Online price: 3.028,-

Lowest price in the last 30 days: 3.028,-

Delivery in 7-20 days

Reinforcement learning is the learning of a mapping from situations to actions so as to maximize a scalar reward or reinforcement signal. The learner is not told which action to take, as in most forms of machine learning, but instead must discover which actions yield the highest reward by trying them. In the most interesting and challenging cases, actions may affect not only the immediate reward, but also the next situation and, through that, all subsequent rewards. These two characteristics, trial-and-error search and delayed reward, are the two most important distinguishing features of reinforcement learning.

Reinforcement learning is both a new and an old topic in AI. The term appears to have been coined by Minsky (1961) and independently in control theory by Waltz and Fu (1965). The earliest machine learning research now viewed as directly relevant was Samuel's (1959) checker player, which used temporal-difference learning to manage delayed reward much as it is used today. Of course, learning and reinforcement have been studied in psychology for almost a century, and that work has had a strong impact on the AI/engineering work. One could in fact consider all of reinforcement learning to be simply the reverse engineering of certain psychological learning processes (for example, operant conditioning and secondary reinforcement).

"Reinforcement Learning" is an edited volume of original research, comprising seven invited contributions by researchers.


Springer Book Archives

GPSR Compliance: The European Union's (EU) General Product Safety Regulation (GPSR) is a set of rules that requires consumer products to be safe and sets out our obligations to ensure this. If you have any concerns about our products, you can contact us at ProductSafety@springernature.com. If the publisher is established outside the EU, the EU authorized representative is: Springer Nature Customer Service Center GmbH, Europaplatz 3, 69115 Heidelberg, Germany, ProductSafety@springernature.com
