Next:
Agents
Up:
415.706FC Datamining and Machine
Previous:
Summary Cont.
Reinforcement Learning
Agents
General Problem
Reinforcement Learning Problems
Learning Task
Optimal Policy
Grid-world
Finding Optimal Policies
Q Learning
Q Learning Properties
Q learning Algorithm
Illustrative Example
Convergence
Experimentation Strategies
Updating Sequence
Nondeterministic Rewards and Actions
Temporal Difference Learning
Generalizing from Examples
Relationship to Dynamic Programming
Summary
Patricia Riddle
Fri May 15 13:00:36 NZST 1998