- Assigning credit/blame for outcomes to each internal decision
- Also called the loading problem
- Loading a training set into the free parameters of the network
- Important to any learning machine attempting to improve performance in situations involving temporally extended behaviour
Two Sub-problems:
Temporal credit-assignment problem
- Assigning credit for outcomes to actions
- Concerns the times at which the actions that deserve credit were taken
- Relevant when many actions are taken and we want to know which of them were responsible for the outcome
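
As an illustration of the temporal sub-problem, the sketch below spreads a single delayed outcome back over the actions that preceded it. It uses discounted returns, which is only one common scheme and is an assumption here, not a method stated in these notes. The function name `discounted_credit` and the discount factor `gamma` are hypothetical.

```python
def discounted_credit(rewards, gamma=0.5):
    """For each time step, return the discounted sum of rewards from that step on."""
    credit = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step inherits a discounted share of later outcomes.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        credit[t] = running
    return credit


# Four actions; the only outcome (reward 1.0) arrives after the last one.
credits = discounted_credit([0.0, 0.0, 0.0, 1.0], gamma=0.5)
print(credits)  # [0.125, 0.25, 0.5, 1.0]
```

Actions taken closer in time to the outcome receive more credit. A `gamma` nearer to 1 would spread credit more evenly over earlier actions.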
Structural credit-assignment problem
- Assigning credit for actions to internal decisions
- Concerns the internal structure of the actions generated by the system
- Relevant for identifying which component should have its behaviour altered, and by how much
Important in multilayer perceptrons (MLPs) with many hidden neurons
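
As an illustration of the structural sub-problem in an MLP, the sketch below shows how one back-propagation step divides the output error among hidden neurons. It assumes a single hidden layer of tanh neurons, one linear output neuron, and a squared-error cost. None of these choices come from the notes, and the name `hidden_credit` is hypothetical.

```python
import math


def hidden_credit(x, w_hidden, w_out, target):
    """Split the output error among hidden neurons (one back-propagation step).

    x        : list of inputs
    w_hidden : one list of input weights per hidden neuron
    w_out    : one output weight per hidden neuron
    target   : desired response of the single linear output neuron
    """
    # Forward pass: tanh hidden neurons feeding a linear output neuron.
    h = [math.tanh(sum(w * xi for w, xi in zip(ws, x))) for ws in w_hidden]
    y = sum(wo * hj for wo, hj in zip(w_out, h))
    error = target - y
    # Backward pass: each hidden neuron's local gradient is the error times
    # its output weight, scaled by the tanh slope (1 - h^2).
    deltas = [(1.0 - hj * hj) * wo * error for hj, wo in zip(h, w_out)]
    return error, deltas


error, deltas = hidden_credit(
    x=[1.0, 0.5],
    w_hidden=[[0.5, -0.5], [0.3, 0.8]],
    w_out=[2.0, 0.0],
    target=1.0,
)
print(error, deltas)
```

Each entry of `deltas` indicates which hidden neuron should change and by how much. In this example the second hidden neuron has an output weight of zero, so it receives no share of the error.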