foundations of computational agents
Utility is a measure of preference that combines with probability.
A decision network can represent a finite stage partially observable sequential decision problem in terms of features.
An MDP can represent an infinite stage or indefinite stage sequential decision problem in terms of states.
A fully observable MDP can be solved with value iteration or policy iteration.
A dynamic decision network allows for the representation of an MDP in terms of features.