foundations of computational agents
One learning task is supervised learning, where there is a set of examples, and a set of features, partitioned into input features and target features. The aim is to predict the values of the target features from the input features.
A feature is a function from examples into a value. If is an example, and is a feature, is the value of feature for example . The domain of a feature is the set of values it can return. Note that this is the range of the function, but is traditionally called the domain.
In a supervised learning task, the learner is given
a set of input features,
a set of target features,
a set of training examples, where the values for the input features and the target features are given for each example, and
a set of test examples, where only the values for the input features are given.
The aim is to predict the values of the target features for the test examples and as-yet-unseen examples.
Figure 7.1 shows training and test examples typical of a classification task. The aim is to predict whether a person reads an article posted to a threaded discussion website given properties of the article. The input features are , , , and . There is one target feature, . The domain of is , the domain of is , and so on.
There are eighteen training examples, each of which has a value for all of the features. In this data set, , , and .
There are two test examples, and , where the user action is unknown.
Figure 7.2 shows some data for a regression task, where the aim is to predict the value of feature on examples for which the value of feature is provided. This is a regression task because is a real-valued feature. Predicting a value of for example is an interpolation problem, as its value for the input feature is between the values of the training examples. Predicting a value of for the example is an extrapolation problem, because its value is outside the range of the training examples.