Reference: Fayyad et al. 1996
- Model Representation is the language L for describing discoverable patterns.
- Model Evaluation estimates how well a particular pattern (a model and its parameters) meet the criteria of the KDD process. Evaluation of predictive accuracy (validity) is based on cross validation. Evaluation of descriptive quality involves predictive accuracy, novelty, utility, and understandability of the fitted model. Both logical and statistical criteria can be used for model evaluation.
- Search Method consists of two components:
- In parameter search, the algorithm must search for the parameters which optimize the model evaluation criteria given observed data and a fixed model representation.
- Model search occurs as a loop over the parameter search method: the model representation is changed so that a family of models are considered.
Popular Data Mining Methods
Reference: Fayyad et al. 1996
- Decision Trees and Rules
- Nonlinear Regression and Classification Methods
- Example-based Methods
- Probabilistic Graphical Dependency Models
- Relational Learning Models