Types
Regression
A supervised problem in which the outputs are continuous rather than discrete (e.g., predicting a price).
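A minimal sketch of the regression setting, assuming scikit-learn and NumPy are available (the data are made up for illustration):

```python
# Minimal regression sketch: learn a mapping to a continuous output.
import numpy as np
from sklearn.linear_model import LinearRegression

X = np.array([[1.0], [2.0], [3.0], [4.0]])   # inputs (e.g., size)
y = np.array([150.0, 200.0, 260.0, 310.0])   # continuous outputs (e.g., price)

model = LinearRegression().fit(X, y)
print(model.predict([[5.0]]))                # continuous prediction for a new input
```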
Classification
Inputs are divided into two or more classes, and the learner must produce a model that assigns unseen inputs to one or more (multi-label classification) of these classes. This is typically tackled in a supervised way.
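A minimal sketch of the supervised classification setting, using scikit-learn's built-in iris data (the choice of classifier is illustrative):

```python
# Illustrative classification sketch: fit on labeled inputs,
# then assign a class to a new input.
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression

X, y = load_iris(return_X_y=True)            # inputs and their class labels
clf = LogisticRegression(max_iter=1000).fit(X, y)
print(clf.predict(X[:1]))                    # class assigned to an input (first sample, for illustration)
```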
Clustering
A set of inputs is to be divided into groups. Unlike in classification, the groups are not known beforehand, making this typically an unsupervised task.
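A minimal sketch of the unsupervised clustering setting, assuming scikit-learn is available (the points are made up):

```python
# Illustrative clustering sketch: no labels are given; the algorithm
# discovers the group structure on its own.
import numpy as np
from sklearn.cluster import KMeans

X = np.array([[0.0, 0.1], [0.2, 0.0], [5.0, 5.1], [5.2, 4.9]])
km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(km.labels_)   # group assignments found without any ground-truth labels
```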
Density Estimation
Finds the distribution of inputs in some space.
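A minimal sketch using kernel density estimation, assuming scikit-learn is available (the sample is made up):

```python
# Illustrative density-estimation sketch: estimate the distribution
# of inputs with a kernel density estimator.
import numpy as np
from sklearn.neighbors import KernelDensity

X = np.random.default_rng(0).normal(size=(100, 1))      # made-up 1-D sample
kde = KernelDensity(kernel="gaussian", bandwidth=0.5).fit(X)
print(kde.score_samples([[0.0]]))                        # log-density at x = 0
```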
Dimensionality Reduction
Simplifies inputs by mapping them into a lower-dimensional space.
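A minimal sketch using principal component analysis, assuming scikit-learn is available:

```python
# Illustrative dimensionality-reduction sketch: map 4-D inputs into 2-D.
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA

X, _ = load_iris(return_X_y=True)            # 4 features per observation
X_2d = PCA(n_components=2).fit_transform(X)
print(X_2d.shape)                            # (150, 2): a simplified representation
```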
Kind
Parametric
Step 1: Make an assumption about the functional form, or shape, of the function f, e.g.: f is linear, so we select a linear model.
Step 2: Select a procedure to fit, or train, the model. This means estimating the beta parameters in the linear function. A common approach is (ordinary) least squares, among others.
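A minimal sketch of both steps, assuming NumPy and made-up observations:

```python
# Parametric approach: Step 1 assumes f is linear (y = b0 + b1 * x);
# Step 2 estimates the beta parameters by ordinary least squares.
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0])
y = np.array([2.1, 3.9, 6.2, 7.8])            # made-up observations

A = np.column_stack([np.ones_like(x), x])     # design matrix [1, x]
betas, *_ = np.linalg.lstsq(A, y, rcond=None)
print(betas)                                  # estimated [b0, b1]
```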
Non-Parametric
When we do not make assumptions about the form of the function f. However, since these methods do not reduce the problem of estimating f to a small number of parameters, a large number of observations is required in order to obtain an accurate estimate of f. An example is the thin-plate spline model.
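A sketch of fitting a thin-plate spline, assuming SciPy's RBFInterpolator is available (the data here are made up):

```python
# Non-parametric approach: a thin-plate spline makes no assumption
# about the functional form of f and instead lets the data decide.
import numpy as np
from scipy.interpolate import RBFInterpolator

rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(50, 2))          # many observations are needed
y = np.sin(X[:, 0]) + np.cos(X[:, 1])         # made-up target surface

spline = RBFInterpolator(X, y, kernel="thin_plate_spline")
print(spline(np.array([[0.0, 0.0]])))         # estimate of f at a new point
```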
Categories
Supervised
The computer is presented with example inputs and their desired outputs, given by a “teacher”, and the goal is to learn a general rule that maps inputs to outputs.
Unsupervised
No labels are given to the learning algorithm, leaving it on its own to find structure in its input. Unsupervised learning can be a goal in itself (discovering hidden patterns in data) or a means towards an end (feature learning).
Reinforcement Learning
A computer program interacts with a dynamic environment in which it must achieve a certain goal (such as driving a vehicle or playing a game against an opponent). The program receives feedback in the form of rewards and punishments as it navigates its problem space.
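A compact sketch of the reward-feedback loop, using tabular Q-learning on a made-up five-state corridor environment (NumPy only; the environment is invented for illustration):

```python
# Illustrative tabular Q-learning sketch: the agent is rewarded
# for reaching the rightmost state of a 5-state corridor.
import numpy as np

n_states, n_actions = 5, 2                  # actions: 0 = left, 1 = right
Q = np.zeros((n_states, n_actions))
alpha, gamma, eps = 0.5, 0.9, 0.1
rng = np.random.default_rng(0)

for _ in range(500):                        # episodes
    s = 0
    while s != n_states - 1:
        a = rng.integers(n_actions) if rng.random() < eps else int(Q[s].argmax())
        s_next = max(0, s - 1) if a == 0 else min(n_states - 1, s + 1)
        r = 1.0 if s_next == n_states - 1 else 0.0    # reward as feedback
        Q[s, a] += alpha * (r + gamma * Q[s_next].max() - Q[s, a])
        s = s_next

print(Q.argmax(axis=1))                     # learned policy: move right everywhere
```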
Approaches
Decision tree learning
Association rule learning
Artificial neural networks
Deep learning
Inductive logic programming
Support vector machines
Clustering
Bayesian networks
Reinforcement learning
Representation learning
Similarity and metric learning
Sparse dictionary learning
Genetic algorithms
Rule-based machine learning
Learning classifier systems
Taxonomy
Generative Methods
Model class-conditional pdfs and prior probabilities. “Generative” since sampling from the model can generate synthetic data points (see the sketch after the list below).
Popular models:
- Gaussians, Naïve Bayes, Mixtures of multinomials
- Mixtures of Gaussians, Mixtures of experts, Hidden Markov Models (HMM)
- Sigmoidal belief networks, Bayesian networks, Markov random fields
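A minimal generative sketch, using 1-D Gaussians as the class-conditional pdfs (NumPy only; the data and priors are made up):

```python
# Generative sketch: model class-conditional Gaussians plus priors,
# then *sample* from the model to generate synthetic data points.
import numpy as np

rng = np.random.default_rng(0)
X0 = rng.normal(loc=0.0, scale=1.0, size=200)    # made-up class-0 data
X1 = rng.normal(loc=3.0, scale=1.0, size=200)    # made-up class-1 data

# Fit class-conditional pdfs (here: 1-D Gaussians) and prior probabilities.
params = {c: (X.mean(), X.std()) for c, X in ((0, X0), (1, X1))}
priors = {0: 0.5, 1: 0.5}

# "Generative": draw a class from the prior, then a point from its pdf.
c = rng.choice([0, 1], p=[priors[0], priors[1]])
mu, sigma = params[c]
print(c, rng.normal(mu, sigma))                  # a synthetic labeled point
```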
Discriminative Methods
Directly estimate posterior probabilities. No attempt to model the underlying probability distributions. Computational resources are focused on the given task, which often yields better performance (a sketch follows the list below).
Popular models:
- Logistic regression, SVMs
- Traditional neural networks, Nearest neighbor
- Conditional Random Fields (CRF)
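A minimal discriminative sketch, using logistic regression from scikit-learn on the same kind of made-up two-class data:

```python
# Discriminative sketch: logistic regression estimates the posterior
# P(class | x) directly, without modeling how the inputs were generated.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = np.concatenate([rng.normal(0, 1, 200), rng.normal(3, 1, 200)]).reshape(-1, 1)
y = np.array([0] * 200 + [1] * 200)

clf = LogisticRegression().fit(X, y)
print(clf.predict_proba([[1.5]]))   # estimated posterior probabilities at x = 1.5
```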
Selection Criteria
Prediction Accuracy vs Model Interpretability
There is an inherent tradeoff between prediction accuracy and model interpretability: as models become more flexible in how the function f is estimated, they become more obscure and harder to interpret. Inflexible (more restrictive) methods are better suited for inference, while flexible methods are generally preferable when prediction is the only goal.
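A sketch of the tradeoff on made-up nonlinear data: the linear model is easy to interpret (one slope, one intercept) but fits poorly, while the flexible model fits well but its decision logic is opaque (scikit-learn assumed; the random forest stands in for any flexible method):

```python
# Tradeoff sketch: inflexible vs. flexible fit on made-up nonlinear data.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(300, 1))
y = np.sin(X[:, 0]) + rng.normal(0, 0.1, size=300)

linear = LinearRegression().fit(X, y)                      # interpretable, inflexible
forest = RandomForestRegressor(random_state=0).fit(X, y)   # flexible, opaque

print(linear.score(X, y), forest.score(X, y))   # flexible model fits much better
```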