Back to Glossary
Machine Learning

Mixture density network (MDN)

A mixture density network (MDN) is a type of neural network that predicts a probability distribution over possible outputs, rather than a single point estimate. It combines a standard neural network with a mixture model (often a Gaussian mixture model) to estimate the conditional probability density function of the target variable given the input.

Explanation

MDNs are particularly useful when dealing with multi-modal data, where the target variable can have multiple likely values for a given input. Instead of predicting a single value, the network outputs the parameters (e.g., means, variances, and mixing coefficients) of a mixture distribution. The neural network part of the MDN maps the input to these parameters. The loss function is typically the negative log-likelihood of the target data under the predicted mixture distribution. During training, the network learns to adjust the parameters of the mixture components to best fit the data's underlying distribution. This allows the model to capture complex relationships and uncertainties inherent in the data. MDNs can be used in a variety of applications, including robotics, time series forecasting, and inverse problems where the mapping from input to output is not one-to-one.

Related Terms