Applications of Optimal Transport

Schematic of the thermal slope effect of ice sheets
Optimal Transport Theory and Wasserstein Distance

Introduction to Optimal Transport and Wasserstein Distance

Optimal Transport (OT) theory focuses on how to transport “mass” (or probability distributions) from one to another with minimal cost. Starting from the practical problem of “how to minimize the cost of transporting soil to build fortifications”, Monge (1781) first discussed and studied optimal transport theory. In the 1940s, Kantorovich reformulated this problem and described the mass transport between different marginal distributions through a joint distribution (coupling). Wasserstein Distance is a “geometric distance” between distributions defined under the optimal transport framework, which can quantitatively measure the difference and similarity between two probability distributions. Compared with indicators describing the relationship between distributions such as Kullback–Leibler divergence, W distance is a strict metric that satisfies metric axioms (non-negativity, identity, symmetry, triangle inequality), so it is more stable and has more geometric interpretability when comparing distributions.

Physical Intuition of Wasserstein Distance

Suppose we want to transport and reshape a pile of soil $\mu$ into another shape $\nu$:

From this example, we can see some applications of Wasserstein distance:

Calculation of Wasserstein Distance

Some Applications of Wasserstein Distance in Climate Science

References

Figalli, A., & Glaudo, F. (2021). An invitation to optimal transport, Wasserstein distances, and gradient flows.