Total variation distance of probability measures
In probability theory, the total variation distance is a distance measure for probability distributions. It is an example of a statistical distance metric, and is sometimes just called "the" statistical distance.
Definition
The total variation distance between two probability measures $P$ and $Q$ on a sigma-algebra $\mathcal{F}$ of subsets of the sample space $\Omega$ is defined via[1]

$$\delta(P, Q) = \sup_{A \in \mathcal{F}} \left| P(A) - Q(A) \right|.$$
Informally, this is the largest possible difference between the probabilities that the two probability distributions can assign to the same event.
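For a small finite sample space, this supremum can be checked directly by enumerating every event. A minimal sketch, assuming two hypothetical distributions on a three-point space:

```python
from itertools import chain, combinations

# Two hypothetical distributions on the sample space {0, 1, 2}
P = {0: 0.5, 1: 0.3, 2: 0.2}
Q = {0: 0.2, 1: 0.4, 2: 0.4}

def tv_distance_bruteforce(P, Q):
    """Supremum of |P(A) - Q(A)| over all events A (all subsets of the sample space)."""
    omega = list(P)
    events = chain.from_iterable(combinations(omega, r) for r in range(len(omega) + 1))
    return max(abs(sum(P[x] for x in A) - sum(Q[x] for x in A)) for A in events)

print(tv_distance_bruteforce(P, Q))  # ≈ 0.3, attained e.g. at A = {0}
```

Enumerating all $2^{|\Omega|}$ events is only feasible for tiny spaces; the special cases below give a directly computable formula.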
Special cases
For a finite or countable alphabet we can relate the total variation distance to the 1-norm of the difference of the two probability distributions as follows:[2]

$$\delta(P, Q) = \frac{1}{2} \| P - Q \|_1 = \frac{1}{2} \sum_{x} \left| P(x) - Q(x) \right|.$$
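This half-1-norm formula reduces the supremum over events to a single sum. A minimal sketch, assuming two hypothetical distributions given as aligned probability vectors:

```python
# Hypothetical distributions over a finite alphabet, as aligned probability vectors
p = [0.5, 0.3, 0.2]
q = [0.2, 0.4, 0.4]

# Total variation distance as half the 1-norm of the difference
tv = 0.5 * sum(abs(pi - qi) for pi, qi in zip(p, q))
print(tv)  # ≈ 0.3
```

The supremum in the general definition is attained at the event where $P$ exceeds $Q$ (here $\{0\}$), which is why summing absolute differences and halving gives the same value.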
Similarly, for an arbitrary sample space $\Omega$, a measure $\mu$, and probability measures $P$ and $Q$ with Radon–Nikodym derivatives $f_P$ and $f_Q$ with respect to $\mu$, an equivalent definition of the total variation distance is

$$\delta(P, Q) = \frac{1}{2} \int_{\Omega} \left| f_P - f_Q \right| \, d\mu.$$
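For measures with densities, the integral form can be approximated numerically. A minimal sketch, assuming two normal densities and a simple trapezoidal rule (the helper names and truncation interval are illustrative choices, not part of the definition):

```python
import math

def normal_pdf(x, mu, sigma):
    """Density of the normal distribution N(mu, sigma^2) at x."""
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))

def tv_distance_densities(f, g, lo=-10.0, hi=10.0, n=20_000):
    """Trapezoidal approximation of (1/2) * integral of |f(x) - g(x)| over [lo, hi]."""
    h = (hi - lo) / n
    total = 0.0
    for i in range(n + 1):
        x = lo + i * h
        w = 0.5 if i in (0, n) else 1.0  # endpoint weights of the trapezoidal rule
        total += w * abs(f(x) - g(x))
    return 0.5 * total * h

def f(x): return normal_pdf(x, 0.0, 1.0)
def g(x): return normal_pdf(x, 1.0, 1.0)

print(tv_distance_densities(f, g))  # ≈ 0.383 for N(0,1) vs N(1,1)
```

The truncation to $[-10, 10]$ is harmless here because both densities are negligible outside it.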
Relationship with other concepts
The total variation distance is related to the Kullback–Leibler divergence by Pinsker's inequality:

$$\delta(P, Q) \le \sqrt{\frac{1}{2} D_{\mathrm{KL}}(P \parallel Q)}.$$
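Pinsker's inequality can be illustrated numerically; a minimal sketch, assuming two hypothetical Bernoulli distributions:

```python
import math

# Two hypothetical Bernoulli success probabilities
p, q = 0.3, 0.6

# Total variation distance between Bernoulli(p) and Bernoulli(q); this equals |p - q|
tv = 0.5 * (abs(p - q) + abs((1 - p) - (1 - q)))

# Kullback-Leibler divergence D(P || Q), in nats
kl = p * math.log(p / q) + (1 - p) * math.log((1 - p) / (1 - q))

# Pinsker's inequality: tv <= sqrt(kl / 2)
print(tv, math.sqrt(kl / 2))
```

For these values the bound is nearly tight (about 0.300 versus 0.303), which is typical when the two distributions are close on a binary alphabet.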
References
- ↑ Chatterjee, Sourav. "Distances between probability measures" (PDF). UC Berkeley. Archived from the original (PDF) on July 8, 2008. Retrieved 21 June 2013.
- ↑ Levin, David Asher; Peres, Yuval; Wilmer, Elizabeth Lee. Markov Chains and Mixing Times. American Mathematical Soc. ISBN 9780821886274.