Reminder: This post contains 749 words
· 3 min read
· by Xianbin
Total variation distance is very useful in many areas. This post shows some useful properties.
Definition
Consider two distributions μ,ν (probabilities on E), the total variation distance between μ and ν is as follows.
∥μ−ν∥tvd=supA⊂E∣μ(A)−ν(A)∣
Since ∑x∈Eμ(x)=∑x∈Eν(x)=1, then we have
μ(x)≥ν(x)∑μ(x)−ν(x)=ν(x)≤μ(x)∑ν(x)−μ(x)
∥μ−ν∥tvd=21x∈E∑∣μ(x)−ν(x)∣
Let B:={x:μ(x)≥ν(x)}
Let X∼μ,Y∼ν.
P(X=Y)≥P(X∈B,Y∈Bˉ)=P(X∈B)−P(X∈B,Y∈B)≥P(X∈B)−P(Y∈B)=μ(B)−ν(B)=∥μ−ν∥tvd