Goal: estimate probability that a random group-1 value is larger than a random group-2 value.
Group 1: 0.91 0.88 0.90 0.93
Group 2: 0.86 0.87 0.85 0.88
n1=4, n2=4, total pairs=16.
If most G1 values exceed G2 values, A12 will be noticeably above 0.5.
Often used in benchmarking and ML comparisons to express “probability of superiority” in an intuitive way.
Ties count as 0.5 (half win) because neither group dominates in that pair.
No. It compares distributions (via pairwise wins), not just means or a single metric.