Goal: compare two paired binary methods (A vs B) on the same subjects.
Suppose we test 100 items with two classifiers A and B and record paired outcomes (correct/incorrect).
McNemar uses only discordant pairs:
Example: b=12, c=5
Paired nominal outcomes (same items evaluated by two methods). It’s not for independent samples.
Concordant pairs (both correct or both incorrect) don’t inform which method is better in a paired comparison.
Exact is safer for small b+c. For large discordant totals, the chi-square approximation is usually fine.