You’re missing the point.
It excludes the match. So then my rating drops. If my rating drops low enough, then the match counts!
So UTR runs iteration after iteration, with my rating gyrating up and down because it doesn’t know which matches to count and which to exclude.
But the reliability % goes up when more matches are counted, and the lower my rating, the more matches are counted. So eventually the algorithm becomes more confident in my rating if it counts my poor performances and excludes my good ones.
If I win with a bad partner, it doesn’t count. But if I win with a good partner, it does!
In other words, it excludes only the best performances, and keeps the worst. This is the definition of bias.
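To make the point concrete, here is a toy sketch in Python. It is not UTR's actual algorithm, which isn't public. The cutoff rule, the `window` parameter, the function name `iterate_rating`, and the sample numbers are all assumptions I made up for illustration. The only thing it shows is that if a filter drops just the results far above the current rating and then recomputes the rating from what's left, the rating settles below the plain average of all results.

```python
from statistics import mean


def iterate_rating(performances, window=1.0, max_iters=20):
    """Toy model (hypothetical, not UTR's real method).

    A match is counted only if its performance is at most `window`
    above the current rating; the rating is the mean of counted matches.
    """
    rating = mean(performances)  # start from the unfiltered average
    counted = list(performances)
    for _ in range(max_iters):
        # One-sided filter: only unusually good results are excluded.
        counted = [p for p in performances if p <= rating + window]
        new_rating = mean(counted)
        if new_rating == rating:  # nothing changed, so we have converged
            break
        rating = new_rating
    return rating, counted


# Made-up performance values: four ordinary results and one great one.
results = [6.0, 6.5, 7.0, 7.5, 9.0]
rating, counted = iterate_rating(results)
print("average of all results:", mean(results))
print("rating after filtering:", rating)
print("matches counted:", counted)
```

With these made-up numbers the 9.0 result is the one that gets thrown out, and the filtered rating ends up lower than the average of everything. A filter that cut off both tails equally would not pull the rating in one direction like this.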