I was looking at the match statistics on the ATP site. The last section on each match is the "point statistics" where they break down the points into points won on serve, and points won receiving serve.
I was a bit surprised to learn that the correlation between this statistic and the break/hold percentages is much less linear than I thought. The correlation is there, of course, but variations in the holding/breaking percentage are strongly magnified by variations in the percentage of points won on serve. By this I mean the following. Let’s assume a match where 50% of points are won on serve, and 50% on return. In such a match, you would expect the break percentage (and the holding percentage) to be both also 50%. It is a safe assumption.
But now what happens is that as you increase the points won on serve above 50%, the holding percentage increases much faster. Thus, winning about 60-65 percent of points on serve, translates in a holding percentage between 75 and 80 percent! And winning above 70 percent of points on serve brings your holding percentage above 90%, making you almost unbreakeable.
An extreme illustration of this strong non-linearity can be seen in the Federer-Roddick match at Wimbledon. Consider. Federer won 78% of his points on serve. Roddick won 71% if his points on serve. So the combined percentage of points won on serve, for the entire match, was just above 74%. And yet the holding percentage for the match was a staggering 96% (only 3 out of 75 games were breaks).
Of course looking only at a few matches is not at all statistically meaningful. An interesting exercise would be to compare break or holding percentages in different tournaments (all matches) with the percentage of points won on return or on serve also for the entire tournament.
I am thinking that calculations done by points, rather than games, may increase the accuracy of comparing the behaviour of different surfaces when it comes to their friendliness toward servers. It just seems like a finer tool. In other words, I would expect the variations within the same surface, year in year out, to be even less pronounced than the variations you obtain by measuring break or holding percentages, though I may be wrong. I wish the organizers of all majors and master level tournaments made these statistics available.
I was a bit surprised to learn that the correlation between this statistic and the break/hold percentages is much less linear than I thought. The correlation is there, of course, but variations in the holding/breaking percentage are strongly magnified by variations in the percentage of points won on serve. By this I mean the following. Let’s assume a match where 50% of points are won on serve, and 50% on return. In such a match, you would expect the break percentage (and the holding percentage) to be both also 50%. It is a safe assumption.
But now what happens is that as you increase the points won on serve above 50%, the holding percentage increases much faster. Thus, winning about 60-65 percent of points on serve, translates in a holding percentage between 75 and 80 percent! And winning above 70 percent of points on serve brings your holding percentage above 90%, making you almost unbreakeable.
An extreme illustration of this strong non-linearity can be seen in the Federer-Roddick match at Wimbledon. Consider. Federer won 78% of his points on serve. Roddick won 71% if his points on serve. So the combined percentage of points won on serve, for the entire match, was just above 74%. And yet the holding percentage for the match was a staggering 96% (only 3 out of 75 games were breaks).
Of course looking only at a few matches is not at all statistically meaningful. An interesting exercise would be to compare break or holding percentages in different tournaments (all matches) with the percentage of points won on return or on serve also for the entire tournament.
I am thinking that calculations done by points, rather than games, may increase the accuracy of comparing the behaviour of different surfaces when it comes to their friendliness toward servers. It just seems like a finer tool. In other words, I would expect the variations within the same surface, year in year out, to be even less pronounced than the variations you obtain by measuring break or holding percentages, though I may be wrong. I wish the organizers of all majors and master level tournaments made these statistics available.