schmke
Legend
Apologies, I thought you were referring to something on TennisLink. Anything TR lists isn't rooted in reality, they have no clue (well, very little) if you are in range of appeal or not it seems from their performance this time around.
Apologies, I thought you were referring to something on TennisLink. Anything TR lists isn't rooted in reality, they have no clue (well, very little) if you are in range of appeal or not it seems from their performance this time around.
Apologies, I thought you were referring to something on TennisLink. Anything TR lists isn't rooted in reality, they have no clue (well, very little) if you are in range of appeal or not it seems from their performance this time around.
It actually did appear on tennislink also, but only until I used the appeal option. Im pretty sure TR is taking it directly from the usta site (i.e., it has nothing to do with the TR algorithm), because I’ve never seen it before on either site. I am guessing it will disappear from TR next week when it next updates. TR is re-calibrated to the usta computer ratings every year by force-fitting the discrepancies to match the usta computer.Apologies, I thought you were referring to something on TennisLink. Anything TR lists isn't rooted in reality, they have no clue (well, very little) if you are in range of appeal or not it seems from their performance this time around.
I just played my first USTA match in nearly 2 years.
8.0 mixed.
.......
UTR now says I’m a 4. A UTR 4.xx with 2 decimal place accuracy. Same as my female 3.5 mixed partners. Is it the rating decay with inactivity thing?
My main point, that many here are missing, is that these discrepancies like the example you cite don’t have to be there.The other issue is that as a practical matter there are only a very small number of doubles matches where teams of a single gender play against a team containing the other gender. This means that the UTR doubles rating system almost certainly does, in practice, have a separate rating for men and women. I would bet that if you wanted to increase your UTR rating you could play on a team with two men against teams of two women. How much this would increase your UTR doubles rating might vary because some areas might have more matches like that, in the pool than others. But on the whole I bet that would lead to a considerable increase in rating. When I look at the the opponents I played it was typically the male player was much more of a threat than UTR's rating suggested.
My main point, that many here are missing, is that these discrepancies like the example you cite don’t have to be there.
If UTR would simply fix the algorithm with a few tweaks, it would be much more accurate.
1. Do not adjust ratings based on performances of prior opponents.
2. Do not make any assumptions about rating change during period of inactivity.
3. Don’t throw out match data from more than 12 months, unless the player is a junior.
Make these fixes to repair the self-defeating issues, and the algorithm will no longer require large sample sizes to settle in. Which means UTR won’t have to put the “limited data” disclaimer on the page all the time.
My main point, that many here are missing, is that these discrepancies like the example you cite don’t have to be there.
If UTR would simply fix the algorithm with a few tweaks, it would be much more accurate.
1. Do not adjust ratings based on performances of prior opponents.
2. Do not make any assumptions about rating change during period of inactivity.
3. Don’t throw out match data from more than 12 months, unless the player is a junior.
Make these fixes to repair the self-defeating issues, and the algorithm will no longer require large sample sizes to settle in. Which means UTR won’t have to put the “limited data” disclaimer on the page all the time.
But being a 5.0 ntrp male in 9.0 mixed is a statistically proven advantage, almost as much edge as being a 4.5 in 8.0. The best player on the court in doubles usually wins, so he should have the chance to prove it. It should still count, but maybe discount the expected win %.Don't count doubles matches where there is more than 2.0 point difference between partners.
I am 5-6 points higher than my 9.0 partner.
I would win more points against Nadal than she would win against me.
J
1. Do not adjust ratings based on performances of prior opponents.
I don't know that adjusting based on future performance of prior opponents has anything to do with the floor and cap.Maybe Schmke could verify what I am thinking, but I don't think they can do 1 because they have a floor and a cap in ratings.
I don't know that adjusting based on future performance of prior opponents has anything to do with the floor and cap.
The reason to do such adjusting is to give consideration to a result against a mis-rated player affecting your rating inappropriately, the idea being that if you lost to them when they were a 5.2 but after a few matches they were a 7.7, that loss was not nearly as bad as it seemed. If you don't do the future/prior adjustment, your rating is unfairly dinged. It can obviously go the other way too.
....
All of that has nothing to do with caps and floors. I think they have caps and floors just to provide consistency of a sort, e.g. knowing that 15+ means a pro, and 16+ a very top pro, the best recreational players are somewhere 8-11, etc. And you have to have a floor lest things go negative which would look bad.
Now, could some consistency be realized without a cap? Probably, an artificial cap does seem unnecessary, but they seem to think they need it for some reason.
In what sense they did _not_ replicate the chess system? It is virtually identical. Obviously in chess they do not really lower the ranking due to non-activity - but that is more because of the nature of chess. In chess your level does not really significantly drop off due to age or inactivity. Plus the main problem with tennis ranking - which is lack of data/matches - does not really exist in chess so it is somewhat naturally more accurate. But it is not because the algorithm is different, it is because chess players actually play.That said, the chess system sounds cool, not sure why they didn’t try to replicate that.
Don't count doubles matches where there is more than 2.0 point difference between partners.
I am 5-6 points higher than my 9.0 partner.
I would win more points against Nadal than she would win against me.
J
In what sense they did _not_ replicate the chess system? It is virtually identical. Obviously in chess they do not really lower the ranking due to non-activity
- but that is more because of the nature of chess. In chess your level does not really significantly drop off due to age or inactivity.
In what sense they did _not_ replicate the chess system? It is virtually identical. Obviously in chess they do not really lower the ranking due to non-activity - but that is more because of the nature of chess. In chess your level does not really significantly drop off due to age or inactivity. Plus the main problem with tennis ranking - which is lack of data/matches - does not really exist in chess so it is somewhat naturally more accurate. But it is not because the algorithm is different, it is because chess players actually play.
When you have a player that plays like 5-7 matches over 3 years, and 90% of them are mixed doubles there's really only so much you can do with that data.
Good summary of chess ratings vs tennis.In chess if you achieve a 1400 1600 1800 2000 2200 or 2400 etc rating that is an accomplishment proving you acquired a certain level of competence at the game that you have for the rest of your life. That competence will be recognized by people decades later. If I am 200 points lower than someone else I have an idea that will lose approximately 75% of the games.
Tennis ratings are a disaster. I was a utr 4.4? yesterday today I am a 6.03! great! except I didn't actually play any games to change that rating. Is that any sort of accomplishment? UTR is like a magic 8 ball. UTR does not keep track of your highest rating it often seems like a number pulled out of someone's rear end. No one can tell you what a UTR or USTA difference of .5 means. No one has any clue what that difference means. I might be a 3.5 that can play evenly with a 4.0 or I might be a 3.5 that would typically lose 6-0 6-0 to a higher 3.5. Tennis ratings are a joke compared to chess ratings.
The question is not how tennis ratings are different than chess ratings. The question is how is it even possible that people constantly devise tennis ratings in ways that make them unmeaningful in all the way chess ratings are meaningful. It is only explained by conscious effort to make them meaningless and indeed that is what USTA pretty much admits. They say they are afraid people will focus on the number too much so they make it vague and practically meaningless.
In chess if you achieve a 1400 1600 1800 2000 2200 or 2400 etc rating that is an accomplishment proving you acquired a certain level of competence at the game that you have for the rest of your life. That competence will be recognized by people decades later. If I am 200 points lower than someone else I have an idea that will lose approximately 75% of the games.
Tennis ratings are a disaster. I was a utr 4.4? yesterday today I am a 6.03! great! except I didn't actually play any games to change that rating. Is that any sort of accomplishment? UTR is like a magic 8 ball. UTR does not keep track of your highest rating it often seems like a number pulled out of someone's rear end. No one can tell you what a UTR or USTA difference of .5 means. No one has any clue what that difference means. I might be a 3.5 that can play evenly with a 4.0 or I might be a 3.5 that would typically lose 6-0 6-0 to a higher 3.5. Tennis ratings are a joke compared to chess ratings.
The question is not how tennis ratings are different than chess ratings. The question is how is it even possible that people constantly devise tennis ratings in ways that make them unmeaningful in all the way chess ratings are meaningful. It is only explained by conscious effort to make them meaningless and indeed that is what USTA pretty much admits. They say they are afraid people will focus on the number too much so they make it vague and practically meaningless.
Tennis ratings are a disaster. I was a utr 4.4? yesterday today I am a 6.03! great! except I didn't actually play any games to change that rating.
I think this lack of clarity as to how UTR is spitting out numbers does hurt its legitimacy. I think NTRP is much better since people can see how it is done and gain a better understanding.
In chess if you achieve a 1400 1600 1800 2000 2200 or 2400 etc rating that is an accomplishment proving you acquired a certain level of competence at the game that you have for the rest of your life. That competence will be recognized by people decades later.
If I am 200 points lower than someone else I have an idea that will lose approximately 75% of the games.
How's that different from tennis? If I achieve a UTR 10, isn't that also an accomplishment, proof of a certain level of competence?
It would be interesting to see how Elo deals with small sample sizes, like we saw in tennis during the pandemic. As long as the sample size is big enough, I would guess that UTR does pretty well. It doesn't do so well with limited data [during the pandemic, my singles UTR went up 200 points without me playing any matches. According to their stats, at one point I was 23,000th in the world.].
my understanding is a bit different. I thought the way it worked/works is that no matter what your _current_ ranking might be you can never enter a tournament for a level that is lower than your all-time level highest level ever was. So if let's say you achieved 1850 Elo in like 2018, and now you are 1775, you still cannot enter a tournament for under-1800.They used to. And people would sandbag by not playing any officially rated matches and then enter a big tournament. At some point, they changed it to the current method.
well, of course your level drops with age and/or inactivity but that drop off is nowhere near close to what happens in tennis.Yes, it does. Your understanding of the fundamentals is still sound but many things decline with age and inactivity:
- Calculation speed
- Concentration
- Familiarity with many patterns and knowing quickly what to do rather than spending precious time figuring it out
- Opening theory: lines which were sound 20 years ago may have been discovered faulty
- Stamina: putting out tremendous mental effort over a 2 hour game is very draining
My last rated game was > 30 years ago. The young me would crush the present me but my rating has stayed constant. If I enter a tournament, I'm gonna get whupped.
if I ever find a person that is impressed by someone getting 1400, 1600, or 1800 Elo chess ranking then that would be the first such person I have encountered.In chess if you achieve a 1400 1600 1800 2000 2200 or 2400 etc rating that is an accomplishment proving you acquired a certain level of competence at the game that you have for the rest of your life. That competence will be recognized by people decades later.
and if you are 2 UTR points lower you are going to lose ~100% of matches. Satisfied?If I am 200 points lower than someone else I have an idea that will lose approximately 75% of the games.
for the nth time - how many actual matches have you played against opponents with verified ranking? Go and look up @GSG UTR profile. Or Mark Sansait's one. Both have pretty extensive play history, their ranking info is solid, it fluctuates a bit but within expected range. Those ranking make sense _because they actually play matches_.Tennis ratings are a disaster. I was a utr 4.4? yesterday today I am a 6.03! great! except I didn't actually play any games to change that rating. Is that any sort of accomplishment?
of course it does. Your profile shows you ranking history, and you can easily find the highest level you ever had.UTR is like a magic 8 ball. UTR does not keep track of your highest rating it often seems like a number pulled out of someone's rear end.
No one can tell you what a UTR or USTA difference of .5 means. No one has any clue what that difference means. I might be a 3.5 that can play evenly with a 4.0 or I might be a 3.5 that would typically lose 6-0 6-0 to a higher 3.5. Tennis ratings are a joke compared to chess ratings.
The question is not how tennis ratings are different than chess ratings. The question is how is it even possible that people constantly devise tennis ratings in ways that make them unmeaningful in all the way chess ratings are meaningful. It is only explained by conscious effort to make them meaningless and indeed that is what USTA pretty much admits. They say they are afraid people will focus on the number too much so they make it vague and practically meaningless.
my understanding is a bit different. I thought the way it worked/works is that no matter what your _current_ ranking might be you can never enter a tournament for a level that is lower than your all-time level highest level ever was. So if let's say you achieved 1850 Elo in like 2018, and now you are 1775, you still cannot enter a tournament for under-1800.
well, of course your level drops with age and/or inactivity but that drop off is nowhere near close to what happens in tennis.
Who knows? I achieved a rating of 6.03. What does that mean? They say if I am 2 points higher than I am so much better than someone else that they won't even rate the match unless I lose. Well I was 4.4 yesterday. So am I so much better today that the player I was yesterday could not compete? What does a .5 difference in rating mean in UTR? What does it mean in USTA?
If you don't know what that means then the rating is meaningless. In chess people know what the ratings mean. Therefore they are meaningful.
The sample size is small because the rating groups choose to limit the games counted. UTR cuts off at 12 months. USTA won't even rate many of their own matches let alone matches outside of USTA. That means the sample sizes are small by design and therefore the ratings are crap. And because the ratings are crap people don't bother posting games to be rated. Probably 70% of my matches are not rated.
if I ever find a person that is impressed by someone getting 1400, 1600, or 1800 Elo chess ranking then that would be the first such person I have encountered.
Same in chess. My profile on chess.com shows I was 2018 in February 2021.
I think UTR needs to fix its pricing scheme more than its algorithm. Outside of college coaching and recruiting I don’t see how it offers nearly the value they try to charge for premium access.
@Moon Shooter,
Your emphasis on rating as a measure of your skill is like someone trying to lose weight who weighs himself every 5 minutes and is depressed when he sees the # rise by a few ounces, not realizing that weight fluctuation during the day is normal. You believe that only a linear progression, even at small time intervals, of lower #s proves you're losing weight. I think most would agree that's the exact wrong approach because you're overwhelmed by randomness. Weigh yourself fewer times and trust the process of increased calorie burning and better diet, etc. That takes the randomness out of the equation.
Yes, that's a huge change over a short time. As more matches are recorded, the jumps will decrease. You're still in the "settling out" phase.
You want close to 100% accuracy, 100% of the time, irrespective of sample size. That's not realistic.
So record your matches in myutr.com [with your opponent's agreement to do likewise]: there is a section for non-sanctioned matches. That solves the sample size problem.
The ratings work for me as I get competitive matches.
The target audience [outside juniors and college] is people like MoonShooter who place a great deal of emphasis on rating. Many don't even care enough to check the site let alone pay for premium access.
The target audience also includes people interested in playing UTR events. Premium access also gets you a discount on registration. In my area at least, there are starting to be enough adult tournaments, flex leagues, etc. that the yearly premium fee could end up paying for itself.
It's more like looking at a scale and having it tell you that you weigh somewhere between 180 and 440 pounds.
No it doesn't solve the problem because they still cut off the data at 12 months.
They also refuse to consider singles ratings at all for doubles ratings and vice versa - even when they have very little data to work with.
This means most adults will never have an accurate number.
And because most adults will never have an accurate number that is why most adults don't care about the number. So most adults have no interest in posting games there. See how this bad decision leads to a snowball effect of problems?
Well, it would be impressive to a 1200.![]()
So if you can't get a more accurate scale, stop weighing yourself so often. The scale isn't going to change [at least in the short-term]. The only thing that can change is how you use it.
You'd have to talk to the designers who made that decision. I assume it has something to do with the relevancy of the older data, which makes sense if the target audience is juniors who A) play a ton of matches; and B) improve rapidly.
The designers want UTR to be universal but these assumptions obviously don't hold so well for the typical adult player.
Again, a design decision with certain tradeoffs. I can see the pros and cons.
Why not? Why would including both improve accuracy? Yes, it increases the sample size but you're potentially mixing apples and oranges.
No, I don't see. I have a reasonably accurate UTR and I'd be fine using it to enter a tournament. I have no interest posting non-sanctioned matches because I just don't care enough about my UTR. You obviously care a lot more so you should post them. Not an ideal solution but at least it gives you more granularity.
I'm proposing they make the scale more accurate. What is wrong with that? Checking my rating less frequently will not make it more accurate.
Right not including any games older than 12 months does not work well for the typical adult player. That is my point. It is a design decision it is not something that is inevitable for any rating system. They seem to have made the decision to use the same algo for adults as kids due to marketing even though it clearly does not work so well for adults.
The rating works well if it predicts future results better. If you have many singles and doubles matches then it makes sense to seperate them out. If you have very few games of singles or doubles it does not make sense to completely ignore the other data. Lets say you are playing doubles with someone with a doubles rating of 3.2. But you see they only have 2 doubles matches. But you also see they have 20 singles matches and a singles rating of 8.36. Are you going to completely ignore the 8.36 when you consider how good they may be at doubles?
Likewise when UTR has people that may have played say 4 doubles matches and 3 singles matches it doesn't make sense for them to not at least consider all the data to some extent. Of course when the number of matches in a catagory gets larger then the importance of the other category can drop off. But until that happens it is silly to think only one category is useful to helping you determine strength of play.
Asking someone I play to include their match won't do much unless they are also going to post other matches they play. And UTR purposely ignores data that would make their ratings more accurate so most people are uninterested in bothering.
You see you think everyone is just uninterested in their rating because only rare people like me would ever be interested in an objective measure of their tennis skill compared to the rest of the country or world. But I think the main reason people are uninterested in these ratings is because they are inaccurate, and they are inaccurate because they intentionally ignore relevant data.
Hence the "Universal" in UTR.
I guess it depends on how much work that change will be vs what payoff I expect to receive from it and what potential negative effects it could have on the target audience. If it's to make rec players happy, that isn't going to generate nearly as much revenue as the target audience of juniors/collegiates/coaches.
I see the logic in that. But the bottom line is that your proposed changes aren't going to be implemented any time soon so do you change your behavior to alter the outcome?
I can only go by what I observe: I don't see people obsessing over their UTR and I've never heard someone comment that they would be interested if only the ratings were more accurate. People use UTR, tennisrecord, tennislink, etc as a window into their results. Most don't demand a high level of accuracy. I've seen a high degree of correlation among all of the sources I've checked and that's good enough for me.
The universal rating does not need to mean it uses the same k factor for older games for young people as it does for adults. It can just mean that everyone gets a rating with the same algorithm and that algorithm can account for age if accounting for age makes it more accurate.
I don't know what you are saying. How do you know what changes may or may not be implemented?
You don't see several threads and discussions like this one, where people are saying how the tennis ratings systems fail in many ways?
I don't think it is correct to say people are "obsessing" over their UTR or NTRP just because they mention problems.
You seem to live in an area where there are lots of USTA and UTR match opportunities. I am not sure you can speak for "most" people. I don't claim to speak for most people either. But your smug comments about how well it works well for you in your part of the country therefore everyone else must be just "obsessing" and should change their behavior, is condescending.
In chess if you achieve a 1400 1600 1800 2000 2200 or 2400 etc rating that is an accomplishment proving you acquired a certain level of competence at the game that you have for the rest of your life. That competence will be recognized by people decades later.