

Thread Tools  Search this Thread  Display Modes 

06172012, 11:16 AM  #1 
Hall Of Fame
Join Date: Aug 2008
Location: Completing the point with a shoulderhigh punch into the open court.
Posts: 1,695

Overthinking Simple Math
Purely out of my own curiosity, I was running through some math in my head this morning.
If someone has a 40% success rate at some activity, then attempts two more trials, the first of which is a failure and the second of which is a success, what is flawed in the following logic: When the failure occurred, there were less total trials than there were when the success occurred, meaning the failure constituted a larger percentage of the total trials than did the success, and so the total average should decrease more than it should come back up? I realize this is inherently false; the total average will always increase. If you are earning a 40% in a class and then score a 50% on a twoquestion quiz, your grade will always increase, no matter which order the questions were asked. The most apt explanation I've been able to come up with is not to look at the 1/2 as a 50%, but instead to look at 0/1 and 1/1 separately, as 0% followed by 100%. Because the 0% is closer to the average (40%) than the 100% is, the 100% will have a greater effect on the average. In a set of data, an outlier will always effect the data more than a trial closer to the average, no matter which order the two trials were conducted. But what makes that logic more valid than my earlier 'logic'? 
06172012, 11:31 AM  #2 
Super Moderator

Yeah... you're overthinking it.
Your "success rate" is based off some prior data. In the example that you have 40% in a class, and then get a 50% on a test... if the 40% is out of a million points overall, and your test was a total of 2 points... it's probably not going to show on your final grade. Also, you're using two different scales here. You have a percentage based on some number of previous trials, then you're saying a single instance of 0% and a single instance of 100% are going to shift the results more than the other. You are comparing a discrete scale vs a percentage, which doesn't really make sense. If you're going to do that, you should stay in the same scale: 2/5 > 2/6 > 3/7 This is "success rate" per trial. Also, it sounds like you're assuming the base number doesn't change (or something??). While it's true that since there are less instances in the first failure, the second even hasn't occurred yet, so it isn't being considered in the total sum. Once you move to the next step, there is a greater data set, and all points are weighted the same (since you're taking a running total).
__________________
4x Dunlop Bio 200 L // Babolat Sensor Report posts, it helps everyone out! Mod Qs/Concerns/etc? Email me! (please make return/profile email is valid!) 
diredesire 
View Public Profile 
Find More Posts by diredesire 
06172012, 02:31 PM  #3 
Hall Of Fame
Join Date: Jul 2007
Location: Arizona
Posts: 2,544

Let's see... Suppose the 40% same from 4 successes in 10 trials. The 11th trial is a failure, we now have 4/11 = ..363636..., for a decrease of .03636...
Then, the 12th trial is a success, 5/12 = .41666... for an increase of .0530303... It turns out that the decimal value of the 12th trial was larger than that of the 11th! This seems strange, b/c as the denominator grows, we'd expect the decimal value to shrink. I think the answer to this paradox is that a positive outcome is not always equal to a negative one. For example, suppose I'm 1/100, a further failure to 1/101 is only a change of .00009901, but a successful trial yields a 2/101 which is a change of .0098. BTW, positive changes aren't always larger than negative ones, the closer you are to 100%, the larger changes come from failures, but the closer you are to 0%, it's the other way around. Since 40% is closer to 0% than 100%, the absolute value of the success is larger, even though it has a greater denominator. 
Steady Eddy 
View Public Profile 
Find More Posts by Steady Eddy 
06172012, 05:33 PM  #4 
Bionic Poster
Join Date: Oct 2005
Posts: 34,535

Probability is always used for the future. In the present, it is only relative frequency.
If you want to revise the probabilities based on new data, then you need to apply Bayes theorem and posteriori and a priori probabilities. That is how signal estimation is done in presence of noise as more and more samples arrive. 
06172012, 06:24 PM  #5  
Hall Of Fame
Join Date: Jul 2007
Location: Arizona
Posts: 2,544

Quote:
Say a coach has won 9 of 10 games for a 90% win rate. If he wins the next game, his rate becomes 91%, it only goes up 1%. But if he loses, his rate will be 73%, it drops nearly 18%! This doesn't seem fair. With a high success rate, new wins only improve it a tiny bit, but just one loss makes a huge drop. This lack of symmetry between wins and losses explains the puzzle the OP discussed, IMO. 

Steady Eddy 
View Public Profile 
Find More Posts by Steady Eddy 
06172012, 07:10 PM  #6  
Legend
Join Date: Jul 2009
Posts: 7,543

Quote:


06192012, 05:32 AM  #7  
Bionic Poster
Join Date: Oct 2005
Posts: 34,535

Quote:


06172012, 07:02 PM  #8  
Legend
Join Date: Jul 2009
Posts: 7,543

Quote:
Let's use a fairly classic example of a coin toss (a Bernoulli trial where heads is the success with probability 50%). That 50% never changes no matter what. No matter how many times you toss that coin, it will always be just as likely to be heads as it is tails. Using this example, your paradox can be explained via the following two probabilities: 1. What is the probability of getting a heads on the 9th toss given you've gotten 8 tails previously? 1/2. All of the trials are independent, and so the probability of the heads has nothing to do with the probability of the prior 8 tails. 2. What is the probability of getting 8 tails and then getting a heads on the 9th toss? That is 9 * (1/2)^8 * (1/2) = 0.018, or about 2%. In this case, your tails have negatively affected what you're thinking to be your success rate when in reality, they have no effect whatsoever. The first one is what's really going on, and the second is what's confusing you as being the same thing when it's completely different. 

06182012, 08:07 AM  #9 
Professional
Join Date: May 2007
Location: Concrete Jungle Where Dreams are Made
Posts: 889

It depends...
:P
__________________
I'm a poser and a wannabe but still probably better than you. 
06192012, 05:58 AM  #10 
Professional
Join Date: May 2007
Location: Concrete Jungle Where Dreams are Made
Posts: 889

Stop using calculators, they are wrong!
__________________
I'm a poser and a wannabe but still probably better than you. 

Thread Tools  Search this Thread 
Display Modes  

