!@#%#$ Math
Oct. 3rd, 2002 12:27 amOkay, I can't figure out the math on this problem. Can anyone help me?
I have a bunch'a numbers. Response times from a web server. I round off these response times into nice numbers. Closest 100 milliseconds or sumpthin. For every 100 millisecond interval, I count the number of occurences, giving me a pretty graph, like this.
Now I want to do standard deviation stuff. I calculate the standard deviation of the response times. Turns out, in my case, to be about 300 milliseconds. Big spread. Clustered in the [0s-0.5s] range.
Now, standard deviations are ideal for drawing the dreaded bell curves:
And I know that for each sigma, we're covering a larger percentage of the occurrences. In the following picture, for example:
the red area is supposed to cover something like 68% of the data, if the arrow marks out one sigma.
What I ultimately want to do is to render a graph with the standard deviation curve superimposed over the bar chart:
My question: how the hell do I calculate meaningful Y values for the standard deviation curve? I figure that the mid-point should be 50% of my total number of occurrences. How do I get the other points?
(Did I mention that I didn't do all that well in statistics? Or Fourier Analysis, but that's a whole 'nuther story).
What I remember from stats
Date: 2002-10-03 04:54 am (UTC)Here's a useful site: http://www.robertniles.com/stats/
More: http://www.math.hmc.edu/~gu/math142/mellon/curves_and_surfaces/curves/bell.html
I'm not awake yet, so I might be confusing what you're asking for. Are you saying that the distribution you have is not normal, and you want to find a way to make it normal?
What you have does not look like a normal distribution since everything since you have a higher density on the right than to the left.
I have a whole department full of statisticians here, so let me know if you still need help.
Re: What I remember from stats
Date: 2002-10-03 05:32 am (UTC)Oooo, this formula looks promising:
But what's capital-E in the equation?
What I'm trying to do is show the normal distribution compared to the actual frequency, so that people can get an immediate visual sense of the fact that the data isn't normalized (for whatever reason). If the data is reasonably normalized, I'd expect the bars to line up neatly under the curve. Make sense?
Re: What I remember from stats
Date: 2002-10-03 06:54 am (UTC)2.718282
Re: What I remember from stats
Date: 2002-10-03 07:08 am (UTC)That's capital-E? I thought that was little-e.
Re: What I remember from stats
Date: 2002-10-03 07:13 am (UTC)http://www.ruf.rice.edu/~lane/hyperstat/A25726.html
Gives the same thing with a small e.