Friday, May 21, 2010

Binomial Distribution with Normal Distribution? (Statistics)?

Hello, I am having trouble with this problem:





27% of all small businesses are owned by white females. In a random sample of 350 small businesses owned by whites, let x be the number owned by women.





a) Find the mean of x.





b) Find the standard deviation of x.





c) Find the z-score for the value x=99.5





d) Find the approximate probability that the number of small businesses in the sample of 350 that are owned by a woman is 100 or more.





Many thanks!

Binomial Distribution with Normal Distribution? (Statistics)?
Let Xb be the number of business owned by women. Xb has the binomial distribution with n = 350 trials and success probability p = 0.27





In general, if X has the binomial distribution with n trials and a success probability of p then


P[Xb = x] = n!/(x!(n-x)!) * p^x * (1-p)^(n-x)


for values of x = 0, 1, 2, ..., n


P[Xb = x] = 0 for any other value of x.





To use the normal approximation to the binomial you must first validate that you have more than 10 expected successes and 10 expected failures. In other words, you need to have n * p %26gt; 10 and n * (1-p) %26gt; 10.





Some authors will say you only need 5 expected successes and 5 expected failures to use this approximation. If you are working towards the center of the distribution then this condition should be sufficient. However, the approximations in the tails of the distribution will be weaker especially if the success probability is low or high. Using 10 expected successes and 10 expected failures is a more conservative approach but will allow for better approximations especially when p is small or p is large.





In this case you have:


n * p = 350 * 0.27 = 94.5 expected success


n * (1 - p) = 350 * 0.73 = 255.5 expected failures





We have checked and confirmed that there are enough expected successes and expected failures. Now we can move on to the rest of the work.





If Xb ~ Binomial(n, p) then we can approximate probabilities using the normal distribution where Xn is normal with mean μ = n * p, variance σ² = n * p * (1-p), and standard deviation σ





Xb ~ Binomial(n = 350 , p = 0.27 )


Xn ~ Normal( μ = 94.5 , σ² = 68.985 )


Xn ~ Normal( μ = 94.5 , σ = 8.305721 )





I have noted two different notations for the Normal distribution, one using the variance and one using the standard deviation. In most textbooks and in most of the literature, the parameters used to denote the Normal distribution are the mean and the variance. In most software programs, the standard notation is to use the mean and the standard deviation.





The probabilities are approximated using a continuity correction. We need to use a continuity correction because we are estimating discrete probabilities with a continuous distribution. The best way to make sure you use the correct continuity correction is to draw out a small histogram of the binomial distribution and shade in the values you need. The continuity correction accounts for the area of the boxes that would be missing or would be extra under the normal curve.





P( Xb %26lt; x) ≈ P( Xn %26lt; (x - 0.5) )


P( Xb %26gt; x) ≈ P( Xn %26gt; (x + 0.5) )


P( Xb ≤ x) ≈ P( Xn ≤ (x + 0.5) )


P( Xb ≥ x) ≈ P( Xn ≥ (x - 0.5) )


P( Xb = x) ≈ P( (x - 0.5) %26lt; Xn %26lt; (x + 0.5) )


P( a ≤ Xb ≤ b ) ≈ P( (a - 0.5) %26lt; Xn %26lt; (b + 0.5) )


P( a ≤ Xb %26lt; b ) ≈ P( (a - 0.5) %26lt; Xn %26lt; (b - 0.5) )


P( a %26lt; Xb ≤ b ) ≈ P( (a + 0.5) %26lt; Xn %26lt; (b + 0.5) )


P( a %26lt; Xb %26lt; b ) ≈ P( (a + 0.5) %26lt; Xn %26lt; (b - 0.5) )





In the work that follows Xb has the binomial distribution, Xn has the normal distribution and Z has the standard normal distribution.





Remember that for any normal random variable Xn, you can transform it into standard units via: Z = (Xn - μ ) / σ





P( Xb ≥ 100 ) =





350


∑ P(Xb = x) = 0.2716606


x = 100





≈ P( Xn ≥ 99.5 )


= P( Z ≥ ( 99.5 - 94.5 ) / 8.305721 )


= P( Z ≥ 0.6019947 )


= 0.2735888


No comments:

Post a Comment