The situation in the question: we have a number of scatter plots, each showing an estimated regression line (based on a valid model) and the associated individual 95% confidence intervals (CI) for the regression function at each x-value, as well as the observed data. A professor asks, 'I don't understand how 95% of the observations fall outside the 95% CI as depicted in the figures.' Briefly explain how it is entirely possible that 95% of the observations fall outside the 95% CI as depicted in the figures. (We weren't given actual figures.)

Lyla Carson

Answered question

2022-09-20

My own guess was that a large number of outliers distorted the fitted regression line, so that a confidence interval built around a poor regression line would itself be poor, leaving 95% of the observations outside the 95% CI.

Answer & Explanation

Kaiden Stevens

Beginner · 2022-09-21 · Added 12 answers

Step 1
In a classical frequentist setting, the probability statement attached to a confidence interval relates to the (random) bounds of the interval. For example, take the common confidence interval for the mean, μ, of some normal data-generating process. We have
$$P\left(\bar{y} - 1.96\,\frac{\sigma}{\sqrt{n}} < \mu < \bar{y} + 1.96\,\frac{\sigma}{\sqrt{n}}\right) = 0.95$$
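To make the interval concrete, here is a minimal Python sketch that computes the two bounds for one sample. The function name `mean_ci_known_sigma`, the sample values, and the choice σ = 0.5 are all made up for illustration, and σ is assumed to be known.

```python
import math
import statistics

def mean_ci_known_sigma(sample, sigma, z=1.96):
    """Return (lower, upper) bounds of the CI for the mean.

    Assumes normal data with a KNOWN standard deviation sigma;
    z=1.96 gives a 95% interval.
    """
    n = len(sample)
    y_bar = statistics.fmean(sample)          # sample mean
    half_width = z * sigma / math.sqrt(n)     # 1.96 * sigma / sqrt(n)
    return y_bar - half_width, y_bar + half_width

# Hypothetical data, for illustration only.
sample = [4.1, 5.3, 4.8, 5.9, 5.0, 4.6, 5.4, 4.9]
lower, upper = mean_ci_known_sigma(sample, sigma=0.5)
print(f"95% CI for the mean: ({lower:.3f}, {upper:.3f})")
```

Note that only the sample mean changes from sample to sample here; the half-width depends on σ and n alone, which is why the bounds are the random part of the statement.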
Step 2
Notice that μ is not treated as random; it is 'fixed', as there is only one true mean. The probability statement concerns the lower and upper bounds of the interval, that is, $\bar{y} \pm 1.96\,\sigma/\sqrt{n}$. Since these bounds depend on $\bar{y}$ (let's assume for the moment that we know σ), it is entirely possible (due to sheer 'luck') that for a specific sample we obtain a value of $\bar{y}$ for which the interval lies completely outside the majority of the observations. What the confidence interval does say is that, under repeated sampling, we should expect the interval to contain the true mean 95% of the time.
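The repeated-sampling reading can be checked with a small simulation. The sketch below is not the regression setting from the question; it uses the simpler known-σ interval for a mean from Step 1, and the function name `simulate`, the seed, and the values n = 1000, μ = 0, σ = 1 are assumptions chosen for illustration. It tracks two different things: how often the interval contains μ, and what share of the individual observations fall inside the interval.

```python
import math
import random
import statistics

def simulate(n, mu=0.0, sigma=1.0, reps=500, z=1.96, seed=0):
    """Repeatedly draw normal samples and build the known-sigma CI.

    Returns (share of intervals containing mu,
             share of individual observations inside their own interval).
    """
    rng = random.Random(seed)
    half_width = z * sigma / math.sqrt(n)
    covered = 0        # intervals that contain the true mean
    inside_total = 0   # observations lying inside the interval
    for _ in range(reps):
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        y_bar = statistics.fmean(sample)
        lower, upper = y_bar - half_width, y_bar + half_width
        covered += int(lower < mu < upper)
        inside_total += sum(1 for y in sample if lower < y < upper)
    return covered / reps, inside_total / (reps * n)

coverage, share_inside = simulate(n=1000)
print(f"intervals containing mu:      {coverage:.3f}")
print(f"observations inside interval: {share_inside:.3f}")
```

With n = 1000 the half-width is about 0.062σ, so one would expect the interval to contain μ in roughly 95% of repetitions while only a small share of the observations (on the order of 5%) land inside it. The interval is a statement about where the mean is, not about where individual observations fall, and it narrows as n grows while the spread of the data does not.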
