Box Office Mojo collects and posts data on movie grosses. For a random sample of 50 movies, we obtained both the domestic (U.S.) and overseas grosses,

snowlovelydayM 2020-11-09 Answered
Box Office Mojo collects and posts data on movie grosses. For a random sample of 50 movies, we obtained both the domestic (U.S.) and overseas grosses, in millions of dollars. a) Obtain a scatterplot for the data. b) Decide whether finding a regressimz line for the data is reasonable. If so, then also do parts (c)-(f). c) Determine and interpret the regression equation for the data. d) Identify potential outliers and influential observations. e) In case a potential outlier is present, remove it and discuss the effect. f) In case a potential influential observation is present, remove it and discuss the effect.
You can still ask an expert for help

Expert Community at Your Service

  • Live experts 24/7
  • Questions are typically answered in as fast as 30 minutes
  • Personalized clear answers
Learn more

Solve your problem for the price of one coffee

  • Available 24/7
  • Math expert for every subject
  • Pay only if we can solve it
Ask Question

Expert Answer

Tuthornt
Answered 2020-11-10 Author has 107 answers

Given: n= Sample size =50 a) Domestic is on the horizontal axis and Overseas is on the vertical axis. image b) It is reasonable to find a regression lien for the data if there is no strong curvature present in the scatterplot. We note that there is no strong curvature in the scatterplot of part (a) and thus it is reasonable to find a regression line for the data. c) Let us first determine the necessary sums:  xi=3588.9
 xi2=712440.81
 yi=5233.5
 xiyi=968209.51 Next, we can determine Sxx and Sxy
Sxx=  xi2  ( xi)2n=712440.81  3588.9250=4548367458
Sxy=  xiyi  ( xi)( yi)n=968209.51  3588.9  5233.550=592559.347 The estimate b of the slope β is the ratio of Sxy and Sxx: b= SxySxx= 592559.374454836.7458=1.3028 The mean is the sum of all values divided by the number of values: x=  xin= 3588.950=71.778
y=  yin= 5233.550=104.67 The estimate a of the intercept α is the average of y decreased by the product of the estimate of the slope and the average of x. a= y  b x=104.67  1.3028  71.778=11.1579 General least-squares equation: y^= α + β x. Replace α by a=11.1579 and β by b=1.3028 in the general least-squares equation: y=a + bx=11.1579 + 1.3028x d) There appear to be two outliers, because the two rightmost points lie far from the group of other points. There appear to be an influential obsevation, because the point in the rightmost corner lies very close to the regression line while the point is a potential outlier. e) Let us first determine the necessary sums:  xi=2732.8
 xi2=345183.2
 yi=4314
 xiyi=567537.88 Next, we can determine Sxx and

Not exactly what you’re looking for?
Ask My Question

Expert Community at Your Service

  • Live experts 24/7
  • Questions are typically answered in as fast as 30 minutes
  • Personalized clear answers
Learn more

Relevant Questions

asked 2020-11-12
The ability to estimate the volume of a tree based on a simple measurement, such as the tree’s diameter, is important to the lumber industry, ecologists, and conservationists. Data on volume, in cubic feet, and diameter at breast height, in inches, for 70 shortleaf pines were reported in C. Bruce and F. X. Schumacher’s Forest Mensuration (New York: McGraw-Hill, 1935) and analyzed by A. C. Akinson in the article “Transforming Both Sides of a Tree” (The American Statistician, Vol. 48, pp. 307–312). a) Obtain a scatterplot for the data. b) Decide whether finding a regression line for the data is reasonable. If so, then also do parts (c)-(f). c) Determine and interpret the regression equation for the data. d) Identify potential outliers and influential observations. e) In case a potential outlier is present, remove it and discuss the effect. f) In case a potential influential observation is present, remove it and discuss the effect.
asked 2021-02-24
The document Arizona Residential Property Valuation System, published by the Arizona Department of Revenue, describes how county assessors use computerized systems to value single-family residential properties for property tax purposes. a) Obtain a scatterplot for the data. b) Decide whether finding a regression line for the data is reasonable. If so, then also do parts (c)-(f). c) Determine and interpret the regression equation for the data. d) Identify potential outliers and influential observations. e) In case a potential outlier is present, remove it and discuss the effect. f) In case a potential influential observation is present, remove it and discuss the effect.
asked 2021-07-04

Using the daily high and low temperature readings at Chicago's O'Hare International Airport for an entire year, a meteorologist made a scatterplot relating y = high temperature to x = low temperature, both in degrees Fahrenheit.

After verifying that the conditions for the regression model were met, the meteorologist calculated the equation of the population regression line to be  [μy=16.6+1.02]with[σ=6.6+F]

If the meteorologist used a random sample of 10 days to calculate the regression line instead of using all the days in the year, would the slope of the sample regression line be exactly 1.02? Explain your answer.

asked 2021-05-29
Using the health records of ever student at a high school, the school nurse created a scatterplot relating y = height (in centimeters) to x = age (in years). After verifying that the conditions for the regression model were met, the nurse calculated the equation of the population regression line to be μ0=105+4.2x with σ=7 cm. If the nurse used a random sample of 50 students from the school to calculate the regression line instead of using all the students, would the slope of the sample regression line be exactly 4.2? Explain your answer.
asked 2020-10-21
An issue of BARRON’S presented information on top wealth managers in the United States, based on individual clients with accounts of $1 million or more. Data were given for various variables, two of which were number of private client managers and private client assets. a) Obtain a scatterplot for the data. b) Decide whether finding a regression line for the data is reasonable. If so, then also do parts (c)–(f). c) Determine and interpret the regression equation for the data. d) Identify potential outliers and influential observations. e) In case a potential outlier is present, remove it and discuss the effect. f) In case a potential influential observation is present, remove it and discuss the effect.
asked 2020-12-30
Use the technology of your choice to do the following tasks. The National Oceanic and Atmospheric Administration publishes temperature and precipitation information for cities around the world in Climates of the World. Data on average high temperature (in degrees Fahrenheit) in July and average precipitation (in inches) in July for 48 cities are on the WeissStats CD. For part (d), predict the average July precipitation of a city with an average July temperature of 83F a) Construct and interpret a scatterplot for the data. b) Decide whether finding a regression line for the data is reasonable. If so, then also do parts (c)-(f). c) Determine and interpret the regression equation. d) Make the indicated predictions. e) Compute and interpret the correlation coefficient. f) Identify potential outliers and influential observations.
asked 2021-02-25
Researchers have asked whether there is a relationship between nutrition and cancer, and many studies have shown that there is. In fact, one of the conclusions of a study by B. Reddy et al., “Nutrition and Its Relationship to Cancer” (Advances in Cancer Research, Vol. 32, pp. 237-345), was that “...none of the risk factors for cancer is probably more significant than diet and nutrition.” One dietary factor that has been studied for its relationship with prostate cancer is fat consumption. On the WeissStats CD, you will find data on per capita fat consumption (in grams per day) and prostate cancer death rate (per 100,000 males) for nations of the world. The data were obtained from a graph-adapted from information in the article mentioned-in J. Robbins’s classic book Diet for a New America (Walpole, NH: Stillpoint, 1987, p. 271). For part (d), predict the prostate cancer death rate for a nation with a per capita fat consumption of 92 grams per day. a) Construct and interpret a scatterplot for the data. b) Decide whether finding a regression line for the data is reasonable. If so, then also do parts (c)-(f). c) Determine and interpret the regression equation. d) Make the indicated predictions. e) Compute and interpret the correlation coefficient. f) Identify potential outliers and influential observations.