# The table shows the population of various cities, in thousands, and the average walking speed, in feet per second, of a person living in the city

Question
Modeling data distributions

The table shows the population of various cities, in thousands, and the average walking speed, in feet per second, of a person living in the city. $$\begin{array}{|c|c|} \hline Population\ (thousands) & Walking Speed\ (feet\ per\ second) \\ \hline 5.5 & 0.6 \\ \hline 14 & 1.0\\ \hline 71 & 1.6\\ \hline 138 & 1.9\\ \hline 342 & 2.2\\ \hline \end{array}$$

2020-12-29

Graph: Consider Walking speed as y-coordinates and population as x-coordinate and the table for this is given below: $$\begin{array}{|c|c|} \hline Population\ (thousands) & Walking Speed\ (feet\ per\ second) \\ \hline x-coordinates & y-coordinates\\ \hline 5.5 & 0.6 \\ \hline 14 & 1.0\\ \hline 71 & 1.6\\ \hline 138 & 1.9\\ \hline 342 & 2.2\\ \hline \end{array}$$

Use these points to plot the scatter plot. Thus, the scatter plot for the data is: Interpretation: The data in the scatter plot increase rapidly at first and then begin to level off a bit, the shape suggests that the logarithmic function is good choice for modeling the data.

### Relevant Questions

The following table shows the average yearly tuition and required fees, in thousand of dollars, charged by a certain private university in the school year beginning in the given year.
$$\begin{array}{|c|c|}\hline \text{Year} & \text{Average tuition} \\ \hline 2005 & 17.6 \\ \hline 2007 & 18.1 \\ \hline 2009 & 19.5 \\ \hline 2011 & 20.7 \\ \hline 2013 & 21.8 \\ \hline \end{array}$$
What prediction does the formula modeling this data give for average yearly tuition and required fees for the university for the academic year beginning in 2019?

The table shows the temperatures T (in degrees Fahrenheit) at which water boils at selected pressures p (in pounds per square inch). A model that approximates the datais: $$\displaystyle{T}={87.97}\ +\ {34.96}\ \text{In}\ {p}\ +\ {7.91}\ \sqrt{{{p}}}$$ a) Use a graphing untility to plot the data and graph the model in the same veiwing window. How well does the model fit the data? b) Use the graph to estimate the pressure at which the boiling point of water is  $$300^{\circ}$$ F. c) Calculate T when the pressure is 74 pounds per square inch. Verify your answer graphically.

1)A rewiew of voted registration record in a small town yielded the dollowing data of the number of males and females registered as Democrat, Republican, or some other affilation:

$$\begin{array}{c} Gender \\ \hline Affilation & Male & Female \\ \hline Democrat & 300 & 600 \\ Republican & 500 & 300 \\ Other & 200 & 100 \\ \hline \end{array}$$

What proportion of all voters is male and registered as a Democrat? 2)A survey was conducted invocted involving 303 subject concerning their preferences with respect to the size of car thay would consider purchasing. The following table shows the count of the responses by gender of the respondents:

$$\begin{array}{c} Size\ of\ Car \\ \hline Gender & Small & Medium & lange & Total \\ \hline Female & 58 & 63 & 17 & 138 \\ Male & 79 & 61 & 25 & 165 \\ Total & 137 & 124 & 42 & 303 \\ \hline \end{array}$$

the data are to be summarized by constructing marginal distributions. In the marginal distributio for car size, the entry for mediums car is ?

The tables show the battery lives (in hours) of two brands of laptops. a) Make a double box-and-whisker plot that represent's the data. b) Identifity the shape of each distribution. c) Which brand's battery lives are more spread out? Explain. d) Compare the distributions using their shapes and appropriate measures of center and variation.

Use the table from the Theoretical Distribution section to calculate the following answers. Round your answers to four decimal places. $$P(x = 3)=?$$
$$P(1 < x < 4) = ?$$
$$P(x \geq 8) = ?$$ Use the data from the Organize the Data section to calculate the following answers. Round your answers to four decimal places. $$RF(x = 3) = ?$$
$$RF(1 < x < 4) =?$$
$$RF(x \geq 8) = ?$$ Discussion Questions 1. Knowing that data vary, describe three similarities between the graphs and distributions of the theoretical, empirical, and simulation distributions. Use complete sentences.

An analysis of laboratory data collected with the goal of modeling the weight (in grams) of a bacterial culture after several hours of growth produced the least squares regression line $$\log(weight) = 0.25 + 0.61$$hours. Estimate the weight of the culture after 3 hours.

A) 0.32 g

B) 2.08 g

C) 8.0 g

D) 67.9 g

E) 120.2 g

A random sample of $$\displaystyle{n}_{{1}}={16}$$ communities in western Kansas gave the following information for people under 25 years of age.
$$\displaystyle{X}_{{1}}:$$ Rate of hay fever per 1000 population for people under 25
$$\begin{array}{|c|c|} \hline 97 & 91 & 121 & 129 & 94 & 123 & 112 &93\\ \hline 125 & 95 & 125 & 117 & 97 & 122 & 127 & 88 \\ \hline \end{array}$$
A random sample of $$\displaystyle{n}_{{2}}={14}$$ regions in western Kansas gave the following information for people over 50 years old.
$$\displaystyle{X}_{{2}}:$$ Rate of hay fever per 1000 population for people over 50
$$\begin{array}{|c|c|} \hline 94 & 109 & 99 & 95 & 113 & 88 & 110\\ \hline 79 & 115 & 100 & 89 & 114 & 85 & 96\\ \hline \end{array}$$
(i) Use a calculator to calculate $$\displaystyle\overline{{x}}_{{1}},{s}_{{1}},\overline{{x}}_{{2}},{\quad\text{and}\quad}{s}_{{2}}.$$ (Round your answers to two decimal places.)
(ii) Assume that the hay fever rate in each age group has an approximately normal distribution. Do the data indicate that the age group over 50 has a lower rate of hay fever? Use $$\displaystyle\alpha={0.05}.$$
(a) What is the level of significance?
State the null and alternate hypotheses.
$$\displaystyle{H}_{{0}}:\mu_{{1}}=\mu_{{2}},{H}_{{1}}:\mu_{{1}}<\mu_{{2}}$$
$$\displaystyle{H}_{{0}}:\mu_{{1}}=\mu_{{2}},{H}_{{1}}:\mu_{{1}}>\mu_{{2}}$$
$$\displaystyle{H}_{{0}}:\mu_{{1}}=\mu_{{2}},{H}_{{1}}:\mu_{{1}}\ne\mu_{{2}}$$
$$\displaystyle{H}_{{0}}:\mu_{{1}}>\mu_{{2}},{H}_{{1}}:\mu_{{1}}=\mu_{{12}}$$
(b) What sampling distribution will you use? What assumptions are you making?
The standard normal. We assume that both population distributions are approximately normal with known standard deviations.
The Student's t. We assume that both population distributions are approximately normal with unknown standard deviations,
The standard normal. We assume that both population distributions are approximately normal with unknown standard deviations,
The Student's t. We assume that both population distributions are approximately normal with known standard deviations,
What is the value of the sample test statistic? (Test the difference $$\displaystyle\mu_{{1}}-\mu_{{2}}$$. Round your answer to three decimalplaces.)

What is the value of the sample test statistic? (Test the difference $$\displaystyle\mu_{{1}}-\mu_{{2}}$$. Round your answer to three decimal places.)
(c) Find (or estimate) the P-value.
P-value $$\displaystyle>{0.250}$$
$$\displaystyle{0.125}<{P}-\text{value}<{0},{250}$$
$$\displaystyle{0},{050}<{P}-\text{value}<{0},{125}$$
$$\displaystyle{0},{025}<{P}-\text{value}<{0},{050}$$
$$\displaystyle{0},{005}<{P}-\text{value}<{0},{025}$$
P-value $$\displaystyle<{0.005}$$
Sketch the sampling distribution and show the area corresponding to the P-value.
P.vaiue Pevgiue
P-value f P-value

In an experiment designed to study the effects of illumination level on task performance (“Performance of Complex Tasks Under Different Levels of Illumination,” J. Illuminating Eng., 1976: 235–242), subjects were required to insert a fine-tipped probe into the eyeholes of ten needles in rapid succession both for a low light level with a black background and a higher level with a white background. Each data value is the time (sec) required to complete the task.
$$\begin{array}{|c|c|} \hline Subject & (1) & (2) & (3) & (4) & (5) &(6) & (7) & (8) & (9) \\ \hline Black & 25.85 & 28.84 & 32.05 & 25.74 & 20.89 & 41.05 & 25.01 & 24.96 & 27.47 \\ \hline White & 18.28 & 20.84 & 22.96 & 19.68 & 19.509 & 24.98 & 16.61 & 16.07 & 24.59 \\ \hline \end{array}$$
Does the data indicate that the higher level of illumination yields a decrease of more than 5 sec in true average task completion time? Test the appropriate hypotheses using the P-value approach.

The following quadratic function in general form, $$\displaystyle{S}{\left({t}\right)}={5.8}{t}^{2}—{81.2}{t}+{1200}$$ models the number of luxury home sales, S(t), in a major Canadian urban area, according to statistical data gathered over a 12 year period. Luxury home sales are defined in this market as sales of properties worth over \$3 Million (inflation adjusted). In this case, $$\displaystyle{\left\lbrace{t}\right\rbrace}={\left\lbrace{0}\right\rbrace}\ \text{represents}\ {2000}{\quad\text{and}\quad}{\left\lbrace{t}\right\rbrace}={\left\lbrace{11}\right\rbrace}$$represents 2011. Use a calculator to find the year when the smallest number of luxury home sales occurred. Without sketching the function, interpret the meaning of this function, on the given practical domain, in one well-expressed sentence.
5). $$\begin{array}{|c|c|}& P(x)\\ Left & 0.636 \\ Right & 0.304 \\ No\ preference & 0.060\end{array}$$