# Data is being processed from 35 data sources. Each dataset may or may or net generate dataset at the beginning of each timeslot, the probability that any individual source actually generates a dataset is 0.004,and the data sources are independent. In each time slot we can process up to two datasets. Now, the processing is real time with the processed results only being considered if they are done within the time slot. Determine the probability that all incoming datasets can be processed in any particular time slot?

Question
Probability
Data is being processed from 35 data sources. Each dataset may or may or net generate dataset at the beginning of each timeslot, the probability that any individual source actually generates a dataset is 0.004,and the data sources are independent. In each time slot we can process up to two datasets. Now, the processing is real time with the processed results only being considered if they are done within the time slot. Determine the probability that all incoming datasets can be processed in any particular time slot?

2021-01-29
Lex X be the number of generated dataset. Then X is a binomial random variable with parameters n=35 and p=0.004. Therefore,
$$\displaystyle{P}{\left({X}={x}\right)}={\left(\frac{{35}}{{x}}\right)}{0.004}^{{x}}\cdot{0.996}^{{35}}-{x}$$
We need to find $$\displaystyle{P}{\left({X}\le{2}\right)}$$ (we can process up to two datasets). Now we compute:
PSKP(X<=2)=P(X=0)+P(X=1)+P(X=2) =(35/0)0.004^0*(0.996^35-0)+(35/1)0.004^1*(0.996^35-1)+(35/2)0.004^2*0.996^35-2 ~ 0.9996194ZSK

### Relevant Questions

The table below shows the number of people for three different race groups who were shot by police that were either armed or unarmed. These values are very close to the exact numbers. They have been changed slightly for each student to get a unique problem.
Suspect was Armed:
Black - 543
White - 1176
Hispanic - 378
Total - 2097
Suspect was unarmed:
Black - 60
White - 67
Hispanic - 38
Total - 165
Total:
Black - 603
White - 1243
Hispanic - 416
Total - 2262
Give your answer as a decimal to at least three decimal places.
a) What percent are Black?
b) What percent are Unarmed?
c) In order for two variables to be Independent of each other, the P $$(A and B) = P(A) \cdot P(B) P(A and B) = P(A) \cdot P(B).$$
This just means that the percentage of times that both things happen equals the individual percentages multiplied together (Only if they are Independent of each other).
Therefore, if a person's race is independent of whether they were killed being unarmed then the percentage of black people that are killed while being unarmed should equal the percentage of blacks times the percentage of Unarmed. Let's check this. Multiply your answer to part a (percentage of blacks) by your answer to part b (percentage of unarmed).
Remember, the previous answer is only correct if the variables are Independent.
d) Now let's get the real percent that are Black and Unarmed by using the table?
If answer c is "significantly different" than answer d, then that means that there could be a different percentage of unarmed people being shot based on race. We will check this out later in the course.
Let's compare the percentage of unarmed shot for each race.
e) What percent are White and Unarmed?
f) What percent are Hispanic and Unarmed?
If you compare answers d, e and f it shows the highest percentage of unarmed people being shot is most likely white.
Why is that?
This is because there are more white people in the United States than any other race and therefore there are likely to be more white people in the table. Since there are more white people in the table, there most likely would be more white and unarmed people shot by police than any other race. This pulls the percentage of white and unarmed up. In addition, there most likely would be more white and armed shot by police. All the percentages for white people would be higher, because there are more white people. For example, the table contains very few Hispanic people, and the percentage of people in the table that were Hispanic and unarmed is the lowest percentage.
Think of it this way. If you went to a college that was 90% female and 10% male, then females would most likely have the highest percentage of A grades. They would also most likely have the highest percentage of B, C, D and F grades
The correct way to compare is "conditional probability". Conditional probability is getting the probability of something happening, given we are dealing with just the people in a particular group.
g) What percent of blacks shot and killed by police were unarmed?
h) What percent of whites shot and killed by police were unarmed?
i) What percent of Hispanics shot and killed by police were unarmed?
You can see by the answers to part g and h, that the percentage of blacks that were unarmed and killed by police is approximately twice that of whites that were unarmed and killed by police.
j) Why do you believe this is happening?
Do a search on the internet for reasons why blacks are more likely to be killed by police. Read a few articles on the topic. Write your response using the articles as references. Give the websites used in your response. Your answer should be several sentences long with at least one website listed. This part of this problem will be graded after the due date.
In 1985, neither Florida nor Georgia had laws banning open alcohol containers in vehicle passenger compartments. By 1990, Florida had passed such a law, but Georgia had not.
(i) Suppose you can collect random samples of the driving-age population in both states, for 1985 and 1990. Let arrest be a binary variable equal to unity if a person was arrested for drunk driving during the year. Without controlling for any other factors, write down a linear probability model that allows you to test whether the open container law reduced the probability of being arrested for drunk driving. Which coefficient in your model measures the effect of the law?
(ii) Why might you want to control for other factors in the model? What might some of these factors be?
(iii) Now, suppose that you can only collect data for 1985 and 1990 at the county level for the two states. The dependent variable would be the fraction of licensed drivers arrested for drunk driving during the year. How does this data structure differ from the individual-level data described in part (i)? What econometric method would you use?
Give a full and correct answer Why is it important that a sample be random and representative when conducting hypothesis testing? Representative Sample vs. Random Sample: An Overview Economists and researchers seek to reduce sampling bias to near negligible levels when employing statistical analysis. Three basic characteristics in a sample reduce the chances of sampling bias and allow economists to make more confident inferences about a general population from the results obtained from the sample analysis or study: * Such samples must be representative of the chosen population studied. * They must be randomly chosen, meaning that each member of the larger population has an equal chance of being chosen. * They must be large enough so as not to skew the results. The optimal size of the sample group depends on the precise degree of confidence required for making an inference. Representative sampling and random sampling are two techniques used to help ensure data is free of bias. These sampling techniques are not mutually exclusive and, in fact, they are often used in tandem to reduce the degree of sampling error in an analysis and allow for greater confidence in making statistical inferences from the sample in regard to the larger group. Representative Sample A representative sample is a group or set chosen from a larger statistical population or group of factors or instances that adequately replicates the larger group according to whatever characteristic or quality is under study. A representative sample parallels key variables and characteristics of the large society under examination. Some examples include sex, age, education level, socioeconomic status (SES), or marital status. A larger sample size reduced sampling error and increases the likelihood that the sample accurately reflects the target population. Random Sample A random sample is a group or set chosen from a larger population or group of factors of instances in a random manner that allows for each member of the larger group to have an equal chance of being chosen. A random sample is meant to be an unbiased representation of the larger population. It is considered a fair way to select a sample from a larger population since every member of the population has an equal chance of getting selected. Special Considerations: People collecting samples need to ensure that bias is minimized. Representative sampling is one of the key methods of achieving this because such samples replicate as closely as possible elements of the larger population under study. This alone, however, is not enough to make the sampling bias negligible. Combining the random sampling technique with the representative sampling method reduces bias further because no specific member of the representative population has a greater chance of selection into the sample than any other. Summarize this article in 250 words.
A two-sample inference deals with dependent and independent inferences. In a two-sample hypothesis testing problem, underlying parameters of two different populations are compared. In a longitudinal (or follow-up) study, the same group of people is followed over time. Two samples are said to be paired when each data point in the first sample is matched and related to a unique data point in the second sample.
This problem demonstrates inference from two dependent (follow-up) samples using the data from the hypothetical study of new cases of tuberculosis (TB) before and after the vaccination was done in several geographical areas in a country in sub-Saharan Africa. Conclusion about the null hypothesis is to note the difference between samples.
The problem that demonstrates inference from two dependent samples uses hypothetical data from the TB vaccinations and the number of new cases before and after vaccination. PSK\begin{array}{|c|c|} \hline Geographical\ regions & Before\ vaccination & After\ vaccination\\ \hline 1 & 85 & 11\\ \hline 2 & 77 & 5\\ \hline 3 & 110 & 14\\ \hline 4 & 65 & 12\\ \hline 5 & 81 & 10\\\hline 6 & 70 & 7\\ \hline 7 & 74 & 8\\ \hline 8 & 84 & 11\\ \hline 9 & 90 & 9\\ \hline 10 & 95 & 8\\ \hline \end{array}ZSK
Using the Minitab statistical analysis program to enter the data and perform the analysis, complete the following: Construct a one-sided $$\displaystyle{95}\%$$ confidence interval for the true difference in population means. Test the null hypothesis that the population means are identical at the 0.05 level of significance.
Several models have been proposed to explain the diversification of life during geological periods. According to Benton (1997), The diversification of marine families in the past 600 million years (Myr) appears to have followed two or three logistic curves, with equilibrium levels that lasted for up to 200 Myr. In contrast, continental organisms clearly show an exponential pattern of diversification, and although it is not clear whether the empirical diversification patterns are real or are artifacts of a poor fossil record, the latter explanation seems unlikely. In this problem, we will investigate three models fordiversification. They are analogous to models for populationgrowth, however, the quantities involved have a differentinterpretation. We denote by N(t) the diversification function,which counts the number of taxa as a function of time, and by rthe intrinsic rate of diversification.
(a) (Exponential Model) This model is described by $$\displaystyle{\frac{{{d}{N}}}{{{\left.{d}{t}\right.}}}}={r}_{{{e}}}{N}\ {\left({8.86}\right)}.$$ Solve (8.86) with the initial condition N(0) at time 0, and show that $$\displaystyle{r}_{{{e}}}$$ can be estimated from $$\displaystyle{r}_{{{e}}}={\frac{{{1}}}{{{t}}}}\ {\ln{\ }}{\left[{\frac{{{N}{\left({t}\right)}}}{{{N}{\left({0}\right)}}}}\right]}\ {\left({8.87}\right)}$$
(b) (Logistic Growth) This model is described by $$\displaystyle{\frac{{{d}{N}}}{{{\left.{d}{t}\right.}}}}={r}_{{{l}}}{N}\ {\left({1}\ -\ {\frac{{{N}}}{{{K}}}}\right)}\ {\left({8.88}\right)}$$ where K is the equilibrium value. Solve (8.88) with the initial condition N(0) at time 0, and show that $$\displaystyle{r}_{{{l}}}$$ can be estimated from $$\displaystyle{r}_{{{l}}}={\frac{{{1}}}{{{t}}}}\ {\ln{\ }}{\left[{\frac{{{K}\ -\ {N}{\left({0}\right)}}}{{{N}{\left({0}\right)}}}}\right]}\ +\ {\frac{{{1}}}{{{t}}}}\ {\ln{\ }}{\left[{\frac{{{N}{\left({t}\right)}}}{{{K}\ -\ {N}{\left({t}\right)}}}}\right]}\ {\left({8.89}\right)}$$ for $$\displaystyle{N}{\left({t}\right)}\ {<}\ {K}.$$
(c) Assume that $$\displaystyle{N}{\left({0}\right)}={1}$$ and $$\displaystyle{N}{\left({10}\right)}={1000}.$$ Estimate $$\displaystyle{r}_{{{e}}}$$ and $$\displaystyle{r}_{{{l}}}$$ for both $$\displaystyle{K}={1001}$$ and $$\displaystyle{K}={10000}.$$
(d) Use your answer in (c) to explain the following quote from Stanley (1979): There must be a general tendency for calculated values of $$\displaystyle{\left[{r}\right]}$$ to represent underestimates of exponential rates,because some radiation will have followed distinctly sigmoid paths during the interval evaluated.
(e) Explain why the exponential model is a good approximation to the logistic model when $$\displaystyle\frac{{N}}{{K}}$$ is small compared with 1.
At what age do babies learn to crawl? Does it take longer to learn in the winter when babies are often bundled in clothes that restrict their movement? Data were collected from parents who brought their babies into the University of Denver Infant Study Center to participate in one of a number of experiments between 1988 and 1991. Parents reported the birth month and the age at which their child was first able to creep or crawl a distance of 4 feet within 1 minute. The resulting data were grouped by month of birth: January, May, and September: $$\displaystyle{b}{e}{g}\in{\left\lbrace{a}{r}{r}{a}{y}\right\rbrace}{\left\lbrace{c}\right\rbrace}&{C}{r}{a}{w}{l}\in{g}\ {a}\ge\backslash{h}{l}\in{e}{B}{i}{r}{t}{h}\ {m}{o}{n}{t}{h}&{M}{e}{a}{n}&{S}{t}.{d}{e}{v}.&{n}\backslash{h}{l}\in{e}{J}{a}\nu{a}{r}{y}&{29.84}&{7.08}&{32}\backslash{M}{a}{y}&{28.58}&{8.07}&{27}\backslash{S}{e}{p}{t}{e}{m}{b}{e}{r}&{33.83}&{6.93}&{38}{e}{n}{d}{\left\lbrace{a}{r}{r}{a}{y}\right\rbrace}$$ Crawling age is given in weeks. Assume the data represent three independent simple random samples, one from each of the three populations consisting of babies born in that particular month, and that the populations of crawling ages have Normal distributions. A partial ANOVA table is given below. $$\displaystyle{b}{e}{g}\in{\left\lbrace{a}{r}{r}{a}{y}\right\rbrace}{\left\lbrace{c}\right\rbrace}{S}{o}{u}{r}{c}{e}&{S}{u}{m}\ {o}{f}\ \boxempty{s}&{D}{F}&{M}{e}{a}{n}\ \boxempty\ {F}\backslash{h}{l}\in{e}{G}{r}{o}{u}{p}{s}&{505.26}\backslash{E}{r}{r}{\quad\text{or}\quad}&&&{53.45}\backslash{T}{o}{t}{a}{l}{e}{n}{d}{\left\lbrace{a}{r}{r}{a}{y}\right\rbrace}$$ What are the degrees of freedom for the groups term?
Is statistical inference intuitive to babies? In other words, are babies able to generalize from sample to population? In this study,1 8-month-old infants watched someone draw a sample of five balls from an opaque box. Each sample consisted of four balls of one color (red or white) and one ball of the other color. After observing the sample, the side of the box was lifted so the infants could see all of the balls inside (the population). Some boxes had an “expected” population, with balls in the same color proportions as the sample, while other boxes had an “unexpected” population, with balls in the opposite color proportion from the sample. Babies looked at the unexpected populations for an average of 9.9 seconds (sd = 4.5 seconds) and the expected populations for an average of 7.5 seconds (sd = 4.2 seconds). The sample size in each group was 20, and you may assume the data in each group are reasonably normally distributed. Is this convincing evidence that babies look longer at the unexpected population, suggesting that they make inferences about the population from the sample? Let group 1 and group 2 be the time spent looking at the unexpected and expected populations, respectively. A) Calculate the relevant sample statistic. Enter the exact answer. Sample statistic: _____ B) Calculate the t-statistic. Round your answer to two decimal places. t-statistic = ___________ C) Find the p-value. Round your answer to three decimal places. p-value =
The accompanying two-way table was constructed using data in the article “Television Viewing and Physical Fitness in Adults” (Research Quarterly for Exercise and Sport, 1990: 315–320). The author hoped to determine whether time spent watching television is associated with cardiovascular fitness. Subjects were asked about their television-viewing habits and were classified as physically fit if they scored in the excellent or very good category on a step test. We include MINITAB output from a chi-squared analysis. The four TV groups corresponded to different amounts of time per day spent watching TV (0, 1–2, 3–4, or 5 or more hours). The 168 individuals represented in the first column were those judged physically fit. Expected counts appear below observed counts, and MINITAB displays the contribution to $$\displaystyle{x}^{{{2}}}$$ from each cell.
State and test the appropriate hypotheses using $$\displaystyle\alpha={0.05}$$
$$\displaystyle{b}{e}{g}\in{\left\lbrace{a}{r}{r}{a}{y}\right\rbrace}{\left\lbrace{\left|{c}\right|}{c}{\mid}\right\rbrace}{h}{l}\in{e}&{a}\mp,\ {1}&{a}\mp,\ {2}&{a}\mp,\ {T}{o}{t}{a}{l}\backslash{h}{l}\in{e}{1}&{a}\mp,\ {35}&{a}\mp,\ {147}&{a}\mp,\ {182}\backslash{h}{l}\in{e}&{a}\mp,\ {25.48}&{a}\mp,\ {156.52}&{a}\mp,\backslash{h}{l}\in{e}{2}&{a}\mp,\ {101}&{a}\mp,\ {629}&{a}\mp,\ {730}\backslash{h}{l}\in{e}&{a}\mp,\ {102.20}&{a}\mp,\ {627.80}&{a}\mp,\backslash{h}{l}\in{e}{3}&{a}\mp,\ {28}&{a}\mp,\ {222}&{a}\mp,\ {250}\backslash{h}{l}\in{e}&{a}\mp,\ {35.00}&{a}\mp,\ {215.00}&{a}\mp,\backslash{h}{l}\in{e}{4}&{a}\mp,\ {4}&{a}\mp,\ {34}&{a}\mp,\ {38}\backslash{h}{l}\in{e}&{a}\mp,\ {5.32}&{a}\mp,\ {32.68}&{a}\mp,\backslash{h}{l}\in{e}{T}{o}{t}{a}{l}&{a}\mp,\ {168}&{a}\mp,\ {1032}&{a}\mp,\ {1200}\backslash{h}{l}\in{e}$$
$$\displaystyle{C}{h}{i}{s}{q}={a}\mp,\ {3.557}\ +\ {0.579}\ +\ {a}\mp,\ {0.014}\ +\ {0.002}\ +\ {a}\mp,\ {1.400}\ +\ {0.228}\ +\ {a}\mp,\ {0.328}\ +\ {0.053}={6.161}$$
$$\displaystyle{d}{f}={3}$$
Two basketball players are essentially equal in all respects. In particular, by jumping they can raise their centers of mass the same vertical distance, H. The first player,Arabella, wishes to shoot over the second player, Boris, and forthis she needs to be as high above Boris as possible. Arabella Jumps at time t=0, and Boris jumps later, at time $$\displaystyle{t}_{{R}}$$(his reaction time). Assume that Arabella has not yet reached her maximum height when Boris jumps.
Part A.) Find the vertical displacement $$\displaystyle{D}{\left({t}\right)}={h}_{{A}}{\left({t}\right)}-{h}_{{B}}{\left({t}\right)}$$, as a function of time for the interval $$\displaystyle{0}{<}{t}{<}{t}_{{R}}$$, where $$\displaystyle{h}_{{A}}{\left({t}\right)}$$ is the height of the raised hands of Arabella, while $$\displaystyle{h}_{{B}}{\left({t}\right)}$$ is the height of the raised hands of Boris. (Express thevertical displacement in terms of H,g,and t.)
Part B.) Find the vertical displacement D(t) between the raised hands of the two players for the time period after Boris has jumped ($$\displaystyle{t}{>}{t}_{{R}}$$) but before Arabella has landed. (Express youranswer in terms of t,$$\displaystyle{t}_{{R}}$$, g,and H)