Statistics and Probability Examples Have Never Been So Accessible

Joglxym 2022-11-04

What is the shape of a chi-squared distribution?

Abdiel Mays 2022-11-04

In the example experiment on eye-witness memory of a car accident, if the question they used ("Was there shattered glass?") had been too easy and all participants got it right, this would have compromised the:
A. internal validity of the dependent variable
B. construct validity of the independent variable
C. internal validity of the independent variable
D. construct validity of the dependent variable

charmbraqdy 2022-11-04

What is the difference between $\frac{1}{N} \sum \frac{y_{i}}{x_{i}}$ and $\sum \frac{\bar{y}}{\bar{x}}$
I have a data set and I was looking at the population ratio and trying to estimate it using different methods. I was expecting eq1 and eq2 to be different but was very surprised that it was by a factor of almost 3x. I was just wondering why it was so different, is it partly due to n cancelling on eq2. Any explanation would be appreciated.

Madison Costa 2022-11-04

Why is the statistic $t = r \sqrt{\frac{n - 2}{1 - r^{2}}}$ $\approx t (n - 2)$ ?

atgnybo4fq 2022-11-04

Determining sample size of a set of boolean data where the probability is not 50%
I'll lay out the problem as a simplified puzzle of what I am attempting to calculate. I imagine some of this may seem fairly straightforward to many but I'm starting to get a bit lost in my head while trying to think through the problem.
Let's say I roll a 1000-sided die until it lands on the number 1. Let's say it took me 700 rolls to get there. I want to prove that the first 699 rolls were not number 1 and obviously the only way to deterministically do this is to include the first 699 failures as part of the result to show they were in fact "not 1".
However, that's a lot of data I would need to prove this. I would have to include all 700 rolls, which is a lot. Therefore, I want to probabilistically demonstrate the fact that I rolled 699 "not 1s" prior to rolling a 1. To do this, I decide I will randomly sample my "not 1" rolls to reduce the set to a statistically significant, yet more wieldy number. It will be good enough to demonstrate that I very probably did not roll a 1 prior to roll 700.
Here are my current assumptions about the state of this problem:
- My initial experiment of rolling until success is one of geometric distribution.
- However my goal for this problem is to demonstrate to a third party that I am not lying, therefore the skeptical third party is not concerned with geometric distribution but would view this simply as a binomial distribution problem.
A lot of sample size calculators exist on the web. They are all based around binomial distribution from what I can tell. So here's the formula I am considering:
$n = \frac{N \times X}{X + N - 1}$
$X = \frac{{Z_{α / 2}}^{2} \times p \times (1 - p)}{{M O E}^{2}}$
n is sample size
N is population size
Z is critical value ( $α$ is $1 - c o n f i d e n c e l e v e l a s p r o b a b i l i t y$ )
p is sample proportion
MOE is margin of error
As an aside, the website where I got this formula says it implements "finite population correction", is this desirable for my requirements?
Here is the math executed on my above numbers. I will use $Z_{a / 2} = 2.58$ for $α = 0.01$ , $p = 0.001$ and $M O E = 0.005$ . As stated above, $N = 699$ on account of there being 699 failure cases that I would like to sample with a certain level of confidence.
Based on my understanding, what this math will do is recommend a sample size that will show, with 99% confidence, that the sample result is within 0.5 percentage points of reality.
Doing the math, $X = 265.989744$ and $n = 192.8722086653 \approx 193$ , implying that I can have a sample size of 193 to fulfill this confidence level and interval.
My main question is whether my assumption about $p = \frac{1}{1000}$ is valid. If it's not, and I use the conservative $p = 0.5$ , then my sample size shoots up to $\approx 692$ . So I would like to know if my assumptions about what sample proportion actually is are correct.
More broadly, am I on the right track at all with this? From my attempt at demonstrating this probabilistically to my current thought process, is any of this accurate at all?

Annie French 2022-11-03

Grassmannians are a pretty useful subject in numerous fields of mathematics (and physics). In fact, it was the first non-trivial higher-dimensional example that was given in an introductory projective geometry course during my education.
Later I learned you can use them to define universal bundles and that they are playing a role in higher-dimensional geometry and topology. Though I have never came across a book or a survey article on the geometry and topology of those beasts. The field is a little wide, so let me specify what I am interested in:
Topology and Geometry of Grassmannians $G_{k} (R^{n})$ or $G_{k} (C^{n})$
Connections with bundle and obstruction theory.
Differential Topology of $G_{k} (R^{n})$ or $G_{k} (C^{n})$ (for instance, are there exotic Grassmannians).
Homotopy Theory of $G_{k} (R^{n})$ or $G_{k} (C^{n})$ .
Algebraic Geometry of $G_{k} (V)$ , where V is a n-dimensional vectorspace over a (possible characteristic $\neq 0$ field F)

Amy Bright 2022-11-03

Which of the following statements is true about the t-distribution?
a) For large sample sizes, the t-distribution has the same properties as the normal curve.
b) For small sample sizes, the t-distribution has the same properties as the normal curve.
c) Like the Normal distribution, the t-distribution is symmetric for small n.
d) Since population standard deviation is usually unknown, the standard error uses the sample standard deviation to estimate population standard deviation.

Alvin Parks 2022-11-03

How do you find the exact value of $\sin^{- 1} (\sin (\frac{π}{5}))$ ?

drzwiczkih5a 2022-11-03

Let $X i$ be a random variable distributed as $N (i, i^{2}), i = 1, 2, 3$ . As-sume that the random variables $X_{1}, X_{2}$ and $X_{3}$ are independent. Using only the three random variables $X_{1}, X_{2}$ and $X_{3}$ give an example of a statistic that has a $t$ distribution with two degrees of freedom.

tramolatzqvg 2022-11-03

Finding the Expected Value of $T =$ $\sum X_{i}^{2}$

clealtAfforcewug 2022-11-03

Bayesian Statistics - Basic question about prior
I try to get an understanding of bayesian statistics. My intuition tells me that in the expression for the posterior
$p (ϑ | x) = \frac{p (x | ϑ) p (ϑ)}{\int_{Θ} p (x | θ) p (θ) d θ}$
the term $p (ϑ)$ is the marginal distribution of the likelihood-function $p (ϑ, x)$ . It is obtained by
$p (ϑ) = \int_{X} p (ϑ | x) p_{X} (x) d x$
where $p_{X} (x)$ should be the marginal distribution of the Observable data. Does that make sense?
To this point it makes sense with this example: Offering somebody a car insurance without knowing the person's style of driving (determined by $ϑ \in Θ$ ) to feed some statistical model, we still can make use of the nation's car-crash statistics as our prior, which is a pdf on $Θ$ . That would be the marginal distribution of the "driving styles" across the population.
Maybe I am just oversimplifying here, because my resources did not mention this.

Anton Huynh 2022-11-03

Let ${X i} \sim N (i θ, 1)$ for $i = 1, . . . ., n$ be an independent, but not identically distributed sample. Check that $T = \sum_{i} X_{i}$ it is a sufficient statistic for $θ$ .

Jonas Huff 2022-11-03

When conducting a hypothesis test for the population mean, you will not know the population standard deviation, so you will have to use the sample standard deviation instead. How will this affect the process?

Kailyn Hamilton 2022-11-03

How do you find the exact value for $\cos 240$ ?

Uriah Molina 2022-11-02

Derive an expression for the $p$ -value using a test with test statistic $T = \sqrt{n} ({\bar{X}}_{n} - θ_{0}) / σ$

Alberto Calhoun 2022-11-02

Derive the Cramer von Mises test statistic
$n C_{n} = \frac{1}{12 n} + \sum_{i = 1}^{n} {(U_{(i)} - \frac{2 i - 1}{2 n})}^{2}$
where $U_{(i)} = F_{0} (X_{(i)})$ the order statistics

linnibell17591 2022-11-02

How do you use a power series to find the exact value of the sum of the series $1 - \frac{(\frac{π}{4})^{2}}{2!} + \frac{(\frac{π}{4})^{4}}{4!} - \frac{(\frac{π}{4})^{6}}{6!} +$ ...?

kituoti126 2022-11-02

Solve PDE using method of characteristics with non-local boundary conditions.
Given the population model by the following linear first order PDE in u(a,t) with constants b and $μ$ :
$u_{a} + u_{t} = - μ t u a, t > 0$
$u (a, 0) = u_{0} (a) a \geq 0$
$u (0, t) = F (t) = b \int_{0}^{\infty} u (a, t) d a$
We can split the integral in two with our non-local boundary data:
$F (t) = b \int_{0}^{t} u (a, t) d a + b \int_{t}^{\infty} u (a, t) d a$
Choosing the characteristic coordinates $(ξ, τ)$ and re-arranging the expression to form the normal to the solution surface we have the following equation with initial conditions:
$(u_{a}, u_{t}, - 1) ∙ (1, 1, - μ t u) = 0$
$x (0) = ξ, t (0) = 0, u (0) = u_{0} (ξ)$
Characteristic equations:
$\frac{d a}{d τ} = 1, \frac{d t}{d τ} = 1, \frac{d u}{d τ} = - μ t u$
Solving each of these ODE's in $τ$ gives the following:
$(1) \int d a = \int d τ (2) \int d t = \int d τ (3) \int d u = - \int μ t u d τ$
$a = τ + F (ξ) t = τ + F (ξ)$
$∴ a = τ + ξ ∴ t = τ$
$\int d u = - \int μ τ u d τ$
$\int \frac{1}{u} d u = - \int μ τ d τ$
$\ln u = - \frac{1}{2} μ τ^{2} + F (ξ)$
$u = G (ξ) e^{- \frac{1}{2} μ τ^{2}}$
$∴ u = u_{0} (ξ) e^{- \frac{1}{2} μ τ^{2}}$
Substituting back the original coordinates we can re-write this expression with a coordinate change:
$ξ = a - t τ = t$
$∴ u (a, t) = u_{0} (a - t) e^{- \frac{1}{2} t^{2}}$
Now this is where I get stuck, how do I use the boundary data to come up with a well-posed solution?
$u (0, t) = u_{0} (- t) e^{- \frac{1}{2} μ t^{2}} = b \int_{0}^{t} u (a, t) d a + b \int_{t}^{\infty} u (a, t) d a$

Emmanuel Giles 2022-11-02

Using a "population" consisting of probabilities to predict accuracy of sample
Since I'm not sure if the title explains my question well enough I've come up with an example myself:
Let's say I live in a country where every citizen goes to work everyday and every citizen has the choice to go by bus or by train (every citizen makes this choice everyday again - there are almost no citizens who always go by train and never by bus, and vice-versa).
I've done a lot of sampling and I have data on one million citizens about their behaviour in the past 1000 days. So, I calculate the "probability" per citizen of going by train on a single day. I can also calculate the average of those calculated probabilities of all citizens, let's say the average probability of a citizen going by train is 0.27. I figured that most citizens have tendencies around this number (most citizens have an individual probability between 0.22 and 0.32 of going by train for example).
Now, I started sampling an unknown person (but known to be living in the same country) and after asking him 10 consecutive days whether he went by train or by bus, I know that this person went to his work by train 4 times, and by bus 6 times.
My final question: how can I use my (accurate) data on one million citizens to approximate this person's probability of going by train?
I know that if I do the calculation the other way around, so, calculate the probability of this event occurring given the fact that I know this person's REAL probability is 0.4 this results in: ${0.4}^{4} \cdot {0.6}^{6} \cdot 10 C 4 =\sim 25 %$ . I could calculate this probability for all possible probabilities between 0.00 and 1.00 (so, $0 % - 100 %$ without any numbers in between) and sum them all, which sums to about 910%. I could set this to 100% (dividing by 9.1) and set all other percentages accordingly (dividing everything by 9.1 - so, our 25% becomes ~2.75%) and come up with a weighted sum: $2.75 % \cdot 0.4 + X % \cdot 0.41$ etc., but this must be wrong since I'm not taking my accurate samples of the population into account.

assupecoitteem81 2022-11-02

Let $X 1, . . ., X n$ be independent and identically distributed with density $P_{θ} (x) =$
${\begin{cases} 2 x / θ^{2} & for 0 \leq x < θ \\ 0 & else \end{cases}$
Demonstrate that $m_{n} = m a x (x_{1}, . . ., x_{n})$ is a sufficient statistic .

Master Statistics and Probability Problems with Expert Help