Conditional Probability and Independence nonsense in a problemThe statement:Suppose that a patient tests positive for a disease affecting 1% of the population. For a patient who has the disease, there is a 95% chance of testing positive, and for a patient who doesn't has the disease, there is a 95% chance of testing negative. The patient gets a second, independent, test done, and again tests positive. Find the probability that the patient has the disease.The problem:I can solve this problem, but I'm unable to understand what is wrong with the following:Let T i be the event that the patient tests positive in the i-th test, and let D be the event that the patient has the disease.The problem says that P ( T 1 , T 2 ) = 0.95 2 ∗ 0.01 + 0.05 2 ∗ 0.99 = 0.0115, because the tests are independent.By law of total probability we know that: P ( T 1 , T 2 ) = 0.95 2 ∗ 0.01 + 0.05 2 ∗ 0.99 = 0.0115Replacing, and assuming conditional independence given D, we have: P ( T 1 , T 2 ) = 0.95 2 ∗ 0.01 + 0.05 2 ∗ 0.99 = 0.0115This is the correct result, but now let's consider that: P ( T 1 , T 2 ) = P ( T 1 ) 2 We know that P ( T 1 , T 2 ) = P ( T 1 ) 2 for all i because of symmetry, so we have P ( T 1 , T 2 ) = P ( T 1 ) 2 . Again, by law of total probability: P ( T 1 ) = 0.95 ∗ 0.01 + 0.05 ∗ 0.99 ≈ 0.059 P ( T 1 ) = 0.95 ∗ 0.01 + 0.05 ∗ 0.99 ≈ 0.059So we have: P ( T 1 , T 2 ) = P ( T 1 ) 2 ≈ 0.059 2 ≈ 0.003481The second approach is wrong, but it seems legitimate to me, and I'm unable to find what's wrong.Thank's for your help, you make self studying easier.

Question

Conditional Probability and Independence nonsense in a problemThe statement:Suppose that a patient tests positive for a disease affecting 1% of the population. For a patient who has the disease, there is a 95% chance of testing positive, and for a patient who doesn&#039;t has the disease, there is a 95% chance of testing negative. The patient gets a second, independent, test done, and again tests positive. Find the probability that the patient has the disease.The problem:I can solve this problem, but I&#039;m unable to understand what is wrong with the following:Let       T    i   be the event that the patient tests positive in the i-th test, and let D be the event that the patient has the disease.The problem says that   P  (      T    1    ,      T    2    )  =      0.95    2    ∗  0.01  +      0.05    2    ∗  0.99  =  0.0115, because the tests are independent.By law of total probability we know that:  P  (      T    1    ,      T    2    )  =      0.95    2    ∗  0.01  +      0.05    2    ∗  0.99  =  0.0115Replacing, and assuming conditional independence given D, we have:  P  (      T    1    ,      T    2    )  =      0.95    2    ∗  0.01  +      0.05    2    ∗  0.99  =  0.0115This is the correct result, but now let&#039;s consider that:  P  (      T    1    ,      T    2    )  =  P  (      T    1        )    2  We know that   P  (      T    1    ,      T    2    )  =  P  (      T    1        )    2   for all i because of symmetry, so we have   P  (      T    1    ,      T    2    )  =  P  (      T    1        )    2  . Again, by law of total probability:  P  (      T    1    )  =  0.95  ∗  0.01  +  0.05  ∗  0.99  ≈  0.059  P  (      T    1    )  =  0.95  ∗  0.01  +  0.05  ∗  0.99  ≈  0.059So we have:  P  (      T    1    ,      T    2    )  =  P  (      T    1        )    2    ≈      0.059    2    ≈  0.003481The second approach is wrong, but it seems legitimate to me, and I&#039;m unable to find what&#039;s wrong.Thank&#039;s for your help, you make self studying easier.

Kennedi Lawson · Accepted Answer

The patient gets a second, independent, test done ...The word &quot;independent&quot; as used in the question does not refer to statistical independence. It refers to the second test being done without any direct causal influence from the first test. The lack of a causal relationship between the two tests does not imply the lack of a correlation between them (in fact, the results of the two tests are quite positively correlated with each other).Incidentally, the (fallacious) idea that correlation implies causation is equivalent (by contrapositive) to the (also fallacious)idea that a lack of causation implies a lack of correlation.Perhaps the folks over at the English Language &amp;amp; Usage Stack Exchange can clarify the non-mathematical use of the word &quot;independent&quot; better than I can.We know that for any two events, regardless of independence, the following is true:P(T1∩T2)=P(T1)P(T2|T1)Suppose, for contradiction, that T1 and T2 are statistically independent. By definition,P(T1∩T2)=P(T1)P(T2)In order for the first and second equations above to both be satisfied, it must be the case thatP(T2|T1)=P(T2)This is a contradiction. If the first test is positive, the second test clearly must be more likely to also be positive. Therefore,P(T2|T1)≠P(T2)⟹P(T1∩T2)≠P(T1)P(T2)⟹T1 and T2 are not independent The original problem, which asks for the value of P(D|T1,T2), can be solved using Bayes&#039; theorem, and no assumption about the statistical independence between T1 and T2 is necessary.

Answered question

Answer & Explanation

New Questions in Inferential Statistics