Conditional Probability and Independence nonsense in a problem The statement: Suppose that a patient tests positive for a disease affecting 1% of the population. For a patient who has the disease, there is a 95% chance of testing positive, and for a patient who doesn't has the disease, there is a 95% chance of testing negative. The patient gets a second, independent, test done, and again tests positive. Find the probability that the patient has the disease.

wijii4

wijii4

Answered question

2022-09-17

Conditional Probability and Independence nonsense in a problem
The statement:
Suppose that a patient tests positive for a disease affecting 1% of the population. For a patient who has the disease, there is a 95% chance of testing positive, and for a patient who doesn't has the disease, there is a 95% chance of testing negative. The patient gets a second, independent, test done, and again tests positive. Find the probability that the patient has the disease.
The problem:
I can solve this problem, but I'm unable to understand what is wrong with the following:
Let T i be the event that the patient tests positive in the i-th test, and let D be the event that the patient has the disease.
The problem says that P ( T 1 , T 2 ) = 0.95 2 0.01 + 0.05 2 0.99 = 0.0115, because the tests are independent.
By law of total probability we know that:
P ( T 1 , T 2 ) = 0.95 2 0.01 + 0.05 2 0.99 = 0.0115
Replacing, and assuming conditional independence given D, we have:
P ( T 1 , T 2 ) = 0.95 2 0.01 + 0.05 2 0.99 = 0.0115
This is the correct result, but now let's consider that:
P ( T 1 , T 2 ) = P ( T 1 ) 2
We know that P ( T 1 , T 2 ) = P ( T 1 ) 2 for all i because of symmetry, so we have P ( T 1 , T 2 ) = P ( T 1 ) 2 . Again, by law of total probability:
P ( T 1 ) = 0.95 0.01 + 0.05 0.99 0.059
P ( T 1 ) = 0.95 0.01 + 0.05 0.99 0.059
So we have:
P ( T 1 , T 2 ) = P ( T 1 ) 2 0.059 2 0.003481
The second approach is wrong, but it seems legitimate to me, and I'm unable to find what's wrong.
Thank's for your help, you make self studying easier.

Answer & Explanation

Kennedi Lawson

Kennedi Lawson

Beginner2022-09-18Added 4 answers

The patient gets a second, independent, test done ...
The word "independent" as used in the question does not refer to statistical independence. It refers to the second test being done without any direct causal influence from the first test. The lack of a causal relationship between the two tests does not imply the lack of a correlation between them (in fact, the results of the two tests are quite positively correlated with each other).
Incidentally, the (fallacious) idea that correlation implies causation is equivalent (by contrapositive) to the (also fallacious)idea that a lack of causation implies a lack of correlation.
Perhaps the folks over at the English Language & Usage Stack Exchange can clarify the non-mathematical use of the word "independent" better than I can.
We know that for any two events, regardless of independence, the following is true:
P(T1∩T2)=P(T1)P(T2|T1)
Suppose, for contradiction, that T1 and T2 are statistically independent. By definition,
P(T1∩T2)=P(T1)P(T2)
In order for the first and second equations above to both be satisfied, it must be the case that
P(T2|T1)=P(T2)
This is a contradiction. If the first test is positive, the second test clearly must be more likely to also be positive. Therefore,
P(T2|T1)≠P(T2)⟹P(T1∩T2)≠P(T1)P(T2)⟹T1 and T2 are not independent
The original problem, which asks for the value of P(D|T1,T2), can be solved using Bayes' theorem, and no assumption about the statistical independence between T1 and T2 is necessary.

Do you have a similar question?

Recalculate according to your conditions!

New Questions in Inferential Statistics

Ask your question.
Get an expert answer.

Let our experts help you. Answer in as fast as 15 minutes.

Didn't find what you were looking for?