
Question

The exercise statement (roughly): Assume there is a terrorist prevention system that has a 99% chance of correctly identifying a future terrorist and a 99.9% chance of correctly identifying someone who is not a future terrorist. Suppose there are 1000 future terrorists among a population of 300 million people, and one individual is chosen randomly from the population, then processed by the system and deemed a terrorist. What is the chance that the individual is a future terrorist?

Attempted exercise solution:

I use the following event labels:

$A$ -> The person is a future terrorist

$B$ -> The person is identified as a terrorist

Then, some other data:

$$
\begin{aligned}
P(A) &= \frac{10^3}{3 \cdot 10^8} = \frac{1}{3 \cdot 10^5} \\
P(\bar{A}) &= 1 - P(A) \\
P(B \mid A) &= 0.99 \\
P(\bar{B} \mid A) &= 1 - P(B \mid A) \\
P(\bar{B} \mid \bar{A}) &= 0.999 \\
P(B \mid \bar{A}) &= 1 - P(\bar{B} \mid \bar{A})
\end{aligned}
$$

What I need to find is the chance that someone identified as a terrorist is actually a terrorist. I express that through $P(A \mid B)$ and use Bayes' Theorem to find its value.

$$
P(A \mid B) = \frac{P(A \cap B)}{P(B)} = \frac{P(B \mid A) \cdot P(A)}{P(B \mid A) \cdot P(A) + P(B \mid \bar{A}) \cdot P(\bar{A})}
$$

The answer I get after plugging in all the values is $3.29 \cdot 10^{-3}$; the book's answer is $3.29 \cdot 10^{-4}$.

Can someone help me identify what I'm doing wrong? Also, in either case, I find it very unintuitive that the probability of success is so small. If someone could explain it to me in more intuitive terms I'd be very grateful.
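As a quick sanity check on the plug-in step, the Bayes formula above can be evaluated with exact rational arithmetic. This is only a sketch and not part of the original post: the variable names are illustrative, and the numbers are the ones given in the exercise statement.

```python
from fractions import Fraction

# Values taken from the exercise statement.
p_a = Fraction(1000, 300_000_000)        # P(A): prior probability of a future terrorist
p_b_given_a = Fraction(99, 100)          # P(B | A): chance a future terrorist is flagged
p_notb_given_nota = Fraction(999, 1000)  # P(not B | not A): chance a non-terrorist is cleared
p_b_given_nota = 1 - p_notb_given_nota   # P(B | not A): false positive rate

# Bayes' theorem: P(A | B) = P(B | A) P(A) / P(B), with P(B) from total probability.
numerator = p_b_given_a * p_a
denominator = numerator + p_b_given_nota * (1 - p_a)
posterior = numerator / denominator

print(posterior)         # exact value as a fraction
print(float(posterior))  # decimal approximation, on the order of 1e-3
```

Under these inputs the exact result is 990/300989, which rounds to $3.29 \cdot 10^{-3}$, so the plug-in step in the attempt is consistent with the stated formula.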

faux0101d · Accepted Answer

Step 1

Let's look at the $2 \times 2$ table, but first, let's rewrite the notation so that it is unambiguous what the events mean. Let $F$ be the event that a randomly chosen individual from the population is a future terrorist. Let $T$ be the event that a randomly chosen individual from the population tests positive as a terrorist. Then

$$
\begin{array}{c|cc|c}
 & T & \bar{T} & \text{Total} \\ \hline
F & 990 & 10 & 1000 \\
\bar{F} & 299999 & 299699001 & 299999000 \\ \hline
\text{Total} & 300989 & 299699011 & 300000000
\end{array}
$$

gives the cell frequencies for the entire population. This is found by observing, for instance, that if there are 1000 future terrorists in the population, then a test that has a 99% correct positive rate means that $(0.99)(1000) = 990$ of these 1000 future terrorists would also test positive, and the remaining 10 would be false negatives (future terrorists that the test misses). Similarly, a test that has a 99.9% correct negative rate means that $(0.999)(300 \times 10^6 - 1000) = 299699001$ non-terrorists are correctly identified as such, leaving $299999$ non-terrorists who test positive. Then the column totals are computed for $T$ and $\bar{T}$.

Now it is a trivial exercise to compute the conditional probability $\Pr[F \mid T]$: this is simply $990/300989 \approx 0.00328916$, which agrees with your $3.29 \cdot 10^{-3}$. The book is incorrect.

Step 2

What this exercise demonstrates is that when the prevalence of a particular trait is rare in a population, a diagnostic test to detect whether that trait exists in a randomly selected person must have extremely high specificity in order to have high positive predictive value.

The problem, as you can see from the table, is that the group of positive-testing non-terrorists $T \cap \bar{F}$ is much, much larger than the population of terrorists. Even if the test were 100% sensitive (i.e., it never gave a false negative), all that would do is make the first row 1000, 0, 1000. The number of false positives is 299999, which is overwhelming. You need a test with such high specificity that the chance of incorrectly identifying someone as a terrorist is very, very small.

This situation clearly has ramifications for screening tests for rare diseases, such as HIV: a test is unlikely to be both cost-effective and specific enough to keep the false positive rate acceptably low. Obviously, you really do not want to make available an HIV test that would give such a high false positive rate; it would be emotionally devastating for numerous people, not to mention that it would cause anger and suspicion about the usefulness of testing.
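The counts in the table, and the way the positive predictive value responds to specificity, can be reproduced with a short script. This is a minimal sketch rather than anything from the original answer: the function name `screening_table` and the extra specificity values in the loop are illustrative assumptions, and the counts are expected values, not simulated outcomes.

```python
from fractions import Fraction


def screening_table(population, positives, sensitivity, specificity):
    """Return (true_pos, false_pos, ppv) as expected counts for a screening test.

    `positives` is the number of people who truly have the trait.
    """
    negatives = population - positives
    true_pos = sensitivity * positives          # trait present, test positive
    false_pos = (1 - specificity) * negatives   # trait absent, test positive
    ppv = true_pos / (true_pos + false_pos)     # Pr[F | T]
    return true_pos, false_pos, ppv


# Numbers from the exercise: 1000 future terrorists among 300 million people.
tp, fp, ppv = screening_table(300_000_000, 1000, Fraction(99, 100), Fraction(999, 1000))
print(tp, fp, float(ppv))

# Hypothetical sweep: hold sensitivity at 99% and vary the false positive rate.
for k in (3, 5, 7):
    spec = 1 - Fraction(1, 10**k)
    _, _, value = screening_table(300_000_000, 1000, Fraction(99, 100), spec)
    print(f"false positive rate 1e-{k}: PPV = {float(value):.4f}")
```

With the exercise's values the first call gives 990 true positives against 299999 false positives. In the sweep, the positive predictive value only reaches about 0.25 at a false positive rate of 1 in 100,000, and about 0.97 at 1 in 10 million, which illustrates how demanding the specificity requirement is when the trait is this rare.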
