Cut the knot: learn to enjoy mathematics
A math books store at a unique math study site. Learn to enjoy mathematics.
Google
Web CTK
Best sites for teachers
Sites for teachers
Sites for parents
Terms of use
Awards

Interactive Activities
CTK Exchange
CTK Insights - a blog

Games & Puzzles
What Is What
Arithmetic/Algebra
Geometry
Probability
Outline Mathematics
Make an Identity
Book Reviews
Eye Opener
Analog Gadgets
Inventor's Paradox
Did you know?...
Proofs
Math as Language
Things Impossible
Visual Illusions
My Logo
Math Poll
Cut The Knot!
MSET99 Talk
Other Math sites
Front Page
Movie shortcuts
Personal info
Reciprocal links
Privacy Policy

Guest book
News sites

Recommend this site

Best sites for teachers
Sites for teachers
Sites for parents

Education & Parenting

Manifesto: what CTK is about Search CTK Buying a book is a commitment to learning Table of content Things you can find on CTK Chronology of updates Email to Cut The Knot Recommend this page

Conditional Probability

The probability of 7 when rolling two die is 1/6 (= 6/36) because the sample space consists of 36 equiprobable elementary outcomes of which 6 are favorable to the event of getting 7 as the sum of two die. Denote this event A: P(A) = 1/6.

Consider another event B which is having at least one 2. There are still 36 elementary outcomes of which 11 are favorable to B; therefore, P(B) = 11/36. We do not know whether B happens or not, but this is a legitimate question to inquire as to what happens if it does. More specifically, what happens to the probability of A under the assumption that B took place?

The assumption that B took place reduces the set of possible outcomes to 11. Of these, only two - 25 and 52 - are favorable to A. Since this is reasonable to assume that the 11 elementary outcomes are equiprobable, the probability of A under the assumption that B took place equals 2/11. This probability is denoted P(A|B) - the probability of A assuming B: P(A|B) = 2/11. More explicitly P(A|B) is called the conditional probability of A assuming B. Of course, for any event A, P(A) = P(A|Ω), where, by convention, Ω is the universal event - the whole of the sample space - for which all available elementary outcomes are favorable.

We see that, in our example, P(A|B) ≠ P(A). In general, this may or may not be so.

Retracing the steps in the example,

  P(A|B) = P(A∩B) / P(B),

and this is the common definition of the conditional probability. The formula confirms to the earlier definitions:

  P(A|Ω) = P(A∩Ω) / P(Ω) = P(A)

since P(Ω) = 1 and A∩Ω = A (because Asubset ofΩ.)

For another example, let's look at the events associated with rolling a dice in the form of octahedron, a shape with eight faces. The sample space naturally consists of 8 equiprobable outcomes:

  Ω = {1, 2, 3, 4, 5, 6, 7, 8}.

Let A be the event of getting an odd number, B is the event getting at least 7. Then P(A) = 4/8 = 1/2, P(B) = 2/8 = 1/4. A∩B = {7}, so that

  P(A|B) = P(A∩B)/P(B) = 1/2 = P(A).

Observe also that

  P(B|A) = P(A∩B)/P(A) = 1/4 = P(B).

On the other hand, define

  A+ = {1, 2, 3, 5, 7} and
A- = {3, 5, 7}.

Then

  P(B|A+) = P(A+∩B)/P(A+) = 1/5 < 1/4 = P(B),

whilst

  P(B|A-) = P(A-∩B)/P(A-) = 1/3 > 1/4 = P(B).

So we see that, in general, there is no definite relationship between the probability P(B) and the conditional probability P(B|A). They may be equal, or one of them may be greater than the other. In the former case the events are said to be independent.

Conditional probabilities arise naturally (and prove useful) in the investigation of experiments run repeatedly where an outcome of a trial may affect the outcomes of the subsequent trials. Drawing without replacement is one class of such experiments. Making a selection by drawing a straw or a match from a bunch of apparently equal items is a common practice throughout the world [Falk, 2.3.3].

For a group of six kids, six identical-looking matches are used. An uninvolved person secretly breaks one match at its lower end and then holds all six in his/her palm, so that the lower ends are hidden, and only six upper ends are visible arranged evenly.

The first child randomly draws a match. If the match is the broken one, the child is selected, and the drawing procedure stops. If not, the second child draws a match, and so on. The drawing procedure terminates when the broken match is selected.

Let us denoted the six children by their ordinal numbers in the draw: 1, 2, ..., 6. (The randomness, or rather arbitrariness of the order can be assured by an entertaining activity of drawing shuttles.) For each child, compute the probability that he or she will be the selected person.

The probability for the first child is obviously 1/6 because there are 6 matches with equal chances of being drawn. The probability that the first child will not be selected is 5/6. If it's not, the remaining 5 matches have equal chances to be drawn by the second child who, therefore, has the probability of 1/5 of being selected on the second drawing. But this probability is conditional on the failure of the first child to win the selection. If A is the event of the first child drawing the broken match, and B is that for the second child,

 
P(B)= P(B|A)
 = P(B∩A) / P(A)
 = 1/5 / 5/6
 = 1/6.

The probability that neither of the first two children is selected 5/6·4/5 = 4/6. If C is the event of the third child being selected then

 
P(C)= P(C|AB)
 = P(C|B)
 = P(C∩B) / P(B)
 = 1/4 / 4/6
 = 1/6.

The similar reasoning applies to the remaining children so that all six of them have an equal chance to draw the broken match. The procedure is quite fair, although, at first sight, the result is counterintuitive.

In describing the survival rate and life expectancy in a certain population, let AN denote the event of reaching the age of N years and P(N) = (AN) be the corresponding probability. In other words, P(N) stands for the probability of a new-born to reach the age of N years. We are given that

  P(50) = .913,
P(55) = .881,
P(65) = .746.

This information suggests several questions. For example, what is the probability of a 50 years old man to reach the age of 55, i.e. what is P(55|50) = P(A55|A50)? Since obviously A55∩A50 = A55, we have by definition,

  P(55|50) = P(A55∩A50)/P(A50) = P(A55)/P(50) ≈ .965.

A probability that a 50 years old will die within 5 years is then a rather comforting 1 - .965 = .035. However, as it should, the probability of dying within the next 5 years grows with age. So if, for example, the probability that a man who just turned 65 will die within 5 years is .16, what is the probability for a man to survive till his 70th birthday, i.e., what is P(70)?

As before, P(70|65) = P(70)/P(65) so that P(70) = P(65)·P(70|65), but

  P(70|65) = 1 - .16 = .84.

Therefore,

  P(70) = P(65)·P(70|65) = .746·.84 ≈ .627.

Note: An interactive tool to check your understanding of conditional probabilities is available at http://www.cut-the-knot.org/Curriculum/Probability/ConditionalProbability.shtml.

References

  1. R. Falk, Understanding Probability and Statistics, A K Peters, 1993

Copyright © 1996-2008 Alexander Bogomolny

28740917Page copy protected against web site content infringement by Copyscape


Search:
Keywords:


Latest on CTK Exchange
Math
Posted by Laura
2 messages
06:56 AM, Apr-15-08

Divisibility rules - Jargon buste ...
Posted by Carolyn
2 messages
08:35 AM, Apr-04-08

drawing puzzle
Posted by martin gran
31 messages
06:53 PM, May-09-08

conway's game of life
Posted by frequency
0 messages
11:52 PM, May-12-08

Mistake on the page (an aside, Be ...
Posted by Max
4 messages
10:28 AM, Feb-28-08

Deriving functions based on diffe ...
Posted by ke_45
1 messages
12:47 PM, May-10-08

Josephus Flavius (correction)
Posted by David Turner
1 messages
09:42 AM, May-14-08