Problem 72: Commuting by train ( ) 2000 Paper II
Tabulated values of , the cumulative distribution function of a standard normal variable, should not be used in this question.
Henry the commuter lives in Cambridge and his working day starts at his office in London at 0900. He catches the 0715 train to King’s Cross with probability , or the 0720 to Liverpool Street with probability . Measured in minutes, journey times for the first train are and for the second are . Journey times from King’s Cross and Liverpool Street to his office are and , respectively. Show that Henry is more likely to be late for work if he catches the first train.
Henry makes journeys, where is large. Writing for and for , find, in terms of , , and , the expected number, , of times that Henry will be late and show that, for all possible values of ,
Henry noted that in of the occasions when he was late, he had caught the King’s Cross train. Obtain an estimate of in terms of and .
Note: A random variable is said to be if it has a normal distribution with mean and variance .
Comments
This is impossible unless you know the following result:
If and
are independent and normally
distributed according to
and , then
is also normally
distributed and .
Even if you didn’t know this result, you do now and it should not be difficult to complete the first parts
question.
For the last part, you need to know something about conditional probability, namely that
which makes sense intuitively and can easily be understood in terms of Venn diagrams. The denominator is
essentially a normalising constant. The formula may be taken as the definition of conditional probability on the
left hand side.
Solution to problem 72
Let be the random variable representing the total journey time via Kings Cross, so that
using the result mentioned on the previous page, and and let be the random variable representing the total journey time via Liverpool Street, so that
Then the probabilities of being late are, respectively, and , i.e. and . Note that .
We have
increases as increases, since , hence the given inequalities corresponding to and .
We have
An estimate for (all it ) is therefore given by
so .
Post-mortem
The result mentioned in the comment on the previous page is just the sort of thing you should try to prove yourself rather than take on trust. Unfortunately, such results in probability tend to be pretty hard to prove. You can also prove it fairly easily using generating functions (which are not in the syllabus for STEP I and II). You can also prove it from first principles. That would be difficult for you because of the awkward integrals involved, but not impossible.