
Problem 72: Commuting by train ( $✓$ $✓$ ) 2000 Paper II

Tabulated values of $Φ (\cdot)$ , the cumulative distribution function of a standard normal variable, should not be used in this question.

Henry the commuter lives in Cambridge and his working day starts at his office in London at 0900. He catches the 0715 train to King’s Cross with probability $p$, or the 0720 to Liverpool Street with probability $1 - p$. Measured in minutes, journey times for the first train are $N (55, 25)$ and for the second are $N (65, 16)$. Journey times from King’s Cross and Liverpool Street to his office are $N (30, 144)$ and $N (25, 9)$, respectively. Show that Henry is more likely to be late for work if he catches the first train.

Henry makes $M$ journeys, where $M$ is large. Writing $A$ for $1 - Φ (\frac{20}{13})$ and $B$ for $1 - Φ (2)$, find, in terms of $A$, $B$, $M$ and $p$, the expected number, $L$, of times that Henry will be late and show that, for all possible values of $p$,

B M \leq L \leq A M .

Henry noted that in $\frac{3}{5}$ of the occasions when he was late, he had caught the King’s Cross train. Obtain an estimate of $p$ in terms of $A$ and $B$ .

Note: A random variable is said to be $N (μ, σ^{2})$ if it has a normal distribution with mean $μ$ and variance $σ^{2}$ .

Comments

This is impossible unless you know the following result:
If $X_{1}$ and $X_{2}$ are independent and normally distributed according to $X_{1} \sim N (μ_{1}, σ_{1}^{2})$ and $X_{2} \sim N (μ_{2}, σ_{2}^{2})$ , then $X_{1} + X_{2}$ is also normally distributed and $X_{1} + X_{2} \sim N (μ_{1} + μ_{2}, σ_{1}^{2} + σ_{2}^{2})$ .
Even if you didn’t know this result, you do now, and it should not be difficult to complete the first parts of the question.
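The result can be checked empirically before it is used. The following is a minimal simulation sketch, not a proof: the function name `simulate_sum`, the sample size and the seed are illustrative choices, and the parameters are those of the King’s Cross route from the question.

```python
import random
import statistics


def simulate_sum(mu1, var1, mu2, var2, n=100_000, seed=0):
    """Draw n samples of X1 + X2, with X1 and X2 independent normals."""
    rng = random.Random(seed)
    # random.gauss expects the standard deviation, not the variance.
    sd1, sd2 = var1 ** 0.5, var2 ** 0.5
    return [rng.gauss(mu1, sd1) + rng.gauss(mu2, sd2) for _ in range(n)]


# Train time N(55, 25) plus onward journey N(30, 144).
samples = simulate_sum(55, 25, 30, 144)
sample_mean = statistics.fmean(samples)
sample_var = statistics.pvariance(samples)

# Both values should be close to the predicted mean 85 and variance 169.
print(sample_mean, sample_var)
```

The simulation only checks the mean and variance of the sum; that the sum is itself normal is the part of the result that needs a proof.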

For the last part, you need to know something about conditional probability, namely that

P (A | B) = \frac{P (B \cap A)}{P (B)}

which makes sense intuitively and can easily be understood in terms of Venn diagrams. The denominator is essentially a normalising constant. The formula may be taken as the definition of the conditional probability on the left-hand side.
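To see the formula at work on something smaller than the train problem, here is a sketch that counts outcomes for a fair six-sided die. The die, the two events and the helper `prob` are illustrative assumptions and do not come from the question.

```python
from fractions import Fraction

outcomes = range(1, 7)  # faces of a fair die (illustrative example only)


def prob(event):
    """Probability of an event given as a predicate on the outcomes."""
    favourable = sum(1 for x in outcomes if event(x))
    return Fraction(favourable, len(outcomes))


def is_even(x):
    return x % 2 == 0


def exceeds_three(x):
    return x > 3


# P(even | greater than 3) = P(even and greater than 3) / P(greater than 3)
p_joint = prob(lambda x: is_even(x) and exceeds_three(x))
p_condition = prob(exceeds_three)
p_cond = p_joint / p_condition

print(p_cond)  # 2/3
```

Dividing by `p_condition` rescales the probabilities so that the outcomes 4, 5 and 6 together have total probability 1, which is the sense in which the denominator is a normalising constant.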

Solution to problem 72

Let $T_{1}$ be the random variable representing the total journey time via King’s Cross, so that

T_{1} \sim N (55 + 30, 25 + 144) = N (85, 169),

using the result quoted in the comments, and let $T_{2}$ be the random variable representing the total journey time via Liverpool Street, so that

T_{2} \sim N (65 + 25, 16 + 9) = N (90, 25) .

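Henry has 105 minutes to reach the office if he catches the 0715 and 100 minutes if he catches the 0720, so the probabilities of being late are, respectively, $P (T_{1} > 105)$ and $P (T_{2} > 100)$. Standardising with standard deviations 13 and 5 gives $1 - Φ (\frac{105 - 85}{13}) = 1 - Φ (\frac{20}{13})$ and $1 - Φ (\frac{100 - 90}{5}) = 1 - Φ (2)$. Since $Φ$ is increasing and $\frac{20}{13} < 2$, we have $Φ (\frac{20}{13}) < Φ (2)$, so Henry is more likely to be late if he catches the first train.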
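The question rules out tables, and the argument needs none, but a numerical check of the two probabilities is reassuring. This is a sketch only: it evaluates $Φ$ through the error function in Python's standard library, and the helper names `phi` and `prob_late` are illustrative.

```python
import math


def phi(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def prob_late(mean, variance, minutes_available):
    """P(T > minutes_available) for T ~ N(mean, variance)."""
    z = (minutes_available - mean) / math.sqrt(variance)
    return 1.0 - phi(z)


A = prob_late(85, 169, 105)  # 0715 via King's Cross: z = 20/13
B = prob_late(90, 25, 100)   # 0720 via Liverpool Street: z = 2

print(A, B)
```

The two values come out at roughly 6% and 2%, consistent with the conclusion that the King’s Cross route is the riskier one.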

We have

L = [p A + (1 - p) B] M = [B + (A - B) p] M .

Since $A > B$, $L$ increases as $p$ increases, so its least and greatest values for $0 \leq p \leq 1$ occur at $p = 0$ and $p = 1$; these give the bounds $B M$ and $A M$ in the given inequalities.
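The bounds can be checked for a few values of $p$. In the sketch below, the values chosen for `A`, `B` and `M` are placeholders satisfying $A > B$, not the true probabilities, and `expected_late` is an illustrative name.

```python
from fractions import Fraction


def expected_late(A, B, M, p):
    """Expected number of late arrivals: L = [pA + (1 - p)B] M."""
    if not 0 <= p <= 1:
        raise ValueError("p must be a probability")
    return (p * A + (1 - p) * B) * M


# Placeholder values with A > B (assumed for illustration only).
A, B, M = Fraction(3, 50), Fraction(1, 50), 1000

values = [expected_late(A, B, M, Fraction(k, 4)) for k in range(5)]
for L in values:
    print(L, B * M <= L <= A * M)
```

Exact fractions are used so that the endpoint cases $p = 0$ and $p = 1$ reproduce $B M$ and $A M$ without rounding error.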

We have

P (\text{King's Cross} \mid \text{late}) = \frac{P (\text{late and King's Cross})}{P (\text{late})} = \frac{A p}{A p + B (1 - p)} .

An estimate for $p$ (call it $\tilde{p}$) is therefore given by

\frac{3}{5} = \frac{A \tilde{p}}{A \tilde{p} + B (1 - \tilde{p})},

so $\tilde{p} = \frac{3 B}{2 A + 3 B}$ .
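The algebra can be verified by substituting the estimate back into the conditional probability. This sketch uses placeholder values for `A` and `B` (assumed for illustration, not the true probabilities); `estimate_p` and `share_kings_cross` are illustrative names.

```python
from fractions import Fraction


def estimate_p(A, B, share_kings_cross=Fraction(3, 5)):
    """Solve s = A p / (A p + B (1 - p)) for p, where s is the observed share."""
    s = share_kings_cross
    # Rearranging: s B (1 - p) = (1 - s) A p, so p = s B / ((1 - s) A + s B).
    return s * B / ((1 - s) * A + s * B)


# Placeholder values with A > B (assumed for illustration only).
A, B = Fraction(3, 50), Fraction(1, 50)

p_tilde = estimate_p(A, B)
share = A * p_tilde / (A * p_tilde + B * (1 - p_tilde))

print(p_tilde, share)  # 1/3 3/5
```

With $s = \frac{3}{5}$ the general expression reduces to $\frac{3 B}{2 A + 3 B}$, in agreement with the result above.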

Post-mortem

The result quoted in the comments is just the sort of thing you should try to prove yourself rather than take on trust. Unfortunately, such results in probability tend to be pretty hard to prove. This one, however, can be proved fairly easily using generating functions (which are not in the syllabus for STEP I and II). You can also prove it from first principles; that would be difficult for you because of the awkward integrals involved, but not impossible.
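For the curious, the generating-function argument can be sketched as follows. It assumes that the moment generating function is the kind of generating function intended, and it takes on trust two standard facts: the form of the moment generating function of a normal variable, and the theorem that a moment generating function determines the distribution.

```latex
% Moment generating function of X ~ N(\mu, \sigma^2):
M_X(t) = E\left(e^{tX}\right) = \exp\left(\mu t + \tfrac{1}{2}\sigma^{2} t^{2}\right).

% For independent X_1 and X_2, the expectation of the product factorises:
M_{X_1 + X_2}(t) = E\left(e^{tX_1}\right) E\left(e^{tX_2}\right)
  = \exp\left((\mu_1 + \mu_2)\, t + \tfrac{1}{2}\left(\sigma_1^{2} + \sigma_2^{2}\right) t^{2}\right).

% This is the moment generating function of N(\mu_1 + \mu_2, \sigma_1^2 + \sigma_2^2).
```

The awkward integral has not disappeared: it is hidden in the first line, where completing the square in the exponent is needed to evaluate $E (e^{tX})$.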