Given an i.i.d. sequence of random variables \(X_1, X_2, \dots, X_n,\) define
\begin{align}
& X_{(1)} = \text{min}(\{X_1, X_2, \dots, X_n\}) \\
& X_{(2)} = \text{min}(\{X_1, X_2, \dots, X_n\} - \{X_{(1)}\}) \\
& X_{(3)} = \text{min}(\{X_1, X_2, \dots, X_n\} - \{X_{(1)}, X_{(2)}\}) \\
& \vdots \\
& X_{(n)} = \text{min}(\{X_1, X_2, \dots, X_n\} - \{X_{(1)}, X_{(2)}, \dots, X_{(n-1)}\}) = \text{max}(\{X_1, X_2, \dots, X_n\})
\end{align}
The order statistic of \(X_1, X_2, \dots, X_n\) is the probability that \((X_{(1)} = x_1, X_{(2)} = x_2, \dots, X_{(n)} = x_n)\) for some numbers \(x_1 < x_2 < \dots < x_n.\)
Given that \((X_{(1)} = x_1, X_{(2)} = x_2, \dots, X_{(n)} = x_n)\) occurs, there are \(n!\) possible values for \((X_1, X_2, \dots, X_n)\) since there are \(n!\) ways to order the numbers \(x_1, x_2, \dots, x_n.\) So, if we let \(f(x)\) be the pdf of the random variables in the i.i.d. sequence \(X_1, X_2, \dots, X_n,\) the order statistic has pdf \(f(x_1, \dots, x_n)\) defined by
\[f(x_1, x_2, \dots, x_n) = n!f(x_1)f(x_2) \dots f(x_n)\]
Example
Let \(X_1,\) \(X_2\) and \(X_3\) be independent, exponentially distributed random variables with parameter \(1.\) What is the probability that all of the random variables are at least \(1\) apart?
▼ Solution:
If \(X_1,\) \(X_2\) and \(X_3\) are at least one apart, then \(X_{(1)} + 1 < X_{(2)}\) and \(X_{(2)} + 1< X_{(3)}.\) The joint pdf of \(X_{(1)}, X_{(2)}, X_{(3)}\) is \(f(x_1, x_2, x_3) = 3!e^{-x_1}e^{-x_2}e^{-x_3}\) over \(0 < x_1 < x_2 < x_3.\)
1. Let \(U_1, U_2, U_3, U_4\) be independent, continuous uniform random variables over \([0, 5].\) What is the probability that some pair of the random variables are within \(1\) of each other?
Unanswered
Solution: Let \(A\) be the event that at least two of the random variables are within \(1\) of each other. Then \(A^C\) is the event that all of \(U_1, U_2, U_3\) and \(U_4\) are at least \(1\) apart. Converting to order statistics, this means that \(A^C\) occurs when \(U_{(1)} + 1 < U_{(2)},\) \(U_{(2)} + 1 < U_{(3)},\) and \(U_{(3)} + 1 < U_{(4)}.\)
The joint pdf of \(U_{(1)}, U_{(2)}, U_{(3)}\) and \(U_{(4)}\) is \(f(x_1, x_2, x_3, x_4) = \frac{4!}{5^4}\) over \(x_1 < x_2 < x_3 < x_4.\) So,
\begin{align}
P(A^C) & = \int_0^5 \int_{x_1+1}^5 \int_{x_2+1}^5 \int_{x_3+1}^5 \frac{4!}{5^4} dx_4 dx_3 dx_2 dx_1 \\
& = \frac{4!}{5^4} \int_0^5 \int_{x_1+1}^5 \int_{x_2+1}^5 4 - x_3 dx_3 dx_2 dx_1 \\
& = \frac{4!}{5^4} \int_0^5 \int_{x_1+1}^5 \frac{1}{2}(y^2 - 6y + 8) dx_2 dx_1 \\
& = \frac{4!}{5^4} \int_0^5 -\frac{1}{6}(y-4)(y-1)^2 dx_1 \\
& = \frac{4!}{5^4} \frac{5}{24} \\
& = \frac{1}{125} \\
& = 0.008
\end{align}
So, the probability of \(A\) is \(1 - 0.008 = 0.992\)