In the article "Who solved the Secretary problem?" (
Ferguson, 1989), it's claimed the secretary problem first appeared in print in
Martin Gardner's February 1960
Mathematical Games column in
Scientific American:Ask someone to take as many slips of paper as he pleases, and on each slip write a different positive number. The numbers may range from small fractions of 1 to a number the size of a
googol (1 followed by a hundred zeroes) or even larger. These slips are turned face down and shuffled over the top of a table. One at a time you turn the slips face up. The aim is to stop turning when you come to the number that you guess to be the largest of the series. You cannot go back and pick a previously turned slip. If you turn over all the slips, then of course you must pick the last one turned.Ferguson pointed out that the secretary
game remained unsolved, as a
zero-sum game with two antagonistic players. In this game: • Alice, the informed player, writes secretly distinct numbers on n cards. • Bob, the stopping player, observes the actual values and can stop turning cards whenever he wants, winning if the last card turned has the overall maximal number. • Bob wants to guess the maximal number with the highest possible probability, while Alice's goal is to keep this probability as low as possible. The difference with the basic secretary problem are two: • Alice does not have to write numbers uniformly at random. She may write them according to any
joint probability distribution to trick Bob. • Bob observes the actual values written on the cards, which he can use in his decision procedures.
Strategic analysis Alice first writes down n numbers, which are then shuffled. So, their ordering does not matter, meaning that Alice's numbers must be an
exchangeable random variable sequence X_1, X_2, ..., X_n. Alice's strategy is then just picking the trickiest exchangeable random variable sequence. Bob's strategy is formalizable as a
stopping rule \tau for the sequence X_1, X_2, ..., X_n. We say that a stopping rule \tau for Bob is a
relative rank stopping strategy if it depends on only the relative ranks of X_1, X_2, ..., X_n, and not on their numerical values. In other words, it is as if someone secretly intervened after Alice picked her numbers, and changed each number in X_1, X_2, ..., X_n into its relative rank (breaking ties randomly). For example, 0.2, 0.3, 0.3, 0.1 is changed to 2, 3, 4, 1 or 2, 4, 3, 1 with equal probability. This makes it
as if Alice played an exchangeable
random permutation on \{1, 2, ..., n\}. Now, since the only exchangeable random permutation on \{1, 2, ..., n\} is just the uniform distribution over all permutations on \{1, 2, ..., n\}, the optimal relative rank stopping strategy is the optimal stopping rule for the secretary problem, given above, with a winning probabilityPr(X_\tau = \max_{i\in 1:n} X_i) = \max_{r\in 1:n}\frac{r-1}{n} \sum_{i=r}^{n} \frac{1}{i-1}Alice's goal then is to make sure Bob cannot do better than the relative-rank stopping strategy. By the rules of the game, Alice's sequence must be exchangeable, but to do well in the game, Alice should not pick it to be independent. If Alice samples the numbers independently from some fixed distribution, it would allow Bob to do better. To see this intuitively, imagine if n=2, and Alice is to pick both numbers from the
normal distribution N(0, 1), independently. Then if Bob turns over one number and sees -3, then he can quite confidently turn over the second number, and if Bob turns over one number and sees +3, then he can quite confidently pick the first number. Alice can do better by picking X_1, X_2 that are positively correlated. So the fully formal statement is as below: : Does there exist an exchangeable sequence of random variables X_1, ..., X_n, such that for
any stopping rule \tau, the inequality Pr(X_\tau = \max_{i\in 1:n} X_i) \leq \max_{r\in 1:n}\frac{r-1}{n} \sum_{i=r}^{n} \frac{1}{i-1} holds?
Solution For n=2, if Bob plays the optimal relative-rank stoppings strategy, then Bob has a winning probability 1/2. Surprisingly, Alice has no
minimax strategy, which is closely related to a paradox of
T. Cover and the
two envelopes paradox. Concretely, Bob can play this strategy: sample a random number Y. If X_1 > Y, then pick X_1, else pick X_2. Now, Bob can win with probability strictly greater than 1/2. Suppose Alice's numbers are different, then condition on Y \not\in [\min(X_1, X_2), \max(X_1, X_2)], Bob wins with probability 1/2, but condition on Y \in [\min(X_1, X_2), \max(X_1, X_2)], Bob wins with probability 1. Note the random number Y can be sampled from
any random distribution, as long as Y \in [\min(X_1, X_2), \max(X_1, X_2)] has a nonzero probability. However, for any \epsilon > 0, Alice can construct an exchangeable sequence X_1, X_2 such that Bob's winning probability is at most 1/2 + \epsilon. But for n>2, the answer is yes: Alice can choose random numbers (which are dependent random variables) in such a way that Bob cannot play better than using the classical stopping strategy based on the relative ranks. ==Heuristic performance==