The simple resolution above assumed that the person who invented the argument for switching was trying to calculate the expectation value of the amount in Envelope A, thinking of the two amounts in the envelopes as fixed (
x and 2
x). The only uncertainty is which envelope has the smaller amount
x. However, many mathematicians and statisticians interpret the argument as an attempt to calculate the expected amount in Envelope B, given a real or hypothetical amount "A" in Envelope A. One does not need to look in the envelope to see how much is in there, in order to do the calculation. If the result of the calculation is an advice to switch envelopes, whatever amount might be in there, then it would appear that one should switch anyway, without looking. In this case, at Steps 6, 7 and 8 of the reasoning, "A" is any fixed possible value of the amount of money in the first envelope. This interpretation of the two envelopes problem appears in the first publications in which the paradox was introduced in its present-day form, Gardner (1989) and Nalebuff (1988).) It is common in the more mathematical literature on the problem. It also applies to the modification of the problem (which seems to have started with Nalebuff) in which the owner of envelope A does actually look in his envelope before deciding whether or not to switch; though Nalebuff does also emphasize that there is no need to have the owner of envelope A look in his envelope. If he imagines looking in it, and if for any amount which he can imagine being in there, he has an argument to switch, then he will decide to switch anyway. Finally, this interpretation was also the core of earlier versions of the two envelopes problem (Littlewood's, Schrödinger's, and Kraitchik's switching paradoxes); see
the history section. This kind of interpretation is often called "Bayesian" because it assumes the writer is also incorporating a prior probability distribution of possible amounts of money in the two envelopes in the switching argument.
Simple form of Bayesian resolution The simple resolution depended on a particular interpretation of what the writer of the argument is trying to calculate: namely, it assumed he was after the (unconditional)
expectation value of what's in Envelope B. In the mathematical literature on Two Envelopes Problem, a different interpretation is more common, involving the
conditional expectation value (conditional on what might be in Envelope A). To solve this and related interpretations or versions of the problem, most authors use the
Bayesian interpretation of probability, which means that probability reasoning is not only applied to truly random events like the random pick of an envelope, but also to our knowledge (or lack of knowledge) about things which are fixed but unknown, like the two amounts originally placed in the two envelopes, before one is picked at random and called "Envelope A". Moreover, according to a long tradition going back at least to
Laplace and his
principle of insufficient reason one is supposed to assign equal probabilities when one has no knowledge at all concerning the possible values of some quantity. Thus the fact that we are not told anything about how the envelopes are filled can already be converted into probability statements about these amounts. No information means that probabilities are equal. In steps 6 and 7 of the switching argument, the writer imagines that envelope A contains a certain amount
a, and then seems to believe that given that information, the other envelope would be equally likely to contain twice or half that amount. That assumption can only be correct, if prior to knowing what was in Envelope A, the writer would have considered the following two pairs of values for both envelopes equally likely: the amounts
a/2 and
a; and the amounts
a and 2
a. (This follows from
Bayes' rule in odds form: posterior odds equal prior odds times likelihood ratio). But now we can apply the same reasoning, imagining not
a but
a/2 in Envelope A. And similarly, for 2
a. And similarly, ad infinitum, repeatedly halving or repeatedly doubling as many times as you like. Suppose for the sake of argument, we start by imagining an amount of 32 in Envelope A. In order that the reasoning in steps 6 and 7 is correct
whatever amount happened to be in Envelope A, we apparently believe in advance that all the following ten amounts are all equally likely to be the smaller of the two amounts in the two envelopes: 1, 2, 4, 8, 16, 32, 64, 128, 256, 512 (equally likely powers of 2 But the problem can also be resolved mathematically without assuming a maximum amount. Nalebuff, Nickerson and Falk, The standard assumption about the way the envelopes are set up is that a sum of money is in one envelope, and twice that sum is in another envelope. One of the two envelopes is randomly given to the player (
envelope A). The originally proposed problem does not make clear exactly how the smaller of the two sums is determined, what values it could possibly take and, in particular, whether there is a minimum or a maximum sum it might contain. However, if we are using the Bayesian interpretation of probability, then we start by expressing our prior beliefs as to the smaller amount in the two envelopes through a probability distribution. Lack of knowledge can also be expressed in terms of probability. A first variant within the Bayesian version is to come up with a proper prior probability distribution of the smaller amount of money in the two envelopes, such that when Step 6 is performed properly, the advice is still to prefer Envelope B, whatever might be in Envelope A. So though the specific calculation performed in step 6 was incorrect (there is no proper prior distribution such that, given what is in the first envelope A, the other envelope is always equally likely to be larger or smaller) a correct calculation, depending on what prior we are using, does lead to the result E(B | A = a) > a for all possible values of
a. In these cases, it can be shown that the expected sum in both envelopes is infinite. There is no gain, on average, in swapping.
Second mathematical variant Though Bayesian probability theory can resolve the first mathematical interpretation of the paradox above, it turns out that examples can be found of proper probability distributions, such that the expected value of the amount in the second envelope, conditioned on the amount in the first, does exceed the amount in the first, whatever it might be. The first such example was already given by Nalebuff. Denote again the amount of money in the first envelope by
A and that in the second by
B. We think of these as random. Let
X be the smaller of the two amounts and
Y=2X be the larger. Notice that once we have fixed a probability distribution for
X then the
joint probability distribution of
A, B is fixed, since
A, B =
X, Y or
Y, X each with probability 1/2, independently of
X, Y. The
bad step 6 in the "always switching" argument led us to the finding
E(B|A=a)>a for all
a, and hence to the recommendation to switch, whether or not we know
a. Now, it turns out that one can quite easily invent proper probability distributions for
X, the smaller of the two amounts of money, such that this bad conclusion is still true. One example is analyzed in more detail, in a moment. As mentioned before, it cannot be true that whatever
a, given
A=a,
B is equally likely to be
a/2 or 2
a, but it can be true that whatever
a, given
A=a,
B is larger in expected value than
a. Suppose for example that the envelope with the smaller amount actually contains 2
n dollars with probability 2
n/3
n+1 where
n = 0, 1, 2, ... These probabilities sum to 1, hence the distribution is a proper prior (for subjectivists) and a completely decent probability law also for frequentists. Imagine what might be in the first envelope. A sensible strategy would certainly be to swap when the first envelope contains 1, as the other must then contain 2. Suppose on the other hand the first envelope contains 2. In that case, there are two possibilities: the envelope pair in front of us is either {1, 2} or {2, 4}. All other pairs are impossible. The
conditional probability that we are dealing with the {1, 2} pair, given that the first envelope contains 2, is \begin{align} P(\{1,2\} \mid 2) &= \frac{P(\{1,2\})/2}{P(\{1,2\})/2+P(\{2,4\})/2} \\ &= \frac{P(\{1,2\})}{P(\{1,2\})+P(\{2,4\})} \\ &= \frac{1/3}{1/3 + 2/9} = 3/5, \end{align} and consequently the probability it's the {2, 4} pair is 2/5, since these are the only two possibilities. In this derivation, P(\{1,2\})/2 is the probability that the envelope pair is the pair 1 and 2,
and envelope A happens to contain 2; P(\{2,4\})/2 is the probability that the envelope pair is the pair 2 and 4,
and (again) envelope A happens to contain 2. Those are the only two ways that envelope A can end up containing the amount 2. It turns out that these proportions hold in general unless the first envelope contains 1. Denote by
a the amount we imagine finding in Envelope A, if we were to open that envelope, and suppose that
a = 2
n for some
n ≥ 1. In that case the other envelope contains
a/2 with probability 3/5 and 2
a with probability 2/5. So either the first envelope contains 1, in which case the conditional expected amount in the other envelope is 2, or the first envelope contains
a > 1, and though the second envelope is more likely to be smaller than larger, its conditionally expected amount is larger: the conditionally expected amount in Envelope B is \frac{3}{5} \frac{a}{2} + \frac{2}{5} 2a = \frac{11}{10}a which is more than
a. This means that the player who looks in envelope A would decide to switch whatever he saw there. Hence there is no need to look in envelope A to make that decision. This conclusion is just as clearly wrong as it was in the preceding interpretations of the Two Envelopes Problem. But now the flaws noted above do not apply; the
a in the expected value calculation is a constant and the conditional probabilities in the formula are obtained from a specified and proper prior distribution.
Proposed resolutions through mathematical economics Most writers think that the new paradox can be defused, although the resolution requires concepts from
mathematical economics. Suppose E(B|A=a)>a for all a. It can be shown that this is possible for some probability distributions of
X (the smaller amount of money in the two envelopes) only if E(X)=\infty. That is, only if the mean of all possible values of money in the envelopes is infinite. To see why, compare the series described above in which the probability of each
X is 2/3 as likely as the previous
X with one in which the probability of each
X is only 1/3 as likely as the previous
X. When the probability of each subsequent term is greater than one-half of the probability of the term before it (and each
X is twice that of the
X before it) the mean is infinite, but when the probability factor is less than one-half, the mean converges. In the cases where the probability factor is less than one-half, E(B|A=a) for all
a other than the first, smallest
a, and the total expected value of switching converges to 0. In addition, if an ongoing distribution with a probability factor greater than one-half is made finite by, after any number of terms, establishing a final term with "all the remaining probability," that is, 1 minus the probability of all previous terms, the expected value of switching with respect to the probability that
A is equal to the last, largest
a will exactly negate the sum of the positive expected values that came before, and again the total expected value of switching drops to 0 (this is the general case of setting out an equal probability of a finite set of values in the envelopes described above). Thus, the only distributions that seem to point to a positive expected value for switching are those in which E(X)=\infty. Averaging over
a, it follows that E(B)=E(A)=\infty (because
A and
B have identical probability distributions, by symmetry, and both
A and
B are greater than or equal to
X). If we do not look into the first envelope, then clearly there is no reason to switch, since we would be exchanging one unknown amount of money (
A), whose expected value is infinite, for another unknown amount of money (
B), with the same probability distribution and infinite expected value. However, if we do look into the first envelope, then for all values observed (A=a) we would want to switch because E(B|A=a)>a for all
a. As noted by
David Chalmers, this problem can be described as a failure of dominance reasoning. Under dominance reasoning, the fact that we strictly prefer
A to
B for all possible observed values
a should imply that we strictly prefer
A to
B without observing
a; however, as already shown, that is not true because E(B)=E(A)=\infty. To salvage dominance reasoning while allowing E(B)=E(A)=\infty, one would have to replace expected value as the decision criterion, thereby employing a more sophisticated argument from mathematical economics. For example, we could assume the decision maker is an
expected utility maximizer with initial wealth
W whose utility function, u(w), is chosen to satisfy E(u(W+B)|A=a) for at least some values of
a (that is, holding onto A=a is strictly preferred to switching to
B for some
a). Although this is not true for all utility functions, it would be true if u(w) had an upper bound, \beta, as
w increased toward infinity (a common assumption in mathematical economics and decision theory).
Michael R. Powers provides necessary and sufficient conditions for the utility function to resolve the paradox, and notes that neither u(w) nor E(u(W+A))=E(u(W+B)) is required. Some writers would prefer to argue that in a real-life situation, u(W+A) and u(W+B) are bounded simply because the amount of money in an envelope is bounded by the total amount of money in the world (
M), implying u(W+A) \leq u(W+M) and u(W+B) \leq u(W+M). From this perspective, the second paradox is resolved because the postulated probability distribution for
X (with E(X)=\infty) cannot arise in a real-life situation. Similar arguments are often used to resolve the
St. Petersburg paradox.
Controversy among philosophers As mentioned above,
any distribution producing this variant of the paradox must have an infinite mean. So before the player opens an envelope the expected gain from switching is "∞ − ∞", which is not defined. In the words of
David Chalmers, this is "just another example of a familiar phenomenon, the strange behavior of infinity". They argue that decision theory should be extended so as to allow infinite expectation values in some situations. ==Smullyan's non-probabilistic variant==