Prerequisites: All Theory Chapters, except maybe Chapter 9, and the first part on poker.
This chapter looks into two versions of VNM Poker---VNM POKER(2,r,m,n) and VNM POKER(4,4,3,5) in more detail, using mixed strategies. The first part is also an exercise how to work with parameters, and uses some algebraic manipulation of expressions, and some graphing of equations.
Recall from the first discussion of VNM Poker that we play with r cards of value 1 and r cards of value 2, with initial bet of m and a raised bet of n. We assume r ≥ 2, otherwise the game would have perfect information. The extensive form of VNM POKER(2,4,2,3) looks as follows.

In our case S=2, the pure strategies for Ann are the prudent CC (check in any case), the reasonable CR (check in case of a "1" and raise in case of a "2"), the silly looking RC (raise in case of a "1" and check in case of a "2") and the aggressive RR (raise in any case). Beth's pure strategies are the prudent FF (fold in any case), the reasonable FC (fold in case of a "1", call in case of a "2"), the counterintuitive CF (call in case of a "1", fold in case of a "2"), and the aggressive CC (call in any case). The normal form with the expectations for Ann (remember, it is a zero-sum game) looks as follows:
| FF | FC | CF | CC | |
| CC | 0 | 0 | 0 | 0 |
| CR | (m(r-1))/(4r-2) | 0 | (nr-m)/(4r-2) | (n-m)r/(4r-2) |
| RC | (3r-1)m/(4r-2) | ((2r-1)m-rn)/(4r-2) | 2mr/(4r-2) | (m-n)r/(4r-2) |
| RR | m | ((2r-1)m-rn)/(4r-2) | ((2r-1)m+rn)/(4r-2) | 0 |
These expressions are obtained as follows. For every fixed pair of strategies, let Ax,y be Ann's payoff provided both players stick to their chosen strategies and Ann gets a card of value x and Beth a card of value y. For instance, if Ann plays the aggressive RR and Beth plays the reasonable FC, then A1,1=m , A1,2=-n, A2,1=m, and A2,2=0. Then Ann's total expected payoff for the pair of chosen strategies equals
As noted in the first part, Ann's strategy CC is weakly dominated by strategy CR, and strategy RC is weakly dominated by RR, therefore both can be deleted. Similar, Beth's strategy FF is weakly dominated by FC, and her strategy CF is weakly dominated by CC. Thus the remaining normal form is
| FC | CC | |
| CR | 0 | (n-m)r/(4r-2) |
| RR | ((2r-1)m-rn)/(4r-2) | 0 |
Note that the entry (n-m)r/(4r-2) is always positive. If the value ((2r-1)m-rn)/(4r-2) is not positive, then Ann's strategy CR weakly dominates RR, and Beth's strategy FC weakly dominates CC, therefore there is an equilibrium in pure strategies, CR versus FC, with expected outcome 0 in that case.
The value ((2r-1)m-rn)/(4r-2) is non-positive if rn ≥ (2r-1)m, i.e. if n/m ≥ (2r-1)/r, i.e. if the increased bet n is considerably higher than the ante m. The values (2r-1)/r equal 3/2, 5/3, and 7/8 for r=2,3,4. Even for very large r, this value is always lower than 2. In particular, if raising means doubling, the analysis is already finished for every r. Then aggressive play, bluffing, will not occur, since it is too expensive.
Assume Beth plays FC in q of the cases, and CC in the remaining 1-q. Or in terms of behavior strategies, when holding a value of 1 Beth folds with probability q, and when holding a value of 2 Beth always calls. Ann's playoff when playing CR is (1-q)(n-m)r/(4r-2). This is larger or equal to Ann's payoff when playing CC, which is q((2r-1)m-rn)/(4r-2), if
In the same way we compute Beth's best response to Ann's mixed strategy of playing CR with probability p, and playing RR with probability 1-p (which again translates easily into the behavior strategy of checking with probability p when holding a value 1 card, and raising always with a card of value 2). When Beth plays FC, Beth's expected payoff equals -(1-p)((2r-1)m-rn)/(4r-2) and when she plays CC, Beth's expected payoff equals -p(n-m)r/(4r-2). After some calculations as above, we can see that both values are equal for p = ((2r-1)m-rn)/((r-1)m), let's call this value p*. Moreover, Beth's payoff when she plays FC is smaller than her payoff when whe plays CC provided p < p*. Beth should play CC when Ann plays CR with frequency less than p*, when Ann plays too aggressively, bluffs too much. Conversely, Beth should play the lame FC when Ann plays CR more often than the value given above, that is, if Ann plays too lamely.
In essence, a clever Ann would complement the observed behavior of the opponent. Beth, on the other hand, should mirror the play, aggressive or cautious, she observes in Ann.
The key for effective play is to find out how the computer player plays by observing it playing repeatedly. Note also that it suffices to read the other player's behavior strategy, since any mixed strategy belonging to the same behavior strategy has the same payoffs. We also assume that Ann always raises and Beth always calls with a card of value 2---without that assumption the analysis would be much more difficult.
Of course, if players randomize, then we cannot tell the other's probabilities used after a few rounds, but after many rounds the Law of Large Numbers tells us that the frequencies observed are close to the probabilities used. The only problem is that we cannot observe all decisions---we don't see all of the other's decisions. If a player folds, then no player sees the others card, not even after the game is over.
What Beth can observe about Ann's play is how often Ann checks and raises, say 30% and 70% to give an example. We assume that Ann only checks with a card of value 1. Since on the long run, in about 50% of all rounds Ann has a "1", but only in 30% of all rounds Ann checks (with a "1"), in about 20% of all rounds, Ann must raise with a "1". Therefore in 20%/50% = 40% of the cases where Ann has a "1" she raises. In other words, Ann plays a mix of 60% CR and 40% RR.
Let us now discuss how Ann can observe Beth's strategy. We look only at the cases where Ann raises with a card of value 2. Let's say Beth calls in 70% of that cases, and when she calls she displays 39% of "1"s and 61% of "2"s. Among all these cases considered, Beth calls with a "1" with frequency 70%·39% ≈ 27%, Beth calls with a "2" with frequency 70%·61% ≈ 43%. These 43% are about 3/7, the fraction of considered cases where Beth holds a "2", given that Ann holds a "2", which confirms our assumption on Beth never folding with a "2". The only other option is to fold with a "1", which occurs in 30% of the cases considered. Therefore, when Beth has a "1" she calls in 27%/(27%+30%) ≈ 47% of the cases, Beth plays a mix of 53% FC and 47% CC in our example.
If n/m ≤ (2r-1)/r then there is no pure Nash equilibrium, therefore there must be a Nash equilibrium in mixed strategies, Ann playing CR with probability p, and RR with probability 1-p, and Beth playing FC with probability q, and CC with probability 1-q, with 0 < p < 1 and 0 < q < 1. Since each mix is a best response to each mix, and therefore each one of the two pure strategies is a best response to each mix, we conclude p = p* = ((2r-1)m-rn)/((r-1)m)) and q = q* = (n-m)r/((r-1)m) as discussed above.
Instead of in terms of n and m, both probabilities can also be expressed in terms of the ratio n/m:
The value of the game, Ann's payoff when both use these Nash equilibrium mixed strategies,
equals
Of course this value of the game could also be computed as p* · 0 +(1-p*) · ((2r-1)m-rn))/(4r-2)) or as q* · 0 + (1-q*) · (n-m)r/(4r-2) or as q* · ((2r-1)m-rn))/(4r-2)) + (1-q*) · 0.
Note that this value is the product of the initial bet m and an expression in the ratio n/m and r. That means that, if we keep n/m and r fixed (and therefore change n proportional to m) the value is proportional to m. Or, the value per ante V/m only depends on r and the ratio n/m:
is the area with V=0
where the pure strategies "CR" and "FC" apply, and the curve below is the curve
n/m = 2-1/r. Of course, for our game r is discrete, and values of 1.5 for r do not make sense, but still
the graph expresses a lot of information about the game.
When playing the game, somebody proposes to increase n very slightly. Whose advantage is it, Ann's or Beth's? Or it is proposed to increase the number r of duplicates, who will profit from this?
These questions are best answered looking at the graph above. Increasing r means moving to the right. The ratio V/m, where V is Ann's payoff if both play the Nash equilibrium, increases slightly. Thus playing with more stacks of cards is always advantageous for Ann.
The question of changed n is more complicated. Of course, increasing n increases the ratio n/m if m is kept fixed. That means we move vertically in the graph above. Doing so, and starting at the r-axis V/m increases until we meet the "ridge", which is visualized by the dashed line in the graph. This ridge curve has the equation n/m = 3/2 - 1/(2r) (as could be shown using a little Calculus). Therefore the answer is that increasing n slightly increases Ann advantage provided n/m < 3/2 - 1/(2r) (provided we are south of the ridge) and decreases Ann's advantage otherwise.
In order to answer the question of how the Nash equilibrium mixes would change if r is increased, or if n is increased, let us graph the values p* and q* also in terms of r and the ratio n/m. Remember that p* and q* are the percentages of Ann checking respectively Beth folding when holding a card of value 1. In these graphs the color indicates the value of p* and q*, purple being 0 (aggressive play), greenish being around 0.5, and red indicating 1. The contour curves are again combinations with constant p* and q*.


Again it can be easily seen that south of the curve n/m = 2 - 1/r, the mixes of Ann and Beth complement each other. Where Ann raises a lot with cards of value 1, Beth calls little, and vice versa. Just north of the curve Ann always checks with a "1", and Beth always folds with such a card. The stakes are too high for bluffing there.
From the graphs there follows that increasing r means that p* increases slightly while q* decreases. Increasing n slightly means a decrease of p* and an increase of q*. In any case, small changes of the parameters means small changes of p* and q*, except in the case if we cross the curve n/m = 2 - 1/r in which case p* changes from 1 to 0 or conversely. Slight causes, but dramatic implications in that case.
The Indifference Theorem from Theory Chapter 8 implies that against Beth's Nash mix of q* FC and (1-q*) CC, Ann gets the same (bast-possible) payoff with CR and RR. What about CC and RC? Well, CC is easy to analyze, the expected payoff for both players equals 0 in that case, whatever Beth plays, so Ann wastes her advantage by playing this way. How much would she waste with RC? According to the 4 × 4 payoff matrix given above, her expected payoff when playing RC against a mix of q* FC and (1-q*) CC equals
The analysis of FF and CF is left to the reader as exercise, see below.
SIMULTANEOUS VNMPOKER(S,r,m,n): In the simultaneous version both players decide simultaneously whether they want to raise for m or for n. If one of them raises for n and the other for m, the one daring the higher amount (n) wins m from the other one, regardless what the cards show. If both raise the same amount, the one with the higher card wins n respectively m from the other one, again, no win for draws of identical cards.
Both player have four pure strategies: Betting low in both cases ("LL"), Betting low with a card of value "1" and high with a card of value "2" ("LH"), the counterintuitive strategy of raising with a card of value "1" and low with a card of "2" ("HL"), and raising high for both cards ("HH"). Remember that the probability of both having a card of calue "1" is (r-1)/(2(2r-1)), wheras the probability for Ann getting a "1" and Beth getting a "2" is the slightly higher r/(2(2r-1)). Then for each pair of strategies, the four cases of card distributions have to be condsidered, the payoffs noted, and the expected value as sum of probability multiplied by payoff, for all these four cases has to be computed. We get the following payoff matrix:
| LL | LH | HL | HH | |
| LL | 0 | -m(r-1)/(2(2r-1)) | -m(3r-1)/(2(2r-1)) | -m(4r-2)/(2(2r-1)) |
| LH | m(r-1)/(2(2r-1)) | 0 | r(n-m)/(2(2r-1)) | (nr-2mr+m)/(2(2r-1)) |
| HL | m(3r-1)/(2(2r-1)) | r(m-n)/(2(2r-1)) | 0 | (1-r(n+2m))/(2(2r-1)) |
| HH | m(4r-2)/(2(2r-1)) | (-nr+2mr-m)/(2(2r-1)) | (r(n+2m)-1)/(2(2r-1)) | 0 |
Choose m=1, n=3, r=4:
| LL | LH | HL | HH | |
| LL | 0 | -3/14 | -11/14 | -6/14 |
| LH | 3/14 | 0 | 8/14 | 6/14 |
| HL | 11/14 | -8/14 | 0 | (1-r(n+2m))/(2(2r-1)) |
| HH | 6/14 | -6/14 | (r(n+2m)-1)/(2(2r-1)) | 0 |