A gambler has 2 dollars. He is allowed to play a game four times, and his goal is to maximize his probability of ending with at least 6 dollars. If the gambler bets b dollars then, with probability 0.4, he wins the game and increases his capital position by b dollars; with probability 0.6, he loses the game and decreases his capital by b dollars. On any play of game gambler may not bet more than he has available. Determine the betting strategy that would maximize the gamblers probability of attaining at least 6 dollars by the end of fourth game. Assume that bets of zero dollars (not betting) are permissible.
By using probabilistic dynamic programming solve this.
Hint: Let ft(d) be the probability by the end of game 4, the gambler will have at least 6 dollars, where d is the amount that he has at the beginning of game t which he can use for betting, and let bt(d) be the amount that he bets to attain ft(d).
max{0.4*f_{t+1}(d+b) + 0.6*f_{t+1}(d−b)}
But I don't understand how I can make a recursive relationship between ft(d) and ft+1 and solve this problem using hint.
By using probabilistic dynamic programming solve this.
Hint: Let ft(d) be the probability by the end of game 4, the gambler will have at least 6 dollars, where d is the amount that he has at the beginning of game t which he can use for betting, and let bt(d) be the amount that he bets to attain ft(d).
max{0.4*f_{t+1}(d+b) + 0.6*f_{t+1}(d−b)}
But I don't understand how I can make a recursive relationship between ft(d) and ft+1 and solve this problem using hint.
Last edited: