18  Kelly Betting

Authors

Jean-Stanislas Denain, Jacob Steinhardt

In this lecture, we will describe Kelly Betting, a reasonable heuristic for deciding how much of your money to invest in an asset that is profitable in expectation, but risky.

As a first example, consider the following situation:

Coin Flip

You pay \(n\) dollars, and a fair coin is flipped:

  • if the outcome is Heads, you receive \(3n\) dollars: your net earnings are \(2n\) dollars
  • if the outcome is Tails, you receive nothing: your net earnings are \(-n\) dollars

Question: What value of \(n\) do you choose?

This is a good bet in expectation: your expected net earnings are \(\frac 1 2 \times (2n) + \frac 1 2 \times (-n) = +\frac n 2\) dollars. Therefore, if you wanted to maximize your expected amount of dollars, you should choose the highest possible value of \(n\) and bet all of your money.

However, this is very risky: if you bet all of your money, you lose everything with probability \(\frac 1 2\), whenever the coin comes up Tails. Thus, most of you would probably not bet all of your money, but only some fraction of it. The Kelly Criterion provides a useful rule of thumb for choosing this fraction.

To introduce Kelly betting, we will first consider a sequential version of the Coin Flip situation above, and show how maximizing the most likely or median value of your wealth can lead to more reasonable decisions than maximizing your expected wealth. Then, we will introduce the Kelly Criterion in a more general setting. Finally, we will explain how to generalize Kelly betting to situations where you have to choose between multiple bets.

18.1 Maximizing the typical value of your wealth

Consider the following iterated version of the Coin Flip example above:

Iterated Coin Flip

There are 100 days. Every day, you get the same offer:

  • You pay some fraction \(\alpha\) of your current wealth
  • A fair coin is flipped
    • if the outcome is Heads, you receive \(3\) times the amount you paid
    • if the outcome is Tails, you receive nothing

We make the simplifying assumption that \(\alpha\) is the same for every day: you have to choose it once and for all.

Question: What fraction \(\alpha\) of your current wealth should you bet on each day?

Let \(x\) be your wealth at the beginning of the first day. After the first coin flip, your wealth becomes:

  • \(x - \alpha x + 3 \alpha x = (1 + 2 \alpha) x\) if the coin comes up Heads
  • \(x - \alpha x = (1 - \alpha) x\) if the coin comes up Tails

After 100 days, your wealth is

\[ (1 + 2 \alpha)^k (1 - \alpha)^{100 - k} x, \text{ with } k \sim \text{ Binomial}(n = 100, p = 1/2). \]

As in the previous example, the expected value of this quantity is greatest when \(\alpha = 1\): to maximize your expected wealth, you should bet all of your money every day. However, this means that every day you have a probability of one half of losing all of your money. Even though you have a tiny chance of becoming enormously rich, you will be ruined with overwhelming probability \(1 - (1 / 2)^{100}\).

Instead of looking at the expectation of your wealth after 100 days, you could focus on its value in the typical case, by considering the modal or median value of your wealth. This corresponds to taking \(k = 50\), hence a typical wealth of

\[ [(1 + 2 \alpha)(1 - \alpha)]^{50} \, x. \]

You can then maximize this modal value over \(\alpha\): this is tantamount to finding the maximum of the second-degree polynomial \((1 + 2 \alpha)(1 - \alpha)\), which is reached at \(\alpha = \frac 1 4\), leading to a median wealth of \(\left(\frac 9 8\right)^{50} x\).
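For completeness, the maximum can be found by expanding the quadratic and setting its derivative to zero:

\[ \frac{d}{d\alpha}\left[(1 + 2 \alpha)(1 - \alpha)\right] = \frac{d}{d\alpha}\left[1 + \alpha - 2 \alpha^2\right] = 1 - 4 \alpha, \]

which vanishes at \(\alpha = \frac 1 4\), where \((1 + 2\alpha)(1 - \alpha) = \frac 3 2 \cdot \frac 3 4 = \frac 9 8\).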

Let us now consider a different bet:

Iterated Biased Coin Flip

Once again, there are 100 days, and every day you get the same offer:

  • You pay some fraction \(\alpha\) of your current wealth
  • With probability \(\frac 2 3\), you receive twice the amount you paid
  • With probability \(\frac 1 3\), you receive nothing

Question: What fraction \(\alpha\) of your current wealth should you bet on each day?

The same reasoning as before shows that, if your initial wealth is \(x\), your wealth after 100 days is

\[ (1 + \alpha)^k (1 - \alpha)^{100 - k} x, \text{ with } k \sim \text{ Binomial}(n = 100, p = 2/3). \]

The median and most likely value of this random variable is approximately \((1 + \alpha)^{\frac 2 3 \cdot 100} (1 - \alpha)^{\frac 1 3 \cdot 100} x\), obtained by taking \(k = \frac 2 3 \cdot 100\). So we want to find \(\alpha \in [0, 1]\) that maximizes the cubic \((1 + \alpha)^2 (1 - \alpha)\): this corresponds to \(\alpha = \frac 1 3\).
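The same calculation works here:

\[ \frac{d}{d\alpha}\left[(1 + \alpha)^2 (1 - \alpha)\right] = (1 + \alpha)\left[2(1 - \alpha) - (1 + \alpha)\right] = (1 + \alpha)(1 - 3 \alpha), \]

which is positive for \(\alpha < \frac 1 3\) and negative for \(\alpha > \frac 1 3\), so the maximum over \([0, 1]\) is indeed attained at \(\alpha = \frac 1 3\).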

In both these examples, we saw how maximizing the modal or median value of your wealth leads to more intuitive levels of risk-taking than maximizing the expected value. In the next section, we will introduce the Kelly Criterion in a more general setting.

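To make this concrete, here is a minimal Monte Carlo sketch of the two iterated bets above (assuming numpy is available; `median_final_wealth` is just an illustrative name). It estimates the median final wealth for a few values of \(\alpha\); the fractions suggested above (\(\frac 1 4\) and \(\frac 1 3\)) should come out well ahead of more aggressive choices, and betting everything should give a median of zero.

```python
import numpy as np

rng = np.random.default_rng(0)

def median_final_wealth(alpha, p_win, gross_if_win, days=100, n_paths=100_000):
    """Median wealth (starting from 1) after `days` rounds, betting a fraction
    `alpha` each day; a win pays back `gross_if_win` times the stake, a loss nothing."""
    k = rng.binomial(days, p_win, size=n_paths)          # number of winning days per path
    wealth = (1 + (gross_if_win - 1) * alpha) ** k * (1 - alpha) ** (days - k)
    return np.median(wealth)

# Iterated Coin Flip: fair coin, a win pays back 3x the stake
for alpha in [0.1, 0.25, 0.5, 1.0]:
    print(f"fair coin,   alpha={alpha:.2f}: median wealth ~ {median_final_wealth(alpha, 1/2, 3):.3g}")

# Iterated Biased Coin Flip: win with probability 2/3, a win pays back 2x the stake
for alpha in [0.1, 1/3, 0.5, 1.0]:
    print(f"biased coin, alpha={alpha:.2f}: median wealth ~ {median_final_wealth(alpha, 2/3, 2):.3g}")
```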

18.2 Kelly Betting: the general case

18.2.1 Maximizing the Expected Logarithm

Suppose that:

  • Your initial wealth is \(S\) dollars
  • Every day \(t\):
    • You invest some fraction \(\alpha\) of your current wealth
    • Your net earnings are \(X_t\) times the amount you invested (i.e., you get back \(1 + X_t\) times that amount)

Moreover, we make the following assumptions:

  • The \(X_t\) are sampled independently from the same distribution: \(X_t \overset{iid}{\sim} p\)
  • \(\alpha X_t > -1\) almost surely: your wealth always stays positive, i.e. you can never lose everything in a single day

Then, after \(T\) days, your wealth is:

\[ S (1 + \alpha X_1)(1 + \alpha X_2) \dots (1 + \alpha X_T). \]

In general, we understand the limiting behavior of sums of random variables better than the limiting behavior of products. This motivates taking the logarithm of your wealth and, dropping the constant \(\log S\), looking at its average growth per day:

\[ \frac 1 T \sum\limits_{t=1}^T \log \left(1 + \alpha X_t\right). \]

As \(T\) tends to infinity, we can apply the Law of Large Numbers to this average, which converges almost surely to the expected logarithm:

\[ \mathbb{E}_{X \sim p}\left[ \log (1 + \alpha X) \right]. \]

Choosing \(\alpha\) to maximize this expected logarithm is known as the Kelly Criterion.
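When the distribution of \(X\) does not lead to a closed-form solution, the maximization can be done numerically. Below is a minimal sketch (assuming numpy and scipy; `kelly_fraction` is just an illustrative name) that maximizes a Monte Carlo estimate of \(\mathbb{E}[\log(1 + \alpha X)]\) over \(\alpha \in [0, 1)\):

```python
import numpy as np
from scipy.optimize import minimize_scalar

rng = np.random.default_rng(0)

def kelly_fraction(sample_returns, n_samples=200_000):
    """Numerically maximize a Monte Carlo estimate of E[log(1 + alpha * X)]
    over alpha in [0, 1), where sample_returns(n) draws n i.i.d. returns X."""
    x = sample_returns(n_samples)

    def neg_expected_log(alpha):
        return -np.mean(np.log1p(alpha * x))

    result = minimize_scalar(neg_expected_log, bounds=(0.0, 0.999), method="bounded")
    return result.x

# Example: the Iterated Biased Coin Flip (X = +1 w.p. 2/3, X = -1 w.p. 1/3).
# The numerical optimum should be close to the exact Kelly fraction 1/3.
def biased_coin(n):
    return rng.choice([1.0, -1.0], size=n, p=[2 / 3, 1 / 3])

print(kelly_fraction(biased_coin))
```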

18.2.2 The Kelly Criterion for binary bets

The following bet is a common use case of the Kelly Criterion:

  • With probability \(p\), you win and are returned \(b + 1\) times your investment, so you net \(b\) times your investment.
  • With probability \(q = 1- p\), you lose and receive nothing.

In this case, the expected logarithm is

\[ p \log \left(1 + b\alpha\right) + q \log \left(1 - \alpha\right) \]

The derivative vanishes when:

\[ \frac {p b} {1 + b \alpha} - \frac {q} {1 - \alpha} = 0, \]

which, after multiplying through by \((1 + b\alpha)(1 - \alpha)\) and using \(p + q = 1\), is equivalent to:

\[ \alpha = p - \frac q b. \]

This fraction increases when your probability of winning \(p\) increases, and when \(b\) increases. It vanishes when \(b = q / p\): in this case, the bet has zero expectation and you have no edge, so the Kelly criterion recommends betting nothing.

As an example, consider the Iterated Biased Coin Flip from the previous section. In that case, \(p = 2/3\), \(q = 1/3\) and \(b = 1\), thus the Kelly criterion suggests \(\alpha = 1/3\). This is the same as the result obtained by maximizing the median value of the wealth.
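As a small illustration (a sketch; `binary_kelly` is just an illustrative name), the closed form is a one-liner. It also recovers the \(\alpha = \frac 1 4\) found earlier for the Iterated Coin Flip, where \(p = \frac 1 2\) and \(b = 2\):

```python
def binary_kelly(p, b):
    """Kelly fraction p - q/b for a binary bet: with probability p you net
    b times your stake, with probability q = 1 - p you lose the stake.
    A non-positive result means the bet offers no edge: bet nothing."""
    return p - (1 - p) / b

print(binary_kelly(1/2, 2))  # Iterated Coin Flip        -> 0.25
print(binary_kelly(2/3, 1))  # Iterated Biased Coin Flip -> 0.333...
```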

18.3 Choosing between multiple bets

So far, we have considered cases where there is only one possible bet: you can either invest your money in this bet, or keep it. However, a more realistic situation is one where, at any point in time, you can choose between multiple bets. In that case, how should you split your money between them?

For example, suppose you could invest a fraction \(\alpha_1\) of your wealth in a first bet with return \(X^{(1)} \sim p_1\), and a fraction \(\alpha_2\) of your wealth in a second bet with return \(X^{(2)} \sim p_2\). Using the same reasoning as in the previous section, you could maximize the following quantity over \((\alpha_1, \alpha_2)\):

\[ \mathbb{E}_{X^{(1)} \sim p_1, X^{(2)} \sim p_2}\left[ \log \left(1 + \alpha_1 X^{(1)} + \alpha_2 X^{(2)}\right) \right]. \]

However, this optimization problem often becomes intractable as the number of possible bets becomes large. For example, think about the enormous number of possible investments a trader can make at any point in time.

One idea could be to use the Taylor approximation \(\log(1 + z) \simeq z - \frac {z^2} 2\). For a single bet, this yields:

\[ \mathbb{E}_{X \sim p}\left[ \log (1 + \alpha X) \right] \simeq \alpha \mathbb{E}(X) - \frac {\alpha^2} 2 \mathbb{E}(X^2), \]

which is maximized by \(\alpha = \frac {\mathbb{E}(X)} {\mathbb{E}(X^2)}.\) Intuitively, this approximation favors bets whose returns have a high expected value but a low variance.
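As a sanity check on a case we already know: for the binary bet of the previous section, \(\mathbb{E}(X) = pb - q\) and \(\mathbb{E}(X^2) = pb^2 + q\), so the approximation suggests

\[ \alpha \simeq \frac{pb - q}{pb^2 + q}. \]

For the Iterated Biased Coin Flip (\(p = \frac 2 3\), \(b = 1\)) this gives \(\alpha \simeq \frac 1 3\), matching the exact Kelly fraction; for the Iterated Coin Flip (\(p = \frac 1 2\), \(b = 2\)) it gives \(\alpha \simeq \frac 1 5\), reasonably close to the exact \(\frac 1 4\).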

When choosing between \(n\) bets with returns \(X^{(i)} \sim p_i\), a first, simpler case is when these bets are all independent. Then, the same reasoning applies to each bet separately, and

\[ \alpha_i \propto \frac {\mathbb{E}\left(X^{(i)}\right)} {\mathbb{E}\left[\left(X^{(i)}\right)^2\right]}. \]

The formula becomes more complicated when the \(X^{(i)}\) are no longer independent. Writing \(X = (X^{(1)}, \dots, X^{(n)})^\top\) and \(\alpha = (\alpha_1, \dots, \alpha_n)^\top\), the Taylor approximation gives:

\[ \mathbb{E}\left[ \log \left(1 + \sum\limits_{i=1}^n \alpha_i X^{(i)}\right) \right] \simeq \alpha^\top \mathbb{E}(X) - \frac 1 2 \alpha^\top \mathbb{E}(XX^\top) \alpha, \]

which is maximized by \(\alpha = \mathbb{E}(XX^\top)^{-1} \mathbb{E}(X)\) (assuming that \(\mathbb{E}(XX^\top)\) is invertible). Qualitatively, this approximation favors diversification, by assigning greater fractions \(\alpha_i\) to bets that are less dependent on the others.
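As a minimal numerical sketch (assuming numpy; the returns below are purely illustrative made-up numbers, and `approximate_kelly` is just an illustrative name), the approximate allocation can be estimated directly from samples of the joint return vector:

```python
import numpy as np

rng = np.random.default_rng(0)

def approximate_kelly(samples):
    """Taylor-approximation allocation alpha = E[X X^T]^{-1} E[X],
    estimated from an (n_samples, n_bets) array of joint returns."""
    mean = samples.mean(axis=0)                           # estimate of E[X]
    second_moment = samples.T @ samples / len(samples)    # estimate of E[X X^T]
    return np.linalg.solve(second_moment, mean)

# Purely illustrative example: two bets driven by a common shock, so their
# returns are positively correlated (all numbers are made up).
n_samples = 200_000
common = rng.normal(size=n_samples)
x1 = 0.05 + 0.2 * common + 0.2 * rng.normal(size=n_samples)
x2 = 0.05 + 0.2 * common + 0.2 * rng.normal(size=n_samples)
samples = np.column_stack([x1, x2])
print(approximate_kelly(samples))  # smaller fractions than if the bets were independent
```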