A very mathy paper with detailed derivations. Useful for getting comfortable with cont time Bayesian learning in portfolio-related problems.
Intro
Historical background: Markowitz 1952 studies one-period portfolio choice problem in mean-variance framework. Then, CAPM was introduced based on this idea. Samuelson and Merton then generalized Markowitz problem to multi-period consumption/investment problem (in discrete and continuous time, respectively).
Specifically, Merton used PDE techniques to characterize optimal consumption and portfolio (investment) cohice processes. Extensions were introduced such as transaction costs & credit constraints.
Major advances to solve Merton’s problem in full generality by Karatzas et al using martingale methods. This work itself was extended and we can use martingale method to solve Merton’s problem for almost any smooth utility function.
Prop1: under , is independent of , .
Proof: we know that is independent of under . We try to use this condition to show that , we have .
We first use some algebraic manipulations to show that .
then recall , where .
Thus, . We use this to show that .
Then, we recall that the standard normal distribution has charactersitic function . So . Thus we are done.
Now we can start deriving an expression for . And an expression for the dynamics of .
Optimal Portfolio Choice
note that this is a Merton portfolio choice problem, the “strategy” here refers to the shares of stocks held, not the trading speed as in A&C.
Given strategy , the portfolio value process satisfies:
Note also that here refers to the portfolio value at , not the market volume later when we consider the A&C type problem.
We rewrite portfolio value process as:
we can then plug in the expression for .
We also rewrite (recall ) using and .
Finally, starting at , we could express and using the integram forms.
Define value function :
Note that here is not value function, but the “wealth” process (of the portfolio), while is the current log stock price.
The associated HJB is:
, with terminal condition .
To solve this HJB, use an ansatz:
Plugging in, we get a simple, linear parabolic PDE, and can use the classic Feynman-Kac representation to write a strong solution of it.
Then, paper gives derivation of similar results for the case of CRRA agents.
Optimal portfolio choice in Gaussian case: two routes
again, solving optimal portfolio choice problem boils down to solving linear parabolic PDEs in the CARA & CRRA case. one case for which these PDEs have closed form solutions is when the prior is Gaussian.
There are two routs to solve the problems with PDEs: using as state var, or using . (again, recall is log stock price)
First, we can show that under Gaussian prior, we have simplified expression for and its dynamics:
- define
classic result is that the posterior distro of given is . Note that the covariance matrix process is deterministic.
the problem can be written with two sets of state vars:
We can consider the previous optimization problem as being described by:
or,
Because is affine in for all in the Gaussian case, we see from the Feynman-Kac representation that , is a polynomial of degree 2 in . But looking for this polynomial using PDE is cumbersome.
The main reason is that is in fact a more natural variable to solve the problem, than . In fact, the best ansatz if we try to solve the problem using state is:
.
plugging in this form to HJB we get system of linear ODEs for and .
Now, solve the problem using as a state var. define value function via:
where:
Then again, formulate HJB with terminal condition, make ansatz, and derive the associated system of linear ODEs.
Online learning & execution costs
Present a general optimization problem, derive HJB, derive a simpler PDE using ansatz. Then focus on special cases.
Again, here I consider the special case of a single risky asset, along with a bond with return .
The risky asset has price dynamics: . is unknown, with prior distro denoted by .
Again, let
Theorem 8: define function , this function is well-defined. Let’s define another function via: , then .
Then, define by . We can show that is a BM adapted to , and .
Now, we use inventory level
and cash level
as state variables.
1.
2.
Here is a deterministic process modelling the market volume for the risky asset. models execution costs, and satisfies certain properties like convexity. (In Almgren-Chriss, )
Writing all the processes in the integral form:
(starting at time , when stock price is at , agent’s cash is at , and agent uses trading stategy )
Suppose agent’s initial state is . The optimization problem is:
- often in liquidation problem assume .
Introduce value function , we can write a general HJB and terminal conditions.
Can also make an ansatz of the form of the value function: , and then the HJB as well as the terminal conditions would be written in terms of this , which is not dependent on current wealth level .
Note that in the general case, this simpler HJB is still not linear, and does not have closed form solutions generally (hence numerical schemes are required such as S-DFP may be of relevance). however, in the special case where:
- Gaussian prior
- execution costs and penalty functions are quadratic as in Almgren-Chriss
then solving the problem boils down to solve a system of ODEs. Specifically, we can show that in this special case, , where solve a system of ODEs.
Finally, some numerical experiments. Main feature is trend following: buy stocks when stock price increases, sell stocks when price decreases, in a smooth manner.