OPTIMAL TIME-CONSISTENT INVESTMENT AND REINSURANCE STRATEGIES FOR MEAN-VARIANCE INSURER UNDER THE DEPENDENT RISK MODEL


LIU Sheng-wang, LI Bing

School of Sciences, Hebei University of Technology, Tianjin 300401, China

Received date: 2017-05-25; Accepted date: 2017-10-18

Foundation item: Supported by the National Natural Science Foundation of China (11201111; 11471218)

Biography: Liu Shengwang (1990-), male, born at Dingzhou, Hebei, postgraduate, major in stochastic process and its application

Abstract: In this paper, we study the optimal investment-reinsurance problem in a risk model with two dependent classes of insurance business. Under the criterion of mean-variance, we aim to seek the corresponding time-consistent strategies within a game theoretic framework. By solving an extended Hamilton-Jacobi-Bellman system, the closed-form expressions of the optimal time-consistent investment-reinsurance strategies and the optimal value function are derived. Finally, some numerical illustrations are presented to show the impact of model parameters on the optimal strategies.

Keywords: equilibrium strategy; Hamilton-Jacobi-Bellman equation; mean-variance criterion; proportional reinsurance

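Optimal time-consistent investment and reinsurance strategies under the mean-variance criterion in a dependent risk model framework

LIU Sheng-wang, LI Bing

School of Sciences, Hebei University of Technology, Tianjin 300401

Abstract: This paper studies the optimal investment and reinsurance problem of an insurance company within the framework of a dependent risk model. Under the mean-variance criterion, we use game theory to solve the extended HJB system, obtain the optimal time-consistent investment and reinsurance strategies together with the corresponding optimal value function, and illustrate the impact of the model parameters on the optimal strategies through numerical examples.

Keywords: equilibrium strategy; HJB equation; mean-variance criterion; proportional reinsurance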

1 Introduction

Nowadays, investment and reinsurance play increasingly important roles in the insurance business, and optimal investment and reinsurance problems for insurers have attracted much attention. Most existing works adopt utility maximization or ruin probability minimization as the objective function. For example, Browne [1] considered a diffusion risk model and obtained investment strategies that maximize the exponential utility or minimize the probability of ruin. Yang and Zhang [2] studied the optimal investment strategies for an insurer to maximize the expected exponential utility of terminal wealth or maximize the survival probability, where the surplus process follows a jump-diffusion model. Furthermore, Xu et al. [3], Gu et al. [4], Liang et al. [5] and Guan and Liang [6] investigated optimal investment-reinsurance strategies for an insurer to optimize the expected utility of terminal wealth in different situations.

Recently, optimal investment and reinsurance problems for insurers under the mean-variance criterion, introduced by Markowitz [7], have drawn much attention. For example, Bäuerle [8] considered an optimal proportional reinsurance problem and obtained a closed-form optimal strategy under the mean-variance criterion, where the surplus process is modelled by the classical Cramér-Lundberg model. Bai and Zhang [9] studied the optimal investment-reinsurance strategy for the mean-variance problem, where the surplus of the insurer is described by the Cramér-Lundberg model and by an approximating diffusion model. Zeng et al. [10] assumed that the surplus of an insurer is modelled by a jump-diffusion process, and derived closed-form optimal investment policies by the stochastic maximum principle under benchmark and mean-variance criteria.

However, it is well known that the mean-variance criterion lacks the iterated-expectation property. As a result, the stochastic control problem under the mean-variance criterion is time-inconsistent, that is, a control maximizing the mean-variance utility at time zero may not be optimal at a later time. In this case, the Bellman optimality principle fails. The main difficulty when facing a time-inconsistent control problem is that we cannot, in general, use the standard dynamic programming principle to derive the Hamilton-Jacobi-Bellman equation.
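As a brief illustration of why the iterated-expectation property fails, one can appeal to the law of total variance. The shorthand $E_t[\cdot]$ and ${\rm Var}_t[\cdot]$ for conditioning on the information at time $t$, and $J_t$ for the mean-variance objective with risk aversion $\gamma$ evaluated at time $t$, is introduced here only for this sketch. For $t\leq s\leq T$ and a terminal wealth $X(T)$,

$ \begin{eqnarray*} {\rm Var}_t[X(T)]&=&E_t[{\rm Var}_s[X(T)]]+{\rm Var}_t[E_s[X(T)]], \\ J_t&=&E_t[X(T)]-\frac{\gamma}{2}{\rm Var}_t[X(T)]=E_t[J_s]-\frac{\gamma}{2}{\rm Var}_t[E_s[X(T)]]. \end{eqnarray*} $

The extra term $-\frac{\gamma}{2}{\rm Var}_t[E_s[X(T)]]$ means that $J_t\neq E_t[J_s]$ in general, so the objective seen at time $t$ is not simply the expected value of the objective seen at time $s$.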

Because time-consistency of strategies is important for a rational decision maker, the main approach to obtaining a time-consistent strategy is to formulate the problem within a non-cooperative game theoretic framework, where player $t$ can be regarded as the future incarnation of ourselves at time $t$. We then aim to derive the equilibrium strategy of the game. For more details, we refer the readers to Björk and Murgoci [11], Björk et al. [12], Ekeland and Lazrak [13], Ekeland and Pirvu [14], Krusell and Smith [15], Phelps and Pollak [16], Strotz [17] and references therein. However, as far as we know, only a few papers concern equilibrium strategies for optimal investment and reinsurance problems under the mean-variance criterion. For example, Zeng and Li [18] were the first to present optimal time-consistent investment and reinsurance strategies for mean-variance insurers, where the surplus of the insurer is modelled by a diffusion model and the price processes of the risky assets are driven by geometric Brownian motions. Later on, Li and Li [19] studied the case with state dependent risk aversion and derived equilibrium strategies via a class of well-posed integral equations. Zeng et al. [20] considered the equilibrium investment and reinsurance strategies for mean-variance insurers with constant risk aversion, where both the surplus process and the risky asset's price process follow geometric Lévy processes. Zhao et al. [21] considered an optimal time-consistent investment and reinsurance problem that takes into account a defaultable security for an insurer under the mean-variance criterion in a jump-diffusion risk model.

Although research on optimal investment-reinsurance and mean-variance problems for insurers has grown rapidly, only a few papers have dealt with the investment-reinsurance problem with dependent risks. Research on dependent risks can be found in Liang and Yuen [22], Yuen et al. [23], Centeno [24] and Bi et al. [25].

In this paper, we aim to derive optimal time-consistent investment and reinsurance strategies for mean-variance insurers with constant risk aversion, where the surplus process follows a dependent risk model and the financial market consists of one risk-free asset and one risky asset whose price process follows a geometric Brownian motion.

The rest of this paper is organized as follows: in Section 2, the model and some assumptions are described; in Section 3, we formulate the optimization problem and provide a verification theorem; in Section 4, we derive the optimal time-consistent investment and reinsurance strategies and the optimal value function. Finally, some numerical illustrations and sensitivity analysis for our results are provided in Section 5.

2 The Model

Let $(\Omega, \mathcal{F}, P)$ be a given complete probability space with a filtration $\{\mathcal{F}_t\}_{t\in[0, T]}$ satisfying the usual conditions, where $T$ is a positive finite constant and represents the time horizon. All stochastic processes introduced below are assumed to be well defined and adapted processes in this space. In addition, we suppose that the insurer has two dependent classes of insurance business such as motor and life insurance.

The surplus process of the insurer is modeled by

$ R(t)=R_0+ct-\Bigg(\sum\limits_{i=1}^{N_{1}(t)+N(t)}Y_i+\sum\limits_{i=1}^{N_{2}(t)+N(t)}Z_i\Bigg), $

where $R_0$ is the deterministic initial surplus of the insurer and the constant $c$ is the premium rate. ${N_{1}(t)}$, ${N_{2}(t)}$, and ${N(t)}$ are three independent Poisson processes with intensity parameters $\lambda_{1}>0$, $\lambda_{2}>0$, and $\lambda>0$, respectively. $Y_i$ are the claim size random variables for the first class with common distribution $F_Y(\cdot)$ and $Z_i$ are the claim size random variables for the second class with common distribution $F_Z(\cdot)$; $\{Y_i, i\geq 1\}$ is assumed to be an i.i.d. sequence with $E(Y_i)=\mu_{1Y}>0$ and $E(Y_i^2)=\mu_{2Y}>0$ and $\{Z_i, i\geq 1\}$ is assumed to be an i.i.d. sequence with $E(Z_i)=\mu_{1Z}>0$ and $E(Z_i^2)=\mu_{2Z}>0$. Thus the compound Poisson process $\hat{S}_1(t):=\sum\limits_{i=1}^{N_{1}(t)+N(t)}Y_i$ represents the cumulative amount of claims for the first class in the time interval $[0, t]$ and $\hat{S}_2(t):=\sum\limits_{i=1}^{N_{2}(t)+N(t)}Z_i$ represents the cumulative amount of claims for the second class in the time interval $[0, t]$. ${N_{1}(t)}$, ${N_{2}(t)}$, ${N(t)}$, $Y_i$ and $Z_i$ are mutually independent. The dependence between the two classes of business is therefore due to the common shock governed by the counting process $N(t)$. Here, the premium rate is assumed to be calculated according to the expected value principle, i.e., $c={(1+\theta_1)(\lambda_1+\lambda)\mu_{1Y}+(1+\theta_2)(\lambda_2+\lambda)\mu_{1Z}}$, where $\theta_1$ and $\theta_2$ are the safety loadings of the insurer for the first class claims and second class claims, respectively.
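The common-shock structure can be illustrated with a small simulation. The sketch below is only an illustration under stated assumptions: the function names are hypothetical, and the claim-size samplers `draw_y` and `draw_z` are supplied by the caller because the model leaves $F_Y$ and $F_Z$ unspecified. Conditioning on the claim counts suggests that ${\rm Cov}(\hat{S}_1(t), \hat{S}_2(t))=\lambda t\mu_{1Y}\mu_{1Z}$, which the simulated covariance should roughly reproduce.

```python
import random

def premium_rate(lam1, lam2, lam, mu1y, mu1z, theta1, theta2):
    """Expected value principle: c = (1+theta1)(lam1+lam)mu1Y + (1+theta2)(lam2+lam)mu1Z."""
    return (1 + theta1) * (lam1 + lam) * mu1y + (1 + theta2) * (lam2 + lam) * mu1z

def poisson_count(rng, rate, t):
    """Number of arrivals of a Poisson process in [0, t], via exponential waiting times."""
    count, clock = 0, rng.expovariate(rate)
    while clock <= t:
        count += 1
        clock += rng.expovariate(rate)
    return count

def simulate_claims(rng, t, lam1, lam2, lam, draw_y, draw_z):
    """One draw of (S1_hat(t), S2_hat(t)); the shared count n is the common shock."""
    n1 = poisson_count(rng, lam1, t)
    n2 = poisson_count(rng, lam2, t)
    n = poisson_count(rng, lam, t)
    s1 = sum(draw_y(rng) for _ in range(n1 + n))
    s2 = sum(draw_z(rng) for _ in range(n2 + n))
    return s1, s2

def sample_covariance(pairs):
    """Unbiased sample covariance of a list of (s1, s2) pairs."""
    n = len(pairs)
    m1 = sum(p[0] for p in pairs) / n
    m2 = sum(p[1] for p in pairs) / n
    return sum((p[0] - m1) * (p[1] - m2) for p in pairs) / (n - 1)

if __name__ == "__main__":
    rng = random.Random(1)
    # Assumption for illustration only: exponential claim sizes with mean 0.3.
    draw = lambda g: g.expovariate(1 / 0.3)
    pairs = [simulate_claims(rng, 1.0, 3, 4, 1, draw, draw) for _ in range(20000)]
    print("premium rate c:", premium_rate(3, 4, 1, 0.3, 0.3, 0.2, 0.2))
    print("sample covariance:", sample_covariance(pairs), "vs lam*t*mu1Y*mu1Z =", 1 * 1.0 * 0.3 * 0.3)
```

Setting the common-shock intensity close to zero in such a simulation should drive the sample covariance toward zero, which is one way to see that $N(t)$ is the only source of dependence in this model.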

Moreover, we allow the insurance company to continuously reinsure a fraction of its claims with retention levels $q_1(t)(\geq0)$ and $q_2(t)(\geq0)$ for $\{Y_i, i\geq 1\}$ and $\{Z_i, i\geq 1\}$, respectively. That is, the insurer pays $q_1(t)Y$ (or $q_2(t)Z$) of a claim occurring at time $t$ and the reinsurer pays $(1-q_1(t))Y$ (or $(1-q_2(t))Z$). Let the reinsurance premium also be calculated by the expected value principle. For this reinsurance cover, the premium has to be paid at rate $(1-q_1(t))(1+\eta_1)(\lambda_1+\lambda)\mu_{1Y}+(1-q_2(t))(1+\eta_2)(\lambda_2+\lambda)\mu_{1Z}$, where $\eta_1$ and $\eta_2$ are the safety loadings of the reinsurer for the first class claims and second class claims, respectively. Without loss of generality, we assume that $\eta_i>\theta_i$, $i=1, 2$. Note that for the insurance company, $q_i(t)\in[0, 1]$ corresponds to a reinsurance cover and $q_i(t)>1$ would mean that the company takes on extra insurance business from other companies, for $i=1, 2$. After reinsurance, the premium rate of the insurer is equal to

$ \begin{eqnarray*} c^q(t)&=&c-[(1-q_1(t))(1+\eta_1)(\lambda_1+\lambda)\mu_{1Y}+(1-q_2(t))(1+\eta_2)(\lambda_2+\lambda)\mu_{1Z}]\\ &=&[(1+\eta_1)q_1(t)+\delta_1](\lambda_1+\lambda)\mu_{1Y}+[(1+\eta_2)q_2(t)+\delta_2](\lambda_2+\lambda)\mu_{1Z}, \end{eqnarray*} $

where $\delta_1=\theta_1-\eta_1$, $\delta_2=\theta_2-\eta_2$. Then the surplus process of the insurer is

$ dR^q(t)=c^q(t)dt-q_1(t)d\hat{S}_1(t)-q_2(t)d\hat{S}_2(t). $
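The two expressions for $c^q(t)$ can be checked numerically. The following is a minimal sketch with a hypothetical function name; it evaluates both forms for constant retention levels and returns them side by side.

```python
def premium_after_reinsurance(q1, q2, lam1, lam2, lam, mu1y, mu1z,
                              theta1, theta2, eta1, eta2):
    """Return both forms of c^q for given retention levels q1, q2."""
    a1 = (lam1 + lam) * mu1y          # expected claim rate, first class
    a2 = (lam2 + lam) * mu1z          # expected claim rate, second class
    c = (1 + theta1) * a1 + (1 + theta2) * a2
    reinsurance_premium = (1 - q1) * (1 + eta1) * a1 + (1 - q2) * (1 + eta2) * a2
    first_form = c - reinsurance_premium
    delta1, delta2 = theta1 - eta1, theta2 - eta2
    second_form = ((1 + eta1) * q1 + delta1) * a1 + ((1 + eta2) * q2 + delta2) * a2
    return first_form, second_form

if __name__ == "__main__":
    args = dict(lam1=3, lam2=4, lam=1, mu1y=0.3, mu1z=0.3,
                theta1=0.2, theta2=0.2, eta1=0.3, eta2=0.3)
    for q in (0.0, 0.5, 1.0):
        print(q, premium_after_reinsurance(q, q, **args))
```

With $q_1=q_2=1$ the insurer keeps the full premium $c$, while with $q_1=q_2=0$ the net premium rate is $\delta_1(\lambda_1+\lambda)\mu_{1Y}+\delta_2(\lambda_2+\lambda)\mu_{1Z}$, which is negative because $\eta_i>\theta_i$.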

Suppose that the financial market consists of a risk-free asset (bond) and a risky asset (stock). The price process of the risk-free asset is modeled by

$ \begin{eqnarray*} \left\{ \begin{array} {l}dS_0(t)=r_0(t)S_0(t)dt, \qquad t\in[0, T], \\ S_0(0)=s_0, \end{array} \right. \end{eqnarray*} $

where $r_0(t)>0$ is the interest rate of the risk-free asset. The price of the risky asset satisfies the following stochastic differential equation

$ \begin{eqnarray*} \left\{ \begin{array} {l}dS_1(t)=S_1(t)[r_1(t)dt+\sigma_1(t)dW(t)], \qquad t\in[0, T], \\ S_1(0)=s_1, \end{array} \right. \end{eqnarray*} $

where $r_1(t)$ ($>r_0(t)$) is the appreciation rate and $\sigma_1(t)$ is the volatility coefficient; $\{W(t)\}$ is a one-dimensional standard Brownian motion, which is independent of ${N_{1}(t)}$, ${N_{2}(t)}$, ${N(t)}$, $Y_i$ and $Z_i$. We assume that $r_0(t)$, $r_1(t)$ and $\sigma_1(t)$ are continuous bounded deterministic functions on $[0, T]$.

Let $X(t)$ denote the insurer's wealth at time $t$. A trading strategy is denoted by $\pi=\{(q_1(t), q_2(t), \beta(t))\}_{t\in[0, T]}$, where $\beta(t)$ is the dollar amount invested in the risky asset at time $t$. The dollar amount invested in the risk-free asset at time $t$ is $X^\pi(t)-\beta(t)$, where $X^\pi(t)$ is the wealth process associated with strategy $\pi$. Then the wealth process $X^\pi(t)$ can be described as

$ \begin{eqnarray*} dX^\pi(t)&=&\frac{X^\pi(t)-\beta(t)}{S_0(t)}dS_0(t)+\frac{\beta(t)}{S_1(t)}dS_1(t)+c^q(t)dt-q_1(t)d\hat{S}_1(t)-q_2(t)d\hat{S}_2(t)\\ &=&\{r_0(t)X^\pi(t)+r(t)\beta(t)+c^q(t)\}dt+\beta(t)\sigma_1(t)dW(t)\\ &&-q_1(t)d\sum\limits_{i=1}^{N_{1}(t)+N(t)}Y_i-q_2(t)d\sum\limits_{i=1}^{N_{2}(t)+N(t)}Z_i, \end{eqnarray*} $

(2.1)

where $r(t)=r_1(t)-r_0(t)$.
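The dynamics (2.1) can be simulated with a simple Euler-type scheme. The sketch below is a rough illustration rather than a definitive implementation: it assumes constant coefficients and a constant strategy $(q_1, q_2, \beta)$, takes the net premium rate `c_q` as an input, and leaves the claim-size samplers `draw_y` and `draw_z` to the caller. All function names are hypothetical.

```python
import math
import random

def poisson_draw(rng, mean):
    """Knuth's multiplication method for a Poisson variate (adequate for small means)."""
    limit = math.exp(-mean)
    count, product = 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count

def simulate_terminal_wealth(rng, x0, T, n_steps, r0, r1, sigma1, beta, q1, q2,
                             lam1, lam2, lam, c_q, draw_y, draw_z):
    """Euler-type simulation of one path of (2.1); returns the wealth at time T."""
    dt = T / n_steps
    x = x0
    for _ in range(n_steps):
        dw = rng.gauss(0.0, math.sqrt(dt))       # Brownian increment
        n1 = poisson_draw(rng, lam1 * dt)        # claims specific to class 1
        n2 = poisson_draw(rng, lam2 * dt)        # claims specific to class 2
        n = poisson_draw(rng, lam * dt)          # common shock hits both classes
        claims_y = sum(draw_y(rng) for _ in range(n1 + n))
        claims_z = sum(draw_z(rng) for _ in range(n2 + n))
        drift = r0 * x + (r1 - r0) * beta + c_q
        x += drift * dt + beta * sigma1 * dw - q1 * claims_y - q2 * claims_z
    return x

if __name__ == "__main__":
    rng = random.Random(0)
    # Assumption for illustration only: exponential claim sizes with mean 0.3.
    draw = lambda g: g.expovariate(1 / 0.3)
    x_T = simulate_terminal_wealth(rng, 1.0, 10.0, 2000, 0.06, 0.12, 0.18,
                                   beta=2.0, q1=0.4, q2=0.4,
                                   lam1=3.0, lam2=4.0, lam=1.0,
                                   c_q=0.4 * 1.3 * (1.2 + 1.5) - 0.27,
                                   draw_y=draw, draw_z=draw)
    print("simulated X(T):", x_T)
```

A convenient sanity check is the degenerate strategy $q_1=q_2=\beta=0$: all randomness drops out and the path should approach the solution of $dX=(r_0X+c^q)dt$.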

Definition 2.1 (Admissible strategy) For any fixed $t\in[0, T]$, a strategy $\pi=\{(q_1(s), q_2(s), \beta(s))\}_{s\in[t, T]}$ is said to be admissible if it satisfies the following conditions:

(ⅰ) $(q_1(s), q_2(s), \beta(s))$ is $\mathcal{F}_s$-predictable;

(ⅱ) $\forall s\in[t, T]$, $q_1(s)\geq0, q_2(s)\geq 0$ and $E[\displaystyle\int_t^T(q_1(s)^2+ q_2(s)^2+\beta(s)^2)ds]<+\infty$;

(ⅲ) $(\pi, X^\pi)$ is the unique solution to the stochastic differential equation (2.1).

For any initial condition $(t, x)\in[0, T]\times R$, let $\Pi(t, x)$ denote the set of all admissible strategies.

3 Problem Formulation in A Game Theoretic Framework

In this section, we will formulate the problem within a game theoretic framework, which is developed by Björk and Murgoci [11]. We consider an optimization problem for the insurer to maximize the expected utility of the terminal wealth, where the utility function is of mean-variance form, that is, for any $(t, x)\in([0, T]\times R)$, the objective function which we want to maximize is given by

$ \begin{eqnarray} J(t, x, \pi)=E_{t, x}[X^\pi(T)]-\frac{\gamma}{2}{\rm Var}_{t, x}[X^\pi(T)], \end{eqnarray} $

(3.1)

where $E_{t, x}[\cdot]=E[\cdot\mid X_t^\pi=x]$, ${\rm Var}_{t, x}[\cdot]={\rm Var}[\cdot\mid X_t^\pi=x]$, $x$ is the initial surplus of the insurer, $\gamma$ is a positive constant representing the degree of risk aversion of the insurer. For convenience, we rewrite the reward function as

$ \begin{eqnarray*} J(t, x, \pi)=E_{t, x}[F(X_T^\pi)]+G(E_{t, x}[X_T^\pi]), \end{eqnarray*} $

where $F(x)=x-\frac{\gamma}{2}x^2$ and $G(x)=\frac{\gamma}{2}x^2$.
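The rewriting of the reward function relies on ${\rm Var}[X]=E[X^2]-(E[X])^2$. As a quick sanity check, the sketch below evaluates both forms on an arbitrary finite sample, treated as an equally weighted distribution; the function names and sample values are illustrative only.

```python
def mean(values):
    return sum(values) / len(values)

def mean_variance_objective(samples, gamma):
    """E[X] - (gamma/2) Var[X], with the population variance of the sample."""
    m = mean(samples)
    variance = mean([(x - m) ** 2 for x in samples])
    return m - 0.5 * gamma * variance

def decomposed_objective(samples, gamma):
    """E[F(X)] + G(E[X]) with F(x) = x - (gamma/2) x^2 and G(x) = (gamma/2) x^2."""
    F = lambda x: x - 0.5 * gamma * x * x
    G = lambda x: 0.5 * gamma * x * x
    return mean([F(x) for x in samples]) + G(mean(samples))

if __name__ == "__main__":
    wealth = [0.5, 1.0, 2.5, 4.0]   # hypothetical terminal wealth outcomes
    print(mean_variance_objective(wealth, 0.5), decomposed_objective(wealth, 0.5))
```

The nonlinear term $G(E_{t, x}[X_T^\pi])$ is what prevents the problem from being a standard expected-utility problem.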

First, we present the following definition of an equilibrium control, which is from Björk and Murgoci [11].

Definition 3.1 (Equilibrium strategy) We say that an admissible strategy $\pi^*$ is an equilibrium strategy if for all given $\pi\in R^+\times R^+\times R$, $h>0$ and $(t, x)\in[0, T]\times R$,

$ \begin{eqnarray*} \liminf\limits_{h\rightarrow 0}\frac{J(t, x, \pi^*)-J(t, x, \pi_h)}{h}\geq 0, \end{eqnarray*} $

where $\pi_h$ is denoted by

$ \begin{eqnarray*} \pi_h(s, y)=\left\{ \begin{array} {l}\pi~~~ {\rm for}\;\; t\leq s < t+h, \: y\in R, \\ \pi^*(s, y)~~~ {\rm for}\;\; t+h\leq s \leq T, \: y\in R. \end{array} \right. \end{eqnarray*} $

The corresponding equilibrium value function $V(t, x)$ is defined by

$ \begin{eqnarray} V(t, x)=J(t, x, \pi^*)=E_{t, x}[X^{\pi^*}(T)]-\frac{\gamma}{2}{\rm Var}_{t, x}[X^{\pi^*}(T)]. \end{eqnarray} $

(3.2)

Based on the definition above, the equilibrium strategy is time-consistent and hereafter we call the equilibrium strategy and the corresponding equilibrium value function the optimal time-consistent strategy and the optimal value function for problem (3.1), respectively. Therefore, we assume that the aim of the insurers is to find an equilibrium strategy and the corresponding equilibrium value function.

Let $C^{1, 2}([0, T]\times R)$ denote the space of functions $\phi(t, x)$ such that $\phi(t, x)$ and its derivatives $\phi_t(t, x)$, $\phi_x(t, x)$, $\phi_{xx}(t, x)$ are continuous on $[0, T]\times R$. For any function $\phi(t, x)\in C^{1, 2}([0, T]\times R)$ and any fixed $\pi\in\Pi$, the usual infinitesimal generator $\mathscr {A}^\pi$, which is described in Björk and Murgoci [11], for the jump-diffusion process (2.1) is defined by

$ \begin{eqnarray*} \begin{array}{cll} \mathscr {A}^\pi \phi(t, x)&=&\phi_t(t, x)+\phi_x(t, x)[r_0(t)x+r(t)\beta^\pi(t)+c^q(t)]+\frac{1}{2}\phi_{xx}(t, x)\sigma_1(t)^2\beta^\pi(t)^2\\ &&+\lambda_1E[\phi(t, x-q_1^\pi(t) Y)-\phi(t, x)]+\lambda_2E[\phi(t, x-q_2^\pi(t) Z)-\phi(t, x)]\\ &&+\lambda E[\phi(t, x-q_1^\pi(t) Y-q_2^\pi(t) Z)-\phi(t, x)]. \end{array} \end{eqnarray*} $

Then, we obtain the following extended Hamilton-Jacobi-Bellman system and the verification theorem.

Theorem 3.2 (Verification theorem) For the optimization problem (3.1), if there exist two real-valued functions $U(t, x), g(t, x)\in C^{1, 2}([0, T]\times R)$ satisfying the following extended HJB system: $\forall(t, x)\in[0, T]\times R$,

$\sup\limits_{\pi\in\Pi(t, x)}\{\mathscr{A}^\pi U(t, x)-\mathscr{A}^\pi\frac{\gamma}{2}g(t, x)^2+\gamma g(t, x)\mathscr{A}^\pi g(t, x)\}=0, $

(3.3)

$ U(T, x)=x, $

(3.4)

$ \mathscr{A}^{\pi^*}g(t, x)=0, $

(3.5)

$ g(T, x)=x, $

(3.6)

where

$ \begin{eqnarray*} \pi^*=\arg\sup\limits_{\pi\in\Pi(t, x)}\{\mathscr{A}^\pi U(t, x)-\mathscr{A}^\pi\frac{\gamma}{2}g(t, x)^2+\gamma g(t, x)\mathscr{A}^\pi g(t, x)\}, \end{eqnarray*} $

then $V(t, x)=U(t, x)$, $E_{t, x}[X^{\pi^*}(T)]=g(t, x)$, and $\pi^*$ is the optimal time-consistent strategy.

The proof of this theorem is similar to that of Theorem $4.1$ of Björk and Murgoci [11].

4 Solution to the Optimization Problem

In this section, we solve the investment-reinsurance optimization problem under the mean-variance criterion.

Suppose that there exist two functions $U(t, x)$ and $g(t, x)$ satisfying the conditions given in Theorem $3.2$. After elementary calculation, (3.3) becomes

$ \begin{eqnarray*} &&\sup\limits_{\pi\in\Pi(t, x)}\{U_t(t, x)+U_x(t, x)[r_0(t)x+r(t)\beta(t)+c^q(t)]+\frac{1}{2}(U_{xx}(t, x)-\gamma g_x(t, x)^2)\sigma_1(t)^2\beta(t)^2\\ &&+\lambda_1E[U(t, x-q_1(t)Y)-\frac{\gamma}{2}g(t, x-q_1(t)Y)(g(t, x-q_1(t)Y)-2g(t, x))]\\ &&+\lambda_2E[U(t, x-q_2(t)Z)-\frac{\gamma}{2}g(t, x-q_2(t)Z)(g(t, x-q_2(t)Z)-2g(t, x))]\\ &&+\lambda E[U(t, x-q_1(t)Y-q_2(t)Z)-\frac{\gamma}{2}g(t, x-q_1(t)Y-q_2(t)Z)(g(t, x-q_1(t)Y-q_2(t)Z)-2g(t, x))]\\ &&-(\lambda_1+\lambda_2+\lambda)[U(t, x)+\frac{\gamma}{2}g(t, x)^2]\}=0. \end{eqnarray*} $

(4.1)

In the following, we aim to solve the optimization problem for the mean-variance criterion. Given the linear structure of (3.5) and (4.1), as well as the boundary conditions, it is natural to guess that

$ U(t, x)=A(t)x+B(t), ~~ A(T)=1, ~~ B(T)=0, $

(4.2)

$ g(t, x)=a(t)x+b(t), ~~ a(T)=1, ~~ b(T)=0. $

(4.3)

The corresponding partial derivatives are

$ \begin{eqnarray*} &&U_t(t, x)=\dot{A}(t)x+\dot{B}(t), ~~ U_x(t, x)=A(t), ~~ U_{xx}(t, x)=0, \\ &&g_t(t, x)=\dot{a}(t)x+\dot{b}(t), ~~ g_x(t, x)=a(t), ~~ g_{xx}(t, x)=0, \end{eqnarray*} $

where $\dot{A}(t)=\frac{dA(t)}{dt}$, $\dot{B}(t)=\frac{dB(t)}{dt}$, $\dot{a}(t)=\frac{da(t)}{dt}$ and $\dot{b}(t)=\frac{db(t)}{dt}$. Plugging $U(t, x)$, $g(t, x)$ and the above derivatives into (4.1) yields

$ \begin{eqnarray} &&\sup\limits_{\pi\in\Pi(t, x)}\{\dot{A}(t)x+\dot{B}(t)+A(t)[r_0(t)x+\delta_1(\lambda_1+\lambda)\mu_{1Y}+\delta_2(\lambda_2+\lambda)\mu_{1Z} \nonumber \\ &&+\eta_1(\lambda_1+\lambda)\mu_{1Y}q_1(t)+\eta_2(\lambda_2+\lambda)\mu_{1Z}q_2(t)+r(t)\beta(t)]-\frac{\gamma}{2}a(t)^2[(\lambda_1+\lambda)\mu_{2Y}q_1(t)^2 \nonumber\\ &&+(\lambda_2+\lambda)\mu_{2Z}q_2(t)^2+2\lambda\mu_{1Y}\mu_{1Z}q_1(t)q_2(t)+\sigma_1(t)^2\beta(t)^2]\}=0. \end{eqnarray} $

(4.4)

Let

$ \begin{eqnarray} f(q_1(t), q_2(t))&=&-\frac{\gamma}{2}a(t)^2(\lambda_1+\lambda)\mu_{2Y}q_1(t)^2-\frac{\gamma}{2}a(t)^2(\lambda_2+\lambda)\mu_{2Z}q_2(t)^2 \nonumber\\ &&+A(t)\eta_1(\lambda_1+\lambda)\mu_{1Y}q_1(t)+A(t)\eta_2(\lambda_2+\lambda)\mu_{1Z}q_2(t)\nonumber\\ &&-\gamma a(t)^2\lambda\mu_{1Y}\mu_{1Z}q_1(t)q_2(t). \end{eqnarray} $

(4.5)

Differentiating the function $f$ with respect to $q_1(t)$ and $q_2(t)$, respectively, we obtain

$ \begin{eqnarray*} \left\{ \begin{array}{lcl} \frac{\partial f(q_1(t), q_2(t))}{\partial q_1(t)}&=&-\gamma a(t)^2(\lambda_1+\lambda)\mu_{2Y}q_1(t)+A(t)\eta_1(\lambda_1+\lambda)\mu_{1Y}-\gamma a(t)^2\lambda\mu_{1Y}\mu_{1Z}q_2(t), \\ \frac{\partial f(q_1(t), q_2(t))}{\partial q_2(t)}&=&-\gamma a(t)^2(\lambda_2+\lambda)\mu_{2Z}q_2(t)+A(t)\eta_2(\lambda_2+\lambda)\mu_{1Z}-\gamma a(t)^2\lambda\mu_{1Y}\mu_{1Z}q_1(t), \\ \frac{\partial^2f(q_1(t), q_2(t))}{\partial q^2_1(t)}&=&-\gamma a(t)^2(\lambda_1+\lambda)\mu_{2Y}, \\ \frac{\partial^2f(q_1(t), q_2(t))}{\partial q^2_2(t)}&=&-\gamma a(t)^2(\lambda_2+\lambda)\mu_{2Z}, \\ \frac{\partial^2f(q_1(t), q_2(t))}{\partial q_1(t)\partial q_2(t)}&=&-\gamma a(t)^2\lambda\mu_{1Y}\mu_{1Z}. \end{array} \right. \end{eqnarray*} $

Then, the Hessian matrix is

$ \begin{eqnarray*} H=\left( \begin{array}{cc} \frac{\partial^2f(q_1(t), q_2(t))}{\partial q^2_1(t)}&\frac{\partial^2f(q_1(t), q_2(t))}{\partial q_1(t)\partial q_2(t)} \\ \frac{\partial^2f(q_1(t), q_2(t))}{\partial q_2(t)\partial q_1(t)} & \frac{\partial^2f(q_1(t), q_2(t))}{\partial q^2_2(t)} \end{array} \right) =\left( \begin{array}{cc} -\gamma a(t)^2(\lambda_1+\lambda)\mu_{2Y}&-\gamma a(t)^2\lambda\mu_{1Y}\mu_{1Z} \\ -\gamma a(t)^2\lambda\mu_{1Y}\mu_{1Z}&-\gamma a(t)^2(\lambda_2+\lambda)\mu_{2Z} \end{array} \right). \end{eqnarray*} $

Since

$ \begin{eqnarray*}&&-\gamma a(t)^2(\lambda_1+\lambda)\mu_{2Y}<0, ~~ -\gamma a(t)^2(\lambda_2+\lambda)\mu_{2Z}<0, \\ &&\gamma^2a^4(t)[(\lambda_1+\lambda)(\lambda_2+\lambda)\mu_{2Y}\mu_{2Z}-\lambda^2\mu_{1Y}^2\mu_{1Z}^2]>0, \end{eqnarray*} $

the Hessian matrix is negative definite, so that, without the restrictions $q_1(t)\geq 0$ and $q_2(t)\geq 0$, the maximizer $(q_1^{\pi^*}(t), q_2^{\pi^*}(t))$ is the solution of the equations

$ \begin{eqnarray*} \left\{ \begin{array}{ccc} -\gamma a(t)^2(\lambda_1+\lambda)\mu_{2Y}q_1(t)+A(t)\eta_1(\lambda_1+\lambda)\mu_{1Y}-\gamma a(t)^2\lambda\mu_{1Y}\mu_{1Z}q_2(t)&=&0, \\ -\gamma a(t)^2(\lambda_2+\lambda)\mu_{2Z}q_2(t)+A(t)\eta_2(\lambda_2+\lambda)\mu_{1Z}-\gamma a(t)^2\lambda\mu_{1Y}\mu_{1Z}q_1(t)&=&0. \end{array} \right. \end{eqnarray*} $

That is

$ q_1^{\pi^*}(t)=\frac{A(t)[-\lambda(\lambda_2+\lambda)\mu_{1Y}\mu_{1Z}^2\eta_2+(\lambda_1+\lambda)(\lambda_2+\lambda)\mu_{1Y}\mu_{2Z}\eta_1]}{\gamma a(t)^2[(\lambda_1+\lambda)(\lambda_2+\lambda)\mu_{2Y}\mu_{2Z}-\lambda^2\mu_{1Y}^2\mu_{1Z}^2]}, $

(4.6)

$ q_2^{\pi^*}(t)=\frac{A(t)[-\lambda(\lambda_1+\lambda)\mu_{1Z}\mu_{1Y}^2\eta_1+(\lambda_1+\lambda)(\lambda_2+\lambda)\mu_{1Z}\mu_{2Y}\eta_2]}{\gamma a(t)^2[(\lambda_1+\lambda)(\lambda_2+\lambda)\mu_{2Y}\mu_{2Z}-\lambda^2\mu_{1Y}^2\mu_{1Z}^2]}. $

(4.7)
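The closed forms (4.6) and (4.7) can be checked against the first-order conditions they are meant to solve. The sketch below uses hypothetical function names, treats $A(t)$ and $a(t)$ as plain numbers, and uses the parameter values of Section 5 purely as an example; it plugs the closed forms back into the two linear equations and reports the residuals.

```python
def retention_closed_form(A, a, gamma, lam1, lam2, lam,
                          mu1y, mu1z, mu2y, mu2z, eta1, eta2):
    """Unconstrained maximizer (q1, q2) as given by (4.6) and (4.7)."""
    det = (lam1 + lam) * (lam2 + lam) * mu2y * mu2z - lam**2 * mu1y**2 * mu1z**2
    num1 = (-lam * (lam2 + lam) * mu1y * mu1z**2 * eta2
            + (lam1 + lam) * (lam2 + lam) * mu1y * mu2z * eta1)
    num2 = (-lam * (lam1 + lam) * mu1z * mu1y**2 * eta1
            + (lam1 + lam) * (lam2 + lam) * mu1z * mu2y * eta2)
    scale = A / (gamma * a * a)
    return scale * num1 / det, scale * num2 / det

def first_order_residuals(q1, q2, A, a, gamma, lam1, lam2, lam,
                          mu1y, mu1z, mu2y, mu2z, eta1, eta2):
    """Left-hand sides of the two first-order conditions; both should vanish."""
    g = gamma * a * a
    res1 = -g * (lam1 + lam) * mu2y * q1 + A * eta1 * (lam1 + lam) * mu1y - g * lam * mu1y * mu1z * q2
    res2 = -g * (lam2 + lam) * mu2z * q2 + A * eta2 * (lam2 + lam) * mu1z - g * lam * mu1y * mu1z * q1
    return res1, res2

if __name__ == "__main__":
    params = dict(A=1.0, a=1.0, gamma=0.5, lam1=3.0, lam2=4.0, lam=1.0,
                  mu1y=0.3, mu1z=0.3, mu2y=0.4, mu2z=0.4, eta1=0.3, eta2=0.3)
    q1, q2 = retention_closed_form(**params)
    print("q1, q2:", q1, q2)
    print("residuals:", first_order_residuals(q1, q2, **params))
```

Setting $A=a=1$ corresponds to evaluating the strategy at the terminal time $T$, where the boundary conditions give $A(T)=a(T)=1$.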

We assume for now that $A(t)>0$, which is confirmed once we obtain the explicit expression of $A(t)$ (see (4.11)). Because of the constraints $q_1(t)\geq 0$, $q_2(t)\geq 0$ and the fact that $\frac{\lambda \mu_{1Z}^2}{(\lambda_1+\lambda)\mu_{2Z}} < 1 <\frac{(\lambda_2+\lambda)\mu_{2Y}}{\lambda\mu_{1Y}^2}$, we discuss the following three cases.

Case 1 $\frac{\lambda\mu_{1Z}^2}{(\lambda_1+\lambda)\mu_{2Z}}\eta_2\leq\eta_1\leq\frac{(\lambda_2+\lambda)\mu_{2Y}}{\lambda\mu_{1Y}^2}\eta_2$, which leads to $q_1^{\pi^*}(t)\geq0$ and $q_2^{\pi^*}(t)\geq0$.

Case 2 $\eta_1<\frac{\lambda \mu_{1Z}^2}{(\lambda_1+\lambda)\mu_{2Z}}\eta_2$, which leads to $q_1^{\pi^*}(t)<0$ and $q_2^{\pi^*}(t)>0$.

Case 3 $\eta_1>\frac{(\lambda_2+\lambda)\mu_{2Y}}{\lambda\mu_{1Y}^2}\eta_2$, which leads to $q_1^{\pi^*}(t)>0$ and $q_2^{\pi^*}(t)<0$.
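The three cases depend only on how $\eta_1$ compares with two multiples of $\eta_2$. A small helper, with a hypothetical name, can classify a given parameter set; it is a sketch that simply mirrors the inequalities stated above.

```python
def classify_case(eta1, eta2, lam1, lam2, lam, mu1y, mu1z, mu2y, mu2z):
    """Return 1, 2 or 3 according to the three cases for the reinsurer's loadings."""
    lower = lam * mu1z**2 / ((lam1 + lam) * mu2z)      # threshold for q1 >= 0
    upper = (lam2 + lam) * mu2y / (lam * mu1y**2)      # threshold for q2 >= 0
    if eta1 < lower * eta2:
        return 2    # unconstrained q1 would be negative
    if eta1 > upper * eta2:
        return 3    # unconstrained q2 would be negative
    return 1        # both unconstrained retention levels are nonnegative

if __name__ == "__main__":
    model = dict(lam1=3.0, lam2=4.0, lam=1.0, mu1y=0.3, mu1z=0.3, mu2y=0.4, mu2z=0.4)
    print(classify_case(0.3, 0.3, **model))
```

Because the lower threshold is below 1 and the upper threshold is above 1, equal loadings $\eta_1=\eta_2$ always fall into Case 1.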

In addition, from equation (4.4), we can get

$ \begin{eqnarray} \beta^{\pi^*}(t)=\frac{A(t)r(t)}{\gamma\sigma_1(t)^2a(t)^2}. \end{eqnarray} $

(4.8)

In the following, we only give the detailed discussion for Case 1. The results in Case 2 and Case 3 can be derived similarly.

Inserting (4.6), (4.7) and (4.8) into (4.4) and (3.5), we have

$ (\dot{A}(t)+r_0(t)A(t))x+\dot{B}(t)+[\delta_1(\lambda_1+\lambda)\mu_{1Y}+\delta_2(\lambda_2+\lambda)\mu_{1Z}]A(t) +\frac{\xi_1(t)A(t)^2}{2\gamma a(t)^2}=0, ~~~~~~~~ $

(4.9)

$ (\dot{a}(t)+r_0(t)a(t))x+\dot{b}(t)+[\delta_1(\lambda_1+\lambda)\mu_{1Y}+\delta_2(\lambda_2+\lambda)\mu_{1Z}]a(t) +\frac{\xi_2(t)A(t)}{\gamma a(t)}=0, $

(4.10)

where

$ \begin{eqnarray*} \xi_1(t)&=&\frac{(\lambda_1+\lambda)(\lambda_2+\lambda)[(\lambda_1+\lambda)\mu_{1Y}^2\mu_{2Z}\eta_1^2+(\lambda_2+\lambda)\mu_{1Z}^2\mu_{2Y}\eta_2^2 -2\lambda\mu_{1Y}^2\mu_{1Z}^2\eta_1\eta_2]}{(\lambda_1+\lambda)(\lambda_2+\lambda)\mu_{2Y}\mu_{2Z}-\lambda^2\mu_{1Y}^2\mu_{1Z}^2}+\frac{r(t)^2}{\sigma_1(t)^2}, \\ \xi_2(t)&=&\frac{(\lambda_1+\lambda)(\lambda_2+\lambda)[(\lambda_1+\lambda)\mu_{1Y}^2\mu_{2Z}\eta_1^2+(\lambda_2+\lambda)\mu_{1Z}^2\mu_{2Y}\eta_2^2 -2\lambda\mu_{1Y}^2\mu_{1Z}^2\eta_1\eta_2]}{(\lambda_1+\lambda)(\lambda_2+\lambda)\mu_{2Y}\mu_{2Z}-\lambda^2\mu_{1Y}^2\mu_{1Z}^2} +\frac{r(t)^2}{\sigma_1(t)^2}. \end{eqnarray*} $

For (4.9) and (4.10) to hold for all $x$, we must have

$ \begin{eqnarray*} &&\dot{A}(t)+r_0(t)A(t)=0, ~~ A(T)=1, \\ &&\dot{B}(t)+[\delta_1(\lambda_1+\lambda)\mu_{1Y}+\delta_2(\lambda_2+\lambda)\mu_{1Z}]A(t)+\frac{\xi_1(t)A(t)^2}{2\gamma a(t)^2}=0, ~~ B(T)=0, \\ &&\dot{a}(t)+r_0(t)a(t)=0, ~~ a(T)=1, \\ &&\dot{b}(t)+[\delta_1(\lambda_1+\lambda)\mu_{1Y}+\delta_2(\lambda_2+\lambda)\mu_{1Z}]a(t)+\frac{\xi_2(t)A(t)}{\gamma a(t)}=0, ~~ b(T)=0. \end{eqnarray*} $

Solving the above equations, we obtain

$ A(t)=e^{\displaystyle\int_t^Tr_0(s)ds}, $

(4.11)

$ B(t)=[\delta_1(\lambda_1+\lambda)\mu_{1Y}+\delta_2(\lambda_2+\lambda)\mu_{1Z}]\displaystyle\int_t^T e^{\displaystyle\int_u^Tr_0(s)ds}du+\frac{1}{2\gamma}\displaystyle\int_t^T\xi_1(s)ds, $

(4.12)

$ a(t)=e^{\displaystyle\int_t^Tr_0(s)ds}, $

(4.13)

$ b(t)=[\delta_1(\lambda_1+\lambda)\mu_{1Y}+\delta_2(\lambda_2+\lambda)\mu_{1Z}]\displaystyle\int_t^T e^{\displaystyle\int_u^Tr_0(s)ds}du+\frac{1}{\gamma}\displaystyle\int_t^T\xi_2(s)ds. $

(4.14)

Substituting (4.11) and (4.13) into (4.6)-(4.8), we have

$ \begin{eqnarray} q_1^{\pi^*}(t)&=&\frac{-\lambda(\lambda_2+\lambda)\mu_{1Y}\mu_{1Z}^2\eta_2+(\lambda_1+\lambda)(\lambda_2+\lambda)\mu_{1Y}\mu_{2Z}\eta_1}{\gamma [(\lambda_1+\lambda)(\lambda_2+\lambda)\mu_{2Y}\mu_{2Z}-\lambda^2\mu_{1Y}^2\mu_{1Z}^2]}e^{-\displaystyle\int_t^Tr_0(s)ds}, \nonumber\\ q_2^{\pi^*}(t)&=&\frac{-\lambda(\lambda_1+\lambda)\mu_{1Z}\mu_{1Y}^2\eta_1+(\lambda_1+\lambda)(\lambda_2+\lambda)\mu_{1Z}\mu_{2Y}\eta_2}{\gamma [(\lambda_1+\lambda)(\lambda_2+\lambda)\mu_{2Y}\mu_{2Z}-\lambda^2\mu_{1Y}^2\mu_{1Z}^2]}e^{-\displaystyle\int_t^Tr_0(s)ds}, \\ \beta^{\pi^*}(t)&=&\frac{r(t)}{\gamma\sigma_1(t)^2}e^{-\displaystyle\int_t^Tr_0(s)ds}.\nonumber \end{eqnarray} $

(4.15)
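With a constant interest rate, every component of (4.15) is a constant multiplied by the discount factor $e^{-r_0(T-t)}$. The sketch below assumes constant $r_0$, $r_1$ and $\sigma_1$ and uses hypothetical function names; the terminal retention level is passed in as a number rather than recomputed.

```python
import math

def investment_amount(t, T, r0, r1, sigma1, gamma):
    """beta*(t) from (4.15) for constant coefficients."""
    return (r1 - r0) / (gamma * sigma1**2) * math.exp(-r0 * (T - t))

def retention_level(t, T, r0, q_terminal):
    """q_i*(t) from (4.15), written as its terminal value times the discount factor."""
    return q_terminal * math.exp(-r0 * (T - t))

if __name__ == "__main__":
    T, r0, r1, sigma1, gamma = 10.0, 0.06, 0.12, 0.18, 0.5
    for t in (0.0, 2.5, 5.0, 7.5, 10.0):
        print(t, investment_amount(t, T, r0, r1, sigma1, gamma),
              retention_level(t, T, r0, 0.4))   # 0.4 is an arbitrary terminal level
```

Since $r_0>0$, both quantities increase toward their terminal values as $t\to T$, and both scale with $1/\gamma$.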

The above discussion leads to the following theorem.

Theorem 4.1 In Case 1, the optimal time-consistent strategy is $\pi^*=(q_1^{\pi^*}(t), q_2^{\pi^*}(t), \beta^{\pi^*}(t))$, where $q_1^{\pi^*}(t)$, $q_2^{\pi^*}(t)$ and $\beta^{\pi^*}(t)$ are given by (4.15) and the optimal value function is given by

$ \begin{eqnarray} V(t, x)=U(t, x)&=&xe^{\displaystyle\int_t^Tr_0(s)ds}+[\delta_1(\lambda_1+\lambda)\mu_{1Y}+\delta_2(\lambda_2+\lambda)\mu_{1Z}]\nonumber \\ &&\times\displaystyle\int_t^Te^{\displaystyle\int_u^Tr_0(s)ds}du+\frac{1}{2\gamma}\displaystyle\int_t^T\xi_1(s)ds \end{eqnarray} $

(4.16)

and

$ \begin{eqnarray} E_{t, x}[X^{\pi^*}(T)]=g(t, x)&=&xe^{\displaystyle\int_t^Tr_0(s)ds}+[\delta_1(\lambda_1+\lambda)\mu_{1Y}+\delta_2(\lambda_2+\lambda)\mu_{1Z}]\nonumber \\ &&\times\displaystyle\int_t^Te^{\displaystyle\int_u^Tr_0(s)ds}du+\frac{1}{\gamma}\displaystyle\int_t^T\xi_2(s)ds. \end{eqnarray} $

(4.17)

According to Theorem $4.1$ and the optimal value function given by (3.2), we have

$ \begin{eqnarray} {\rm Var}_{t, x}[X^{\pi^*}(T)]=\frac{2}{\gamma}(E_{t, x}[X^{\pi^*}(T)]-V(t, x))=\frac{1}{\gamma^2}\displaystyle\int_t^T(2\xi_2(s)-\xi_1(s))ds. \end{eqnarray} $

(4.18)

From (4.17) and (4.18), we can get the relationship between the expectation and the variance of the terminal wealth under the optimal strategy as below

$ \begin{eqnarray} E_{t, x}[X^{\pi^*}(T)]&=&xe^{\displaystyle\int_t^Tr_0(s)ds}+[\delta_1(\lambda_1+\lambda)\mu_{1Y}+\delta_2(\lambda_2+\lambda)\mu_{1Z}]\displaystyle\int_t^Te^{\displaystyle\int_u^Tr_0(s)ds}du \nonumber\\ &&+\sqrt{\frac{{\rm Var}_{t, x}[X^{\pi^*}(T)]}{\displaystyle\int_t^T(2\xi_2(s)-\xi_1(s))ds}}\displaystyle\int_t^T\xi_2(s)ds. \end{eqnarray} $

(4.19)
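The relations (4.16)-(4.19) can be cross-checked numerically when all coefficients are constant, so that the integrals reduce to elementary expressions. In the sketch below the function names are hypothetical, `k` stands for $\delta_1(\lambda_1+\lambda)\mu_{1Y}+\delta_2(\lambda_2+\lambda)\mu_{1Z}$, and `xi1`, `xi2` are passed in as constants; the numerical values in the usage example are placeholders rather than results from the paper.

```python
import math

def annuity_factor(t, T, r0):
    """Integral of exp(r0 (T - u)) du over [t, T] for constant r0."""
    return (math.exp(r0 * (T - t)) - 1.0) / r0

def value_function(x, t, T, r0, k, gamma, xi1):
    """V(t, x) from (4.16) with constant coefficients."""
    return x * math.exp(r0 * (T - t)) + k * annuity_factor(t, T, r0) + xi1 * (T - t) / (2 * gamma)

def terminal_mean(x, t, T, r0, k, gamma, xi2):
    """E[X(T)] from (4.17) with constant coefficients."""
    return x * math.exp(r0 * (T - t)) + k * annuity_factor(t, T, r0) + xi2 * (T - t) / gamma

def terminal_variance(t, T, gamma, xi1, xi2):
    """Var[X(T)] from (4.18) with constant coefficients."""
    return (2 * xi2 - xi1) * (T - t) / gamma**2

def frontier_mean(x, t, T, r0, k, xi1, xi2, variance):
    """Right-hand side of (4.19): the mean expressed through the variance."""
    slope = math.sqrt(variance / ((2 * xi2 - xi1) * (T - t)))
    return x * math.exp(r0 * (T - t)) + k * annuity_factor(t, T, r0) + slope * xi2 * (T - t)

if __name__ == "__main__":
    x, t, T, r0, k, gamma = 1.0, 0.0, 10.0, 0.06, -0.27, 0.5
    xi1, xi2 = 0.25, 0.30   # placeholder constants for illustration
    var = terminal_variance(t, T, gamma, xi1, xi2)
    print(terminal_mean(x, t, T, r0, k, gamma, xi2), frontier_mean(x, t, T, r0, k, xi1, xi2, var))
```

Varying `gamma` while holding the other inputs fixed traces out the frontier: a smaller risk aversion gives both a larger variance and a larger expected terminal wealth.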

The relationship is known as the efficient frontier of problem (3.1) at initial state $(t, x)$ in modern portfolio theory. In addition, by Theorem $4.1$, we find that the optimal reinsurance strategy is independent of the parameters of the risky asset and the optimal investment strategy is independent of the parameters of the insurance business.

5 Numerical Illustration and Sensitivity Analysis

In this section, we provide some numerical examples to illustrate the effects of model parameters on the optimal time-consistent reinsurance-investment strategy. For convenience, we only analyze the results of the original model with $r_0(t)=r_0$, $r_1(t)=r_1$, $\sigma_1(t)=\sigma_1$ for all $t\in[0, T]$. Throughout the numerical analyses, unless otherwise stated, the values of the parameters are given as follows: $\lambda=1$, $\lambda_1=3$, $\lambda_2=4$, $\mu_{1Y}=0.3$, $\mu_{1Z}=0.3$, $\mu_{2Y}=0.4$, $\mu_{2Z}=0.4$, $r_0=0.06$, $r_1=0.12$, $\sigma_1=0.18$, $\gamma=0.5$, $\theta_1=0.2$, $\theta_2=0.2$, $\eta_1=0.3$, $\eta_2=0.3$, $T=10$, $t=0$, $x=1$.

For the reinsurance strategy, we only give the detailed analysis for $q_1^{\pi^*}(t)$. The analysis for $q_2^{\pi^*}(t)$ can be carried out similarly.

Figure 1 shows that the optimal time-consistent reinsurance strategy $q_1^{\pi^*}(t)$ increases with respect to time $t$, namely, as time elapses, the insurer should keep more insurance business by purchasing less reinsurance or acquiring more new business. In addition, subgraphs (a) and (b) illustrate that when the coefficient of the insurer's risk aversion $\gamma$ or the common shock intensity parameter $\lambda$ increases, the insurer will purchase more reinsurance or acquire less new business. Subgraph (c) illustrates that when the intensity of the claims for the first class insurance business $\lambda_1$ increases, the insurer will purchase less reinsurance or acquire more new business. Subgraph (d) illustrates that the intensity of the claims for the second class insurance business $\lambda_2$ has very little impact on the optimal time-consistent reinsurance strategy $q_1^{\pi^*}(t)$.

Figure 1 The impact of parameters on the optimal time-consistent reinsurance strategy
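The qualitative behaviour described for Figure 1 can be reproduced from (4.15) directly. The sketch below is a hedged illustration, not the code used for the figure: it assumes a constant interest rate, uses the baseline parameter values listed in this section, and changes one parameter at a time. The names `q1_star`, `BASE` and `bumped` are hypothetical.

```python
import math

def q1_star(t, T, r0, gamma, lam1, lam2, lam, mu1y, mu1z, mu2y, mu2z, eta1, eta2):
    """First retention level from (4.15), assuming a constant interest rate r0."""
    det = (lam1 + lam) * (lam2 + lam) * mu2y * mu2z - lam**2 * mu1y**2 * mu1z**2
    num = ((lam1 + lam) * (lam2 + lam) * mu1y * mu2z * eta1
           - lam * (lam2 + lam) * mu1y * mu1z**2 * eta2)
    return num / (gamma * det) * math.exp(-r0 * (T - t))

# Baseline values taken from the parameter list of this section.
BASE = dict(t=0.0, T=10.0, r0=0.06, gamma=0.5, lam1=3.0, lam2=4.0, lam=1.0,
            mu1y=0.3, mu1z=0.3, mu2y=0.4, mu2z=0.4, eta1=0.3, eta2=0.3)

def bumped(name, value):
    """q1* at the baseline with a single parameter replaced."""
    return q1_star(**{**BASE, name: value})

if __name__ == "__main__":
    print("baseline:", q1_star(**BASE))
    for name, value in [("gamma", 1.0), ("lam", 2.0), ("lam1", 4.0), ("lam2", 5.0), ("t", 5.0)]:
        print(name, "->", value, ":", bumped(name, value))
```

Sweeping `t` over $[0, T]$ for several values of one parameter gives the kind of curves described for subgraphs (a) to (d).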

Figure 2 shows that the optimal time-consistent investment strategy $\beta^{\pi^*}(t)$ increases with respect to time $t$. Further, the optimal time-consistent investment strategy decreases as $\gamma$ increases. This is reasonable because a larger value of $\gamma$ means that the insurer is more risk averse and will therefore invest less money in the risky asset.

Figure 2 The impact of parameters on the optimal time-consistent investment strategy

6 Conclusion

We have studied the mean-variance optimal investment-reinsurance problem in a risk model with two dependent classes of insurance business, where the two claim number processes are correlated through a common shock component. Since the dynamic mean-variance problem is time-inconsistent, we tackle the problem from a game theoretic perspective. By adopting the approach developed by Björk and Murgoci [11], we obtain the optimal time-consistent investment-reinsurance strategies and the corresponding optimal value function. Finally, the effects of parameters on the optimal time-consistent strategies are presented. In future research, it would be interesting to extend our analysis to more general situations, such as adopting a wealth-dependent risk aversion coefficient or using a jump-diffusion process or a more general Lévy process for the risky asset price. These problems are more complicated, and solving them requires more sophisticated techniques.

References

[1]	Browne S. Optimal investment policies for a firm with random risk process:exponential utility and minimizing the probability of ruin[J]. Math. Oper. Res., 1995, 20(4): 937–958. DOI:10.1287/moor.20.4.937

[2]	Yang Hailiang, Zhang Lihong. Optimal investment for insurer with jump-diffusion risk process[J]. Insur. Math. Econ., 2005, 37(3): 615–634. DOI:10.1016/j.insmatheco.2005.06.009

[3]	Xu Lin, Wang Rongming, Yao Dingjun. On maximizing the expected terminal utility by investment and reinsurance[J]. J. Ind. Manag. Optim., 2017, 4(4): 801–815.

[4]	Gu Mengdi, Yang Yipeng, Li Shoude, Zhang Jingyi. Constant elasticity of variance model for proportional reinsurance and investment strategies[J]. Insur. Math. Econ., 2010, 46(3): 580–587. DOI:10.1016/j.insmatheco.2010.03.001

[5]	Liang Zhibin, Yuen Kam Chuen, Guo Junyi. Optimal proportional reinsurance and investment in a stock market with Ornstein-Uhlenbeck process[J]. Insur. Math. Econ., 2011, 49(2): 207–215. DOI:10.1016/j.insmatheco.2011.04.005

[6]	Guan Guohui, Liang Zongxia. Optimal reinsurance and investment strategies for insurer under interest rate and inflation risks[J]. Insur. Math. Econ., 2014, 55(1): 105–115.

[7]	Markowitz H. Portfolio selection[J]. J. Financ., 1952, 7(1): 77–91.

[8]	Bäuerle N. Benchmark and mean-variance problems for insurers[J]. Math. Meth. Oper. Res., 2005, 62(1): 159–165. DOI:10.1007/s00186-005-0446-1

[9]	Bai Lihua, Zhang Huayue. Dynamic mean-variance problem with constrained risk control for the insurers[J]. Math. Meth. Oper. Res., 2008, 68(1): 181–205. DOI:10.1007/s00186-007-0195-4

[10]	Zeng Yan, Li Zhongfei, Liu Jingjun. Optimal strategies of benchmark and mean-variance portfolio selection problems for insurers[J]. J. Ind. Manag. Optim., 2010, 6(3): 483–496. DOI:10.3934/jimo

[11]	Björk T, Murgoci A. A general theory of Markovian time inconsistent stochastic control problems[J]. Ssrn Electronic J., 2010, 18(3): 545–592.

[12]	Björk T, Murgoci A, Zhou Xunyu. Mean-variance portfolio optimization with state-dependent risk aversion[J]. Math. Finance, 2014, 24(1): 1–24. DOI:10.1111/j.1467-9965.2011.00515.x

[13]	Ekeland I, Lazrak A. Equilibrium policies when preferences are time inconsistent[J]. Queueing Syst., 2008, arXiv: 0808.3790v1. http://www.oalib.com/paper/3912339

[14]	Ekeland I, Pirvu T A. Investment and consumption without commitment[J]. Math. Financ. Econ., 2008, 2(1): 57–86. DOI:10.1007/s11579-008-0014-6

[15]	Krusell P, Smith A. Consumption and savings decisions with quasigeometric discounting[J]. Econometrica, 2003, 71(1): 366–375.

[16]	Phelps E S, Pollak R A. On second-best national saving and game equilibrium growth[J]. Rev. Econom. Stud., 1968, 35(2): 185–199. DOI:10.2307/2296547

[17]	Strotz R. Myopia and inconsistency in dynamic utility maximization[J]. Rev. Econ. Stud., 1955, 23(3): 165–180. DOI:10.2307/2295722

[18]	Zeng Yan, Li Zhongfei. Optimal time-consistent investment and reinsurance policies for mean-variance insurers[J]. Insur. Math. Econ., 2011, 49(1): 145–154. DOI:10.1016/j.insmatheco.2011.01.001

[19]	Li Yongwu, Li Zhongfei. Optimal time-consistent investment and reinsurance strategies for mean-variance insurers with state dependent risk aversion[J]. Insur. Math. Econ., 2013, 53(1): 86–97. DOI:10.1016/j.insmatheco.2013.03.008

[20]	Zeng Yan, Li Zhongfei, Lai Yongzeng. Time-consistent investment and reinsurance strategies for mean-variance insurers with jumps[J]. Insur. Math. Econ., 2013, 52(3): 498–507. DOI:10.1016/j.insmatheco.2013.02.007

[21]	Zhao Hui, Shen Yang, Zeng Yan. Time-consistent investment-reinsurance strategy for mean-variance insurers with a defaultable security[J]. J. Math. Anal. Appl., 2016, 437(2): 1036–1057. DOI:10.1016/j.jmaa.2016.01.035

[22]	Liang Zhibin, Yuen Kam Chuen. Optimal dynamic reinsurance with dependent risks:variance premium principle[J]. Scand. Actuar. J., 2016, 2016(1): 18–36. DOI:10.1080/03461238.2014.892899

[23]	Yuen Kam Chuen, Liang Zhibin, Zhou Ming. Optimal proportional reinsurance with common shock dependence[J]. Insur. Math. Econ., 2015, 64: 1–13. DOI:10.1016/j.insmatheco.2015.04.009

[24]	Centeno M. Dependent risks and excess of loss reinsurance[J]. Insur. Math. Econ., 2005, 37(2): 229–238. DOI:10.1016/j.insmatheco.2004.12.001

[25]	Bi Junna, Liang Zhibin, Xu Fangjun. Optimal mean-variance investment and reinsurance problems for the risk model with common shock dependence[J]. Insur. Math. Econ., 2016, 70: 245–258. DOI:10.1016/j.insmatheco.2016.06.012