Hostname: page-component-cd9895bd7-gxg78 Total loading time: 0 Render date: 2024-12-23T05:49:59.715Z Has data issue: false hasContentIssue false

On the cumulant transforms for Hawkes processes

Published online by Cambridge University Press:  23 February 2023

Young Lee*
Affiliation:
Harvard University
Thorsten Rheinländer*
Affiliation:
Technische Universität Wien
*
*Postal address: Harvard University, 1 Oxford Street, Cambridge, MA 02138, USA.
**Postal address: Wiedner Hauptstr. 8/E105-1 1040 Vienna, Austria.
Rights & Permissions [Opens in a new window]

Abstract

We consider the asset price as the weak solution to a stochastic differential equation driven by both a Brownian motion and the counting process martingale whose predictable compensator follows shot-noise and Hawkes processes. In this framework, we discuss the Esscher martingale measure where the conditions for its existence are detailed. This generalizes certain relationships not yet encountered in the literature.

Type
Original Article
Copyright
© The Author(s), 2023. Published by Cambridge University Press on behalf of Applied Probability Trust

1. Introduction

We initiate the computation of the Esscher martingale measure for a class of general semimartingales whose predictable compensators have shot-noise and Hawkes process flavors.

Many different models to describe the evolution of financial assets have been proposed in the literature. These include the exponential Lévy processes (Merton, Reference Merton1976; Carr and Geman, Reference Carr and Geman2002) with their complements of stochastic volatility counterparts (cf. e.g. Hull and White, Reference Hull and White1987; Barndorff-Nielsen and Shephard, Reference Barndorff-Nielsen and Shephard2001). One approach to modelling the stock price process using a class of pure-jump stochastic processes has been launched by Eberlein and Jacod (Reference Eberlein and Jacod1997), with favourable arguments addressing the empirical realities and their implications of the no-arbitrage principle (Madan et al., Reference Madan, Carr and Chang1998). The price process we consider is capable of capturing contagion and jump clustering, a phenomenon that is empirically relevant when financial markets are in distress. Under these circumstances, stock price crashes occur more frequently than predicted by standard stochastic volatility models (Aït-Sahalia et al., Reference Aït-Sahalia, Cacho-Diaz and Laeven2015). In particular, asset price crashes tend to cluster over short time spans, and standard models are unable to replicate this pattern of crash clustering. Therefore, related Hawkes models have been proposed to better replicate the empirical patterns found in asset returns and lead to an improved fit for option pricing models (Boswijk and Lalu, 2016).

In particular, we study the Esscher martingale measure when the asset price follows a stochastic differential equation driven by both a Brownian motion and the counting process martingale whose predictable compensator follows shot-noise and Hawkes processes (Brémaud and Massoulié, Reference Brémaud and Massoulié2002; Dassios and Zhao, Reference Dassios and Zhao2011; Boumezoued, Reference Boumezoued2016). Such models typically generate markets which are incomplete, wherein there exist infinitely many martingale measures which are equivalent to the physical measure describing the evolution of the underlying stock price process. Each equivalent martingale measure corresponds to a set of derivative prices respecting the no-arbitrage principle. The Esscher martingale measure for Lévy processes and stochastic volatility models is well studied (Hubalek and Sgarra, Reference Hubalek and Sgarra2009). On another spectrum, Dassios and Zhao (Reference Dassios and Zhao2012) discussed the Esscher transform for a generalized contagion model for the purpose of importance sampling and ruin probability computations. The authors showed that under the Esscher transform, ruin becomes certain, and this facilitates simulation more efficiently than under the original measure, where the ruin is not certain and is even rare. Similar ideas for improving the simulation of rare events in the spirit of measure changes can be found in Asmussen (Reference Asmussen1985).

However, the Esscher martingale measure for a new class of compound shot-noise and Hawkes point processes has not been examined. Augmented with a Brownian component in a stochastic logarithm framework, we formulate the asset price as a general semimartingale whose drift is modulated by a one-layer neural network (Schmidhuber, Reference Schmidhuber2015). This model may introduce flexibility and extend the standard log-Brownian paradigm behaviour, capturing unusual business cycles or other economic benchmarks. We determine the Esscher martingale measure in concert with the procedures outlined in Kallsen and Shiryaev (Reference Kallsen and Shiryaev2002).

Section 2 summarizes the basic definitions and main results concerning that of shot-noise and Hawkes processes. In Section 3, we present our asset model as well as the basic properties of cumulant processes. In Section 4 we explicate our martingale approach to uncover the Esscher martingale measure. The conditions for absence of arbitrage and existence of solutions are discussed. Finally, the no-arbitrage valuation of derivatives is detailed in Section 5. We give some concluding remarks in Section 6. In the present text, we generally use the notation of Jacod and Shiryaev (Reference Jacod and Shiryaev2003), where $\mu_N$ and $\nu_N(dx,dt)$ respectively denote the jump measure associated to N and its compensator.

2. Preliminaries and point processes

Let $(\Omega,\mathcal{G},{\mathbb{P}})$ be a probability space. We fix a finite time horizon T, and all stochastic processes are defined on [0, T]. Let N be a simple point process on $\mathbb{R}$ , that is, a family $\{N(W)\}_{W\in\mathcal{B}(\mathbb{R})}$ of random variables with values in $\{0,1,2,...\}\,\cup\,\{\infty\}$ indexed by the Borel $\sigma$ -algebra $\mathcal{B}(\mathbb{R})$ of the real line $\mathbb{R}$ , where $N(W)=\sum_{n\in\mathbb{Z}}\mathbb{1}_W(\tau_n)$ and $(\tau_n)_{n\in\mathbb{Z}}$ is a sequence of extended real-valued random variables such that almost surely (a.s.) $\tau_0\leq 0<\tau_1$ and $\tau_n<\tau_{n+1}$ on $\{\tau_n<\infty\}\cap\{\tau_{n+1}>-\infty\}$ for every $n\in\mathbb{Z}$ . We further assume that it is nonexplosive, wherein $\tau_n\rightarrow \infty$ a.s. as $n\rightarrow\infty$ (see p. 47 of Daley and Vere-Jones, Reference Daley and Vere-Jones2003 and p. 8 of Last and Brandt, Reference Last and Brandt1995, respectively). Let $\mathcal{H}_t=\sigma(N(W),\ W\in\mathcal{B}(\mathbb{R}),\ W\subset(\!-\!\infty,t))$ . We also assume that our probability space supports a Brownian motion B, and denote its ${\mathbb{P}}$ -augmented natural filtration by $(\mathcal{F}_t)$ . Let $(\mathcal{G}_t)=(\mathcal{H}_t)\vee(\mathcal{F}_t)$ . We assume the following martingale invariance property: all $(\mathcal{F}_t)$ - and $(\mathcal{H}_t)$ -martingales remain martingales in the larger filtration $(\mathcal{G}_t)$ .

The process $\lambda_t$ is called the $\mathcal{G}_t$ -conditional intensity of N if for all intervals (s,t], we have

(2.1) \begin{align}{\mathbb{E}}[N((s,t])\,|\,\mathcal{G}_s]={\mathbb{E}}\left[\int_s^t\lambda_u\,du\,\Big|\,\mathcal{G}_s\right],\qquad \textrm{a.s.}\end{align}

We use the notation $N_t\;:\!=\;N(0,t]$ to denote the number of points in the interval (0, t]. The linear Hawkes process with levels of excitation $h\;:\;\mathbb{R}_+\mapsto\mathbb{R}_+$ is a simple point process N admitting the $\mathcal{G}_t$ -intensity

(2.2) \begin{align}\lambda_t = \bar{\lambda}_t + \int_{-\infty}^{t-}h(t-u)dN_u + \int_{-\infty}^{t-}\breve{h}(t-\breve{u})d\breve{N}_{\breve{u}},\end{align}

where $\bar{\lambda}_t>0$ , h (resp. $\breve{h}$ ) is such that $h(\!\cdot\!)\;:\;\mathbb{R}^{+}\rightarrow\mathbb{R}^+$ , and we always assume that $\|h\|_{L^1}=\int_0^{\infty}h(t)dt<\infty$ , which ensures stationarity in the point process N (Daley and Vere-Jones, Reference Daley and Vere-Jones2003). $\breve{N}$ is an inhomogeneous Poisson process with non-random and bounded intensity $\sigma$ . The sequence of event times u gives the jump times of N, and this models the Hawkes or self-exciting property; they are endogenous by construction. By the same token, exogenous events occur at times $\breve{u}$ ; this is known as the shot-noise or commonly referred to as the ‘Cox component’ (Cox and Isham, Reference Cox and Isham1980). Furthermore, the point process martingale $\widetilde{N}$ associated with the Hawkes process is given by

(2.3) \begin{align}\widetilde{N}_{\cdot} \;:\!=\; N_{\cdot} - \int_0^{\cdot} \lambda_u\,du.\end{align}

Characterizations of h and $\breve{h}$ . One such function for h (and that of $\breve{h}$ ) takes the following form: $h(t)=\alpha e^{-\delta t}$ , $t\in[0,T]$ , where $\alpha>0$ and $\delta>0$ . Furthermore, we assume that $\delta>\alpha$ (resp. $\breve{\alpha}$ ), thus guaranteeing stationarity of N (Brémaud and Massoulié, Reference Brémaud and Massoulié2002; Dassios and Zhao, Reference Dassios and Zhao2011). In the case of

(2.4) \begin{align}\bar{\lambda}_t={\lambda^{\flat}} + (\lambda_0-\lambda^{\flat})e^{-\delta t},\end{align}

where $\lambda_0>0$ is the initial intensity at time $t=0$ , and ${\lambda^{\flat}}\geq 0$ is a constant with ${\lambda}^{\flat}<\lambda_0$ , we remark that $\lambda_t$ is Markovian and the pair $(N_t,\lambda_t)_{t\in[0,T]}$ forms a Markov process (Oakes, Reference Oakes1975). Another widely used function is the so-called power decay function, which is particularly used in the field of seismology; it takes the form $h(t)=\beta(t+\gamma)^{-\eta-1}$ , where $\beta\geq 0$ , $\gamma>0$ , $\eta>0$ , and $\eta\gamma^{\eta}>\beta$ for stationary of the point process to hold. When $\eta=0$ , we recover the Omori formula (Omori, Reference Omori1894). When no constraints are placed on $\eta$ , we obtain the modified Omori formula (Utsu et al., Reference Utsu, Ogata and Matsu’ura1995). For further discussion of this type of function, we refer the reader to Ogata (Reference Ogata1998).

3. Structure condition and cumulant processes

We choose the money market account as numéraire, and the interest rate is taken to be null. We assume that the asset price dynamics $\tilde{S}$ under ${\mathbb{P}}$ is given by

(3.1) \begin{align}{\mathbb{P}}\,:\,\,\, \frac{d\tilde{S}_t}{\tilde{S}_t} = a^{{\boldsymbol{\theta}}}_t\,dt + b_t\,dB_t + d(f_t^{H}(x)\,*{(\mu_N-\nu_N)})_t \;:\!=\;dS_t,\end{align}

where $\mu_N$ denotes the jump measure of N while $\nu_N$ denotes its predictable compensator. In our framework of compound shot-noise and Hawkes point processes, we no end have

(3.2) \begin{align}\nu_N\;:\!=\;\nu_N(dx,dt) = \nu(dx)\lambda_t\,dt.\end{align}

The deterministic function $a^{{\boldsymbol{\theta}}}_t$ , representing the drift of the stock price process, is formulated to be a flexible family of functions in the form of a one-layer neural network,

(3.3) \begin{align}a^{{\boldsymbol{\theta}}}_t\;:\!=\;a(t,{\boldsymbol{\theta}})=g_1\left(\alpha+\sum_{k=1}^K \beta_k\cdot g_2 (\gamma_k\cdot t + \pi_k)\right),\end{align}

where $g_1(\!\cdot\!), g_2(\!\cdot\!)$ are bounded activation functions, and K is some number of hidden units (Zell, Reference Zell1997). The quantity ${\boldsymbol{\theta}}$ denotes the set of parameters $\{\alpha,(\beta_k,\gamma_k,\pi_k)_{k=1}^K\}$ , which modulate the drift of the stock price process. Each term $g_2(\gamma_k\cdot t+\pi_k)$ may be seen as a hidden representation of the input time. By choosing K sufficiently large, we can model complex nonlinear drift of the stock price process. Some choices of activation functions include the sigmoid $g_2(z)=(1+e^{-z})^{-1}$ , the inverse square $g_2(z)=(1+z^2)^{-1}$ , the hyperbolic tangent $g_2(z)=\tanh\!(z)$ , and the bounded rectified linear activation function (or ReLU for short), given by $g_2(z)=\min\!(\!\max\!(0,z),D)$ for some predetermined $D>0$ that defines the maximal output value that this function could produce (Schmidhuber, Reference Schmidhuber2015). Furthermore, we assume that $b>\epsilon$ and $f^{H}> -1$ are deterministic and bounded.

To sum up, we work with the following restrictions on the parameters $(a^{{\boldsymbol{\theta}}},b,f^H)$ .

Assumption 1. The quantities $a_t,b_t>\epsilon$ and $f^H_t$ are deterministic, bounded functions with

(3.4) \begin{align}\int (f^H_t(x))^2\nu(dx)<\infty.\end{align}

We often suppress the time variable in the notation, in accordance with convention. Note that $\tilde{S}$ is the stochastic exponential of S, and thus solving the differential equation yields

(3.5) \begin{align}\tilde{S}_t=\mathcal{E}\left(\int_0^{\cdot} a^{{\boldsymbol{\theta}}}\,ds + b\,dB + d(f^{H}(x)\,*{(\mu_N-\nu_N)})\right)_t,\end{align}

where $\mathcal{E}$ is the Doléans-Dade exponential. Our main goal in this manuscript is to calculate as explicitly as possible the Esscher martingale measure in the sense of Kallsen and Shiryaev (Reference Kallsen and Shiryaev2002) for the stochastic logarithm $S=\int d\tilde{S}/\tilde{S}_{-}$ , with probability measures ${\mathbb{Q}}$ such that S is a local ${\mathbb{Q}}$ -martingale. By Theorem III.33 of Protter (Reference Protter2005), the price process $\tilde{S}=\int \tilde{S}_{-} dS$ is a local ${\mathbb{Q}}$ -martingale as well. Moreover we assume that the asset price process S satisfies the following structure condition: the decomposition of S can be uniquely written in the form $S = M + A$ , where

(3.6) \begin{align}A = \int \rho\, d\langle M,M\rangle\end{align}

and $\rho$ is a predictable process satisfying $K_T\;:\!=\;\int_0^T \rho_t^2\,d\langle M,M\rangle_t <\infty$ , ${\mathbb{P}}$ -a.s. (Ansel and Stricker, Reference Ansel and Stricker1992). The notation $\langle M,M\rangle$ denotes the predictable compensator of the quadratic variation process [M, M]. In our Hawkes–Cox-driven price process from Equation (3.1), it is readily computed that

(3.7) \begin{align}\rho = \frac{a^{\boldsymbol{\theta}}}{b^2+\lambda_{-} {\int_{{\mathbb{R}}}} (f^H(x))^2\nu(dx)}\end{align}

as well as that

(3.8) \begin{align}K_T=\int_0^T \frac{(a^{\boldsymbol{\theta}}_t)^2}{b_t^2 +\lambda_t {\int_{{\mathbb{R}}}} (f_t^H(x))^2\nu(dx)}\,dt <\infty,\end{align}

since both the point processes N and $\breve{N}$ are nonexplosive and $\lambda_t$ is finite for $t\in[0,T]$ . Hence S is a special semimartingale (see p. 209 in He et al. (Reference He, Wang and Yan1998)).

One potential approach for the evaluation of a pricing measure is related to the Esscher martingale measure (Madan and Milne, Reference Madan and Milne1991; Gerber and Shiu, Reference Gerber and Shiu1994; Eberlein and Keller, Reference Eberlein and Keller1995; Chan, Reference Chan1999). In this section, we recall some definitions from Kallsen and Shiryaev (Reference Kallsen and Shiryaev2002), which lays out an extended abstract theory for Esscher martingale transforms for general semimartingales.

We remark that if X is a semimartingale, then L(X) denotes the set of predictable X-integrable processes. Recall that a semimartingale X is called special if it can be decomposed as $X = X_0 + M + A$ for some local martingale M and some predictable process A of finite variation, null at $0.$ We denote by $\mathcal{M}_{\textit{loc}}$ the class of all local martingales.

Definition 1. Let X be a real-valued semimartingale. X is called exponentially special if $\exp\!(X-X_0)$ is a special semimartingale. A predictable process A is called the exponential compensator of X if $\exp (X - {X_0} - A) \in {{\cal M}_{{\rm{loc}}}}$ .

Definition 2. The modified Laplace cumulant process $K^X(\vartheta)$ of X in $\vartheta$ is defined to be the exponential compensator of $\int\vartheta dX$ .

Lemma 1. Let $\vartheta\in L(S)$ be such that $\int \vartheta dS$ is exponentially special. Then the modified Laplace cumulant process of S in $\vartheta$ is given by

(3.9) \begin{align}K^{S}(\vartheta)_t = \int_0^t \tilde{\kappa}(\vartheta)_s\,ds,\end{align}

where

(3.10) \begin{align}\tilde{\kappa}(\vartheta)_t = \left(a^{\boldsymbol{\theta}}_t-\lambda_t {\int_{{\mathbb{R}}}} f_t^H(x)\nu(dx)\right)\vartheta_t + \frac{1}{2}b_t^2\vartheta_t^2 + (e^{{\int_{{\mathbb{R}}}} f_t^H(x)\nu(dx)\vartheta_t}-1)\lambda_t.\end{align}

Proof. This follows immediately from Theorem 2.18 of Kallsen and Shiryaev (Reference Kallsen and Shiryaev2002) following the semimartingale characteristics.

The derivative of the cumulant process $DK^{S}(\vartheta)=\int D\tilde{\kappa}^S(\vartheta)\lambda dt$ in the sense of Definition 2.22 in Kallsen and Shiryaev (Reference Kallsen and Shiryaev2002) is given by

(3.11) \begin{align}D\tilde{\kappa}^S(\vartheta) = a^{\boldsymbol{\theta}}-\lambda {\int_{{\mathbb{R}}}} f^H(x)\nu(dx) + b^2\vartheta + f^H(x)\lambda e^{\vartheta f^H(x)}.\end{align}

Assume that there exists a solution $\vartheta^{\sharp}$ to the martingale equation

(3.12) \begin{align}DK^S(\vartheta)=0.\end{align}

In the case when

(3.13) \begin{align}G^{\sharp}_t\;:\!=\;\exp\left(\int_{0+}^t\vartheta^{\sharp}_s dS_s - K^{S}(\vartheta^{\sharp})_t\right)\end{align}

is a martingale, we can then define a probability measure ${\mathbb{Q}}^{\sharp}$ by

(3.14) \begin{align}\frac{d{{\mathbb{Q}}}^{\sharp}}{d{\mathbb{P}}}=\exp\left(\int_{0+}^T\vartheta^{\sharp}_t dS_t - K^{S}(\vartheta^{\sharp})_T\right).\end{align}

The density process $(G^{\sharp}_t)_{t\in[0,T]}$ now defines a unique equivalent martingale measure ${\mathbb{Q}}^{\sharp}$ for S, known as the Esscher martingale measure for the process S (Theorems 4.4 and 4.5 in Kallsen and Shiryaev, Reference Kallsen and Shiryaev2002).

4. Esscher martingale measure

In this section, we spell out the necessary and sufficient conditions for $(G^{\sharp}_t)_{t\in[0,T]}$ to be an Esscher martingale measure.

Lemma 2. Define the following functions:

(4.1) \begin{align}m_1(\vartheta)=a^{\boldsymbol{\theta}},\quad m_2(\vartheta)=\lambda {\int_{{\mathbb{R}}}} f^H(x)\nu(dx) - b^2\vartheta - \lambda{\int_{{\mathbb{R}}}} f^H(x)e^{\vartheta f^H(x)}\nu(dx).\end{align}

Then there exists a bounded function $\vartheta_t^{\sharp}\;:\;[0,T]\rightarrow\mathbb{R}$ solving $m_1(\vartheta_t^{\sharp})=m_2(\vartheta_t^{\sharp})$ for all $t\in[0,T]$ and hence Equation (3.12).

Proof. Observe that for a fixed $t\in[0,T]$ , $m_2$ is decreasing in $\vartheta$ , since

(4.2) \begin{align}m_2^{\prime}(\vartheta)=-b^2-\lambda{\int_{{\mathbb{R}}}} (f^H(x))^2 e^{ \vartheta f^H(x)}\nu(dx)<0,\end{align}

using the fact that $\lambda>0$ is well defined for each $t\in[0,T]$ and b is bounded. Also, note that $m_2(\!-\!\infty)=\infty$ , $m_2(\infty)=-\infty$ . Moreover, $m_1$ is bounded since $a^{\boldsymbol{\theta}}$ is bounded. Hence we conclude by the mean value theorem that there exists for all $t\in[0,T]$ a number $\vartheta_t^{\sharp}$ such that $m_1(\vartheta_t^{\sharp})=m_2(\vartheta_t^{\sharp})$ . Note that $(m_2)^{-1}(\!\sup a^{\boldsymbol{\theta}})\leq\vartheta^{\sharp}\leq (m_2)^{-1}(\!\inf a^{\boldsymbol{\theta}})$ , where $(m_2)^{-1}$ is the inverse function of $m_2$ ; hence $t\mapsto\vartheta_t^{\sharp}$ is a bounded function on [0, T].

We now turn our attention to proving that the process $G^{\sharp}$ is a martingale. For this we will need the following criterion, due to the criterion of Theorem 9 in Protter and Shimbo (Reference Protter and Shimbo2008).

Proposition 1. Let $\widetilde{M}$ be a locally square-integrable martingale such that $\Delta\widetilde{M}>-1$ . If

(4.3) \begin{align}{\mathbb{E}}\left[\exp{\left(\frac{1}{2}\langle \widetilde{M}^c,\widetilde{M}^c\rangle_T+\langle \widetilde{M}^d,\widetilde{M}^d\rangle_T\right)}\right]<\infty,\end{align}

where $\widetilde{M}^c$ and $\widetilde{M}^d$ are the continuous and purely discontinuous martingale parts of $\widetilde{M}$ , then the Doléans-Dade $\mathcal{E}(\widetilde{M})$ is a strictly positive martingale on [0, T].

Lemma 3. Define the following processes:

(4.4) \begin{align}G^{\sharp}=\mathcal{E}(R^{\sharp})\end{align}

with

(4.5) \begin{align}R^{\sharp}=\int b\vartheta^{\sharp}\,dB + \left(e^{\vartheta^{\sharp}(e^{f^H(x)}-1)} -1\right)*{(\mu_N-\nu_N)}.\end{align}

Then $G^{\sharp}$ is a martingale.

Proof. We proceed to evaluate the following quantity:

\begin{align*}\frac{1}{2}\langle R^{\sharp},R^{\sharp}\rangle^c_T+\langle R^{\sharp},R^{\sharp}\rangle^d_T=\frac{1}{2}\int_0^T b_t^2 (\vartheta^{\sharp}_t)^2\,dt + \int_0^T \left({\int_{{\mathbb{R}}}} e^{\vartheta^{\sharp}_t(e^{f_t^H(x)}-1)}\nu(dx) -1\right)^2 \lambda_t \,dt<\infty\end{align*}

with probability 1, by the boundedness of the coefficients and that of $\vartheta^{\sharp}$ , and using the fact that our point processes are nonexplosive. It follows that ${\mathbb{E}}[\!\exp(\frac{1}{2}\langle N,N\rangle^c_T+\langle N,N\rangle^d_T)]<\infty$ . Hence the assertion follows from Proposition 1.

Following the existence of solutions with the martingale property ensured, we state the main result with regard to the Esscher measure.

Theorem 2. Let the conditions of Assumption 1 hold. Then there exists a function $\vartheta^{\sharp}\;:\;[0,T]\rightarrow\mathbb{R}$ with $\vartheta^{\sharp}_t\;:\!=\;\vartheta^{\sharp}(t)$ which solves the martingale equation (3.12) for any $t\in[0,T]$ , i.e.

\begin{align*}a^{\boldsymbol{\theta}}-\lambda {\int_{{\mathbb{R}}}} f^H(x)\nu(dx) + b^2\vartheta + f^H(x)\lambda e^{\vartheta f^H(x)}=0.\end{align*}

Furthermore, the derivative

(4.6) \begin{align}\frac{d{\mathbb{Q}}^{\sharp}}{d{\mathbb{P}}}=\mathcal{E}\left(\int_{0}^{\cdot}\vartheta_t b_t\,dB_t + \left(e^{\vartheta^{\sharp}_t(e^{f_t^H(x)}-1)}-1 \right)\ast (\mu_N-\nu_N) \right)_T\end{align}

defines a probability measure ${\mathbb{Q}}^{\sharp}\sim{\mathbb{P}}$ on $\mathcal{G}_T$ , which is called the Esscher martingale measure for S. Moreover, under ${\mathbb{Q}}^{\sharp}$ , we have that

(4.7) \begin{align}B_t^{{\mathbb{Q}}^{\sharp}}\;:\!=\;B_t - \int_0^t \vartheta_s^{\sharp}b_s ds\end{align}

is a ${\mathbb{Q}}^{\sharp}$ -Brownian motion, and the predictable compensator $\nu^{\sharp}_N$ under ${\mathbb{Q}}^{\sharp}$ is given by

(4.8) \begin{align}\nu^{\sharp}_N\;:\!=\;\nu^{\sharp}_N(dx,dt)=e^{\vartheta_t^{\sharp}(e^{f_t^H(x)}-1)}\nu(dx)\lambda_t\,dt.\end{align}

Proof. From Lemma 2, there exists a function $\vartheta^{\sharp}$ solving Equation (3.12). To complete the proof, we apply Theorem 4.4 in Kallsen and Shiryaev (Reference Kallsen and Shiryaev2002) to conclude that the density in Equation (4.6) defines an equivalent local martingale measure for $\mathcal{E}(S)$ . By Lemma 3, we have that $G^{\sharp}$ is a proper martingale and thus defines a density process. They dynamics under ${\mathbb{Q}}^{\sharp}$ follows from Girsanov’s theorem (cf. Theorem III.40 in Protter (Reference Protter2005)).

Remark 1.

  1. (1) The role of $\vartheta^{\sharp}$ and $a^{\boldsymbol{\theta}}$ . For the Esscher martingale measure to exist, the choice of $\boldsymbol{\theta}$ and the concrete specification of the drift $a^{\boldsymbol{\theta}}$ must be such that there exists a solution $\vartheta^{\sharp}$ that solves the martingale equation (3.12); otherwise we say that the Esscher martingale measure does not exist.

  2. (2) Family of drifts and existence of $\vartheta^{\sharp}$ . It is well known that the dynamics of asset returns cannot be adequately described with constant drift and volatility (Mandelbrot and Taylor, Reference Mandelbrot and Taylor1967; Clark, Reference Clark1973). There have been many attempts to extend the model by describing the evolution of drift and volatility beyond constants (cf. e.g. Masi et al., Reference Masi, Kabanov and Runggaldier1995, and references therein).

  3. (3) Special cases. In our case, the formulation of the flexible family of drifts in Equation (3.1) is an interesting one provided that $\vartheta^{\sharp}$ exists. This one-layer neural network drift naturally subsumes the standard constant drift as a special case if we let $\beta_k=\gamma_k=\pi_k=0$ . Second, for identity activation of $g_1(\!\cdot\!)$ and $g_2(\!\cdot\!)$ a suitable indicator function, we get an $a^{\boldsymbol{\theta}}_t$ that is piecewise constant on intervals and could be utilized as part of modelling a regime switching; see for example Çetin and Verschuere (Reference Çetin and Verschuere2009) and Elliott and Siu (Reference Elliott and Siu2010).

Example (Hawkes with neural network drift). Suppose the stock price process takes the form

(4.9) \begin{align}{\mathbb{P}}\,:\,\,\, \frac{d\tilde{S}_t}{\tilde{S}_t} = a^{{\boldsymbol{\theta}}}_t\,dt + b\,\,dB_t + d(f^H(x)\,*{(\mu_N-\nu_N)})_t,\qquad t\in[0,T],\end{align}

where f is a bounded function independent of time t and b is a constant. We set $a_t^{\boldsymbol{\theta}}$ to be

(4.10) \begin{align}a_t^{\boldsymbol{\theta}}=\alpha+\frac{\beta_1}{1+e^{-(\gamma_1t+\pi_1)}}+\frac{\beta_2}{1+e^{-(\gamma_2t+\pi_2)}},\quad t\in[0,T],\end{align}

where $\boldsymbol{\theta}=(\alpha,\beta_1,\beta_2,\gamma_1,\gamma_2,\pi_1,\pi_2)$ , all of which are taken to be positive constants. We remark that Assumption 1 is satisfied for these choices of $a^{\boldsymbol{\theta}},b,$ and f. We set $g_1(\!\cdot\!)$ and $g_2(\!\cdot\!)$ to be the identity and the sigmoid activation functions, respectively, while setting the number of hidden units to $K = 2$ . We first check for existence of a solution $\vartheta^{\sharp}$ to the martingale equation (3.12), which in this example returns

(4.11) \begin{align}&\quad\alpha+\frac{\beta_1}{1+e^{-(\gamma_1t+\pi_1)}}+\frac{\beta_2}{1+e^{-(\gamma_2t+\pi_2)}}-\lambda {\int_{{\mathbb{R}}}} f^H(x)\nu(dx) + b^2\vartheta + f^H(x)\lambda e^{\vartheta f^H(x)}=0.\end{align}

Define the quantities

(4.12) \begin{align} m_1 &= \alpha+\frac{\beta_1}{1+e^{-(\gamma_1t+\pi_1)}}+\frac{\beta_2}{1+e^{-(\gamma_2t+\pi_2)}},\end{align}
(4.13) \begin{align} m_2 &= \lambda {\int_{{\mathbb{R}}}} f^H(x)\nu(dx) - b^2\vartheta - f^H(x)\lambda e^{\vartheta f^H(x)}.\end{align}

Invoke Lemma 2 to ensure the existence of a solution $\vartheta^{\sharp}$ to Equation (4.11). An illustration is given in Figure 1.

Figure 1. An illustration of the existence of $\vartheta^{\sharp}$ solving Equation (4.11), where $m_1$ and $m_2$ are given in Equations (4.12) and (4.13), respectively.

From Theorem 2, we see that the process

(4.14) \begin{align}B_t - \int_0^t \sigma_s^{\sharp} ds\end{align}

is a ${\mathbb{Q}}^{\sharp}$ -Brownian motion, where we have defined

(4.15) \begin{align}\sigma^{\sharp}\;:\!=\;\vartheta^{\sharp}b.\end{align}

Furthermore, the quantity

(4.16) \begin{align}\nu^{\sharp}_N(dx,dt)=e^{\vartheta_t^{\sharp}(e^{f^H(x)}-1)}\nu(dx)\lambda_t\,dt\end{align}

is the predictable compensator $\nu^{\sharp}_N$ of $\mu_N$ under ${\mathbb{Q}}^{\sharp}$ . The process $\sigma^{\sharp}$ represents the risk premium associated with the continuous martingale part B, and the predictable function $e^{\vartheta_t^{\sharp}(e^{f^H(x)}-1)}-1$ is interpreted as the risk premium associated with the jumps of the discontinuous part of $\mu_N$ (Theorem III.3.23 in Jacod and Shiryaev, Reference Jacod and Shiryaev2003), where $\vartheta^{\sharp}$ solves Equation (4.11).

5. Valuation of contingent claims

Having computed the Esscher martingale measure, we now detail the optimal hedging strategy as well as the differential equations for the prices of derivatives under this measure.

5.1. Mean-variance hedging

One criterion for hedging is mean-variance hedging (Bouleau and Lamberton, Reference Bouleau and Lamberton1989). Consider a contingent claim with maturity $T>0$ , defined by a $\mathcal{G}_T$ -measurable random variable H. Define the initial capital $V_0$ and a trading strategy which will be defined by a portfolio $(\psi^0_t,\psi_t)_{t\in[0,T]}$ taking values in $\mathbb{R}^2$ . As before, we choose the savings account as numéraire and work with a zero interest rate. In mean-variance hedging, we look for a self-financing trading strategy given by an initial capital $V_0$ and a portfolio over the lifetime of the contingent claim which minimizes the shortfall at the terminal date T in a mean square sense:

\begin{eqnarray*}\inf_{\psi,V_0} {\mathbb{E}_{\mathbb{Q}^{\sharp}}}|H-V_T|^2,\quad V_T=V_0+\int_0^T \psi d\tilde{S}_t\end{eqnarray*}

where

\begin{eqnarray*}H-V_T=H-V_0-{\int_0^T}\psi_t d\tilde{S}_t.\end{eqnarray*}

In particular, we have chosen a pricing rule given by the Esscher martingale measure ${\mathbb{Q}}^{\sharp}$ . Let $H\in \mathcal{L}^2(\Omega,\mathbb{G},{{\mathbb{Q}}^{\sharp}})$ ; H has finite variance and $(\tilde{S}_t)_{t\in[0,T]}$ is a square-integrable ${{\mathbb{Q}}^{\sharp}}$ -martingale. Let us consider those $(\psi_t)_{t\in[0,T]}$ whose terminal values satisfy

(5.1) \begin{eqnarray}{\mathcal{X}}\;:\!=\;\left\{\psi\;:\;{\mathbb{E}_{\mathbb{Q}^{\sharp}}}\Big({\int_0^T}\psi^2_td[\tilde{S},\tilde{S}]_t\Big)<{\infty}\right\}.\end{eqnarray}

Define $\mathcal{L}^2(\tilde{S})$ as the set of portfolios $\psi$ satisfying ( $5.1$ ). Since $\psi\in\mathcal{L}^2(\tilde{S})$ , and using the fact that $\tilde{S}$ is a martingale under ${{\mathbb{Q}}^{\sharp}}$ , the gains process $G(\psi)=\int_0^{\cdot} \psi d\tilde{S}$ is also a square-integrable martingale. The mean-variance hedging problem can now be recast as

\begin{eqnarray*}\inf_{\psi\in L^2(\tilde{S}),V_0} {\mathbb{E}_{\mathbb{Q}^{\sharp}}}\left|H-V_0-{\int_0^T}\psi_td\tilde{S}_t\right|^2,\end{eqnarray*}

or equivalently

\begin{equation*}\inf_{\psi\in L^2(\tilde{S}),V_0}{\mathbb{E}_{\mathbb{Q}^{\sharp}}}\left|H-V_0-G_T(\psi)\right|^2.\end{equation*}

The condition ( $5.1$ ) implies that the value process V is a square-integrable martingale, and we have ${\mathbb{E}_{\mathbb{Q}^{\sharp}}}[V_T]=V_0$ . Applying the identity $\mathbb{E}(Z^2)=(\mathbb{E}(Z))^2+\mathbb{V}(Z)$ to the random variable $Z\;:\!=\;H-V_0-G_T(\psi)$ , we obtain

\begin{eqnarray*}J_0(V_0,\psi)&\;:\!=\;&{\mathbb{E}_{\mathbb{Q}^{\sharp}}}\left|H-V_0-{\int_0^T}\psi_t d\tilde{S}_t\right|^2\\[5pt] &=&|{\mathbb{E}_{\mathbb{Q}^{\sharp}}}[H]-V_0|^2+\mathbb{V}(H-V_0-G_T(\psi)).\end{eqnarray*}

If the writer of the contingent claim tries to minimize the residual risk $J_0(V_0,\psi)$ , the optimal value that he/she will ask for is a premium

\begin{eqnarray*}V_0={\mathbb{E}_{\mathbb{Q}^{\sharp}}}[H].\end{eqnarray*}

We see that ${\mathbb{E}_{\mathbb{Q}^{\sharp}}}[H]$ is the initial value of any strategy $\psi\in \mathcal{L}^2(\tilde{S})$ designed to minimize the shortfall at maturity, and we take this as the definition of the price associated with our contingent claim H at time 0. By the same token, if the writer sells the option at time $t>0$ and intends to minimize the remaining risk $J_t(V_t,\psi)\;:\!=\;{\mathbb{E}_{\mathbb{Q}^{\sharp}}}\left( \left|H-V_0-{\int_0^T}\psi_td\tilde{S}_t \right| ^2 \big| \mathcal{G}_t\right)$ , he/she will ask a premium $V_t={\mathbb{E}_{\mathbb{Q}^{\sharp}}}(H\,|\,\mathcal{G}_t)$ . We will take this quantity to define the price of the contingent claim H at time t under the Esscher measure ${{\mathbb{Q}}^{\sharp}}$ .

For illustrative purposes, in the framework of pricing and hedging derivatives, we work with the asset price process in Equation (3.1), setting $a^{\boldsymbol{\theta}}_t$ and $b_t$ to be constants $a>0$ and $b>0$ , respectively, and setting $f^H_t(x)=f^H(x)$ to be a bounded function independent of time. Furthermore, we assume that $h(t)=\alpha e^{-\delta t}$ and, without loss of generality, that $\breve{h}=0$ in Equation (2.2). With this, we see that $(N,\lambda)$ is jointly Markov and $\lambda_t$ can be recorded with an alternative expression

(5.2) \begin{align}\lambda = \int -\delta (\lambda-\lambda^{\flat})dt +\int \alpha dN\end{align}

to Equation (2.2) when $\bar\lambda$ takes the form in Equation (2.4). From Theorem 2, the Esscher martingale measure ${{\mathbb{Q}}^{\sharp}}$ exists and the dynamics of $\tilde{S}$ takes the form

(5.3) \begin{align}{{\mathbb{Q}}^{\sharp}}:\qquad \frac{d\tilde{S}}{S_{-}}=b\,dB^{{{\mathbb{Q}}^{\sharp}}}+\int f^H(x)(\mu_N-\nu_N^{\sharp}),\end{align}

where the quantities ${{\mathbb{Q}}^{\sharp}}$ and $\nu^{\sharp}$ are defined in Theorem 2, and $\vartheta^{\sharp}$ solves Equation (3.12).

5.1.1. Pricing

Consider a contingent claim of European type with maturity T and payoff $H(\tilde{S}_T)$ . The payoff H is assumed to satisfy

(5.4) \begin{align}|H(z_1)-H(z_2)|\leq k|z_1-z_2|\end{align}

for some $k>0$ . For put and call options, this assumption is satisfied with the choice of $k=1$ . The value $C_t$ of such a derivative claim is given by $C_t=C(t,\tilde{S}_t,\lambda_t)$ , where

\begin{equation*}C(t,s,\lambda)=e^{-r(T-t)}E[H(\tilde{S}_T)\,|\,\tilde{S}_t=s,\lambda_t=\lambda].\end{equation*}

Let H be a contingent claim with terminal payoff $H=H(\tilde{S}_T)$ . Then the price of H at time t is given by $C(t,s,\lambda)$ where

\begin{eqnarray*}C\;:\;[0,T]\times[0,{\infty})\times(0,{\infty})\quad &\rightarrow&\quad \mathbb{R},\\[5pt] (t,s,\lambda)\quad&\mapsto& \quad C(t,s,\lambda)={\mathbb{E}_{\mathbb{Q}^{\sharp}}}\left[H(\tilde{S}_T)\,|\,\tilde{S}_t=s,\lambda_t=\lambda\right].\end{eqnarray*}

Assume that $C\in \mathcal{C}^{1,2}$ , i.e., the functions $(t,s,\lambda)\mapsto C(t,\tilde{S},\lambda)$ are continuously differentiable with respect to t and twice continuously differentiable with respect to s. Hence the Itô formula can be applied to $C_t=C(t,\tilde{S}_t,\lambda_t)$ between 0 and T. Define

\begin{equation*}\widetilde{C}^{{{\mathbb{Q}}^{\sharp}},*}_t\;:\!=\;\sup_{t\in[0,T]}|C_t|=\sup_{t\in[0,T]}C_t,\end{equation*}

since $H\geq0$ . Applying the Doob inequality to the martingale C yields

\begin{equation*}{\mathbb{E}_{\mathbb{Q}^{\sharp}}}(\widetilde{C}^{{{\mathbb{Q}}^{\sharp}},*}_t)^2\leq 4{\mathbb{E}_{\mathbb{Q}^{\sharp}}} H^2<{\infty}\end{equation*}

since H has finite variance. By the Burkholder–Davis–Gundy inequality, there exists a constant $c>0$ such that

\begin{eqnarray*}c{\mathbb{E}_{\mathbb{Q}^{\sharp}}}\{[C,C]_T\}\leq {\mathbb{E}_{\mathbb{Q}^{\sharp}}}(\widetilde{C}^{{{\mathbb{Q}}^{\sharp}},*}_T)^2 <{\infty}.\end{eqnarray*}

Hence we obtain

\begin{equation*}{\mathbb{E}_{\mathbb{Q}^{\sharp}}}\{[C,C]_T\}<{\infty}.\end{equation*}

This implies that C is a square-integrable ${{\mathbb{Q}}^{\sharp}}$ -martingale. With the expression of (5.2) and by Corollary I-3.16 of Jacod and Shiryaev (Reference Jacod and Shiryaev2003), we conclude that the finite-variation term vanishes, giving us the PDE

(5.5) \begin{align}&\frac{\partial C}{\partial t}- s\lambda{\int_{{\mathbb{R}}}}f^H(x)e^{\vartheta^{\sharp}(e^{f^H(x)}-1)}\nu(dx) \frac{\partial C}{\partial s} +\frac{1}{2}s^2 b^2\frac{\partial^2 C}{\partial s^2}-\delta(\lambda-a)\frac{\partial C}{\partial\lambda}\notag\\[5pt] &\qquad\qquad+\lambda{\int_{{\mathbb{R}}}}(C(t,s(1+f^H(x)),\lambda+\alpha)-C(t,s,\lambda))e^{\vartheta^{\sharp}(e^{f^H(x)}-1)}\nu(dx)=0\end{align}

with terminal condition

(5.6) \begin{equation}C(T,s,\lambda)=H(s),\qquad \lambda> 0,\,s\geq 0.\end{equation}

Having derived the Esscher price for European-type contracts, we now characterize the optimal hedging strategy.

5.1.2. Optimal hedging strategy

Recall that under ${{\mathbb{Q}}^{\sharp}}$ , the stock price process $\tilde{S}$ is a square-integrable martingale. Consider now a self-financing trading strategy with $\psi\in L^2(\tilde{S})$ ; the value process V of the portfolio is also a martingale whose value at maturity $T>0$ is

\begin{eqnarray*}{{\mathbb{Q}}^{\sharp}}\;:\qquad V_T=V_0+{\int_0^T}\psi_t d\tilde{S}_t=V_0+G_T(\psi).\end{eqnarray*}

Also, $C(T,\tilde{S}_T,\lambda_T)=H(\tilde{S}_T)=H$ and $C(0,\tilde{S}_0,\lambda_0)={\mathbb{E}_{\mathbb{Q}^{\sharp}}}[H(\tilde{S}_T)]=V_0$ , so that

\begin{equation*}H-V_0-G_T(\psi)=C(T,\tilde{S}_T,\lambda_T)-C(0,\tilde{S}_0,\lambda_0)-G_T(\psi).\end{equation*}

Taking into consideration Equation (5.5), we evaluate the quantity

\begin{align*}&C(T,\tilde{S}_T,\lambda_T)-C(0,\tilde{S}_0,\lambda_0)-G_T(\psi)\\[5pt] &={\int_0^T}\left(\psi_t{\tilde{S}_{t-}}-{\tilde{S}_{t-}}\frac{\partial}{\partial s}C(t,{\tilde{S}_{t-}},\lambda_{t-})\right)b\, dB_t\\[5pt] &\qquad +{\int_0^T}{\int_{{\mathbb{R}}}}\big(\psi_t{\tilde{S}_{t-}} f^H(x)-(C(t,{\tilde{S}_{t-}}(1\!+\!f^H(x)),\lambda_{t-}+\alpha)-C(t,{\tilde{S}_{t-}},\lambda_{t-})\big)(\mu_N-\nu^{\sharp}_N)_t,\end{align*}

so that

\begin{align*}J_0(V_0,\psi)&={\mathbb{E}_{\mathbb{Q}^{\sharp}}}\left[{\int_0^T} \tilde{S}^2_{t-}\left(\psi_t-\frac{\partial}{\partial s}(t,{\tilde{S}_{t-}},\lambda_{t-})dt\right)^2 b^2 dt\right]\\[5pt] &\quad+{\mathbb{E}_{\mathbb{Q}^{\sharp}}}\left[{\int_0^T}{\int_{{\mathbb{R}}}}(\psi_t{\tilde{S}_{t-}} f^H(x)\right.\\[5pt] &\left.\qquad-(C(t,{\tilde{S}_{t-}}(1+f^H(x)),\lambda_{t-}+\alpha)-C(t,{\tilde{S}_{t-}},\lambda_{t-})^2))\nu_N^{\sharp}(dx,dt)\right].\end{align*}

To obtain the optimal risk-minimizing hedge, we minimize the above expression with respect to $\psi$ :

\begin{align*}&\tilde{S}^2_{t-}\sigma^2\left(\psi_t-\frac{\partial}{\partial s}(t,{\tilde{S}_{t-}},\lambda_{t-})\right)+\\[5pt] &\qquad\lambda{\int_{{\mathbb{R}}}}({\tilde{S}_{t-}}\psi_t f^H(x)-(C(t,{\tilde{S}_{t-}}(1+f^H(x)),\lambda_{t-}+\alpha)-C(t,{\tilde{S}_{t-}},\lambda_{t-}))\\[5pt] &\qquad\qquad\cdot{\tilde{S}_{t-}} f^H(x)e^{\vartheta^{\sharp}(e^{f^H(x)}-1)}\nu(dx)\overset{!}{=}0.\end{align*}

Hence we have that the minimal-risk hedge amounts to holding a position in the underlying ${(S_t)_{t>0}}$ equal to $\psi_t=\Delta(t,{\tilde{S}_{t-}},\lambda_{t-})$ , where

\begin{align*}&\Delta(t,s,\lambda)\\[5pt] &=\frac{b^2 \frac{\partial}{\partial s}C(t,s,\lambda)+\frac{\lambda}{s}{\int_{{\mathbb{R}}}}(C(t,s(1+f^H(x)),\lambda+\alpha)-C(t,s,\lambda))e^{\vartheta^{\sharp}(e^{f^H(x)}-1)}\nu(dx)}{b^2+\lambda{\int_{{\mathbb{R}}}}(f^H(x))^2e^{\vartheta^{\sharp}(e^{f^H(x)}-1)}\nu(dx)}.\end{align*}

A sanity check for convexity yields that

\begin{align*}\tilde{S}^2_{t-}b^2+\lambda_{t-}{\int_{{\mathbb{R}}}}\tilde{S}^2_{t-}(f^H(x))^2 e^{\vartheta^{\sharp}(e^{f^H(x)}-1)}\nu(dx)>0.\end{align*}

6. Postlude

(1) Integro-differential equations. The assumption that the kernel is of exponential form, i.e., $h(t)=\alpha e^{-\delta t}$ , in Equation (2.2) is instrumental to the results derived in Section 5, owing to the joint Markovian structure of N and $\lambda$ . For general kernels h, it is unclear whether explicit expressions can be obtained. This is due to the non-Markovian nature of the pair $(N,\lambda)$ induced by such kernel functions (cf. discussions in Boumezoued, Reference Boumezoued2016, and Gao et al., Reference Gao, Zhou and Zhu2018).

Our derivations and heuristics suggest that the same result should hold for a wide class of parameter specifications satisfying Assumption 1, provided that Lemma 2 is respected, guaranteeing the existence of $\vartheta^{\sharp}$ to Equation (3.12). In Section 5, we stick to simple specifications for the sake of illustration. Furthermore, since the payoff functions H satisfy the Lipschitz condition in Equation (5.4), these results hold for put and call options, as well as any combinations of these, such as butterfly spreads and strangles, to name a few.

(2) Other martingale measures. Given the emergence of new financial markets where an established market for exchange-traded derivatives has yet to emerge, it is often not possible to identify the statistical martingale measure (see Chapter 6 in Schoutens, Reference Schoutens2003) for the purpose of derivative valuation. The statistical martingale measure is a pricing measure ‘chosen by the market’ through the minimization of some distance between the observed and theoretical option prices. It may be tricky to compute this measure in the absence of a liquid market with actively traded vanilla options written on the underlying asset.

To overcome this problem, we propose the use of the Esscher martingale measure, which, under general conditions, minimizes the entropy-Hellinger distance. This observation has been documented; see Remark 10 in Hubalek and Sgarra, Reference Hubalek and Sgarra2009, which explicates the relationships (cf. Theorem 4.3 in Choulli and Stricker, Reference Choulli and Stricker2005, and Theorem 4.4 in Kallsen and Shiryaev, Reference Kallsen and Shiryaev2002). Other martingale measures, such as the minimal entropy martingale measure (Delbaen et al., Reference Delbaen2002), are often chosen as well. That said, the identification of this measure is quite involved and goes beyond the scope of this paper, so it is left to future research.

(3) Open problems. We have studied the Esscher martingale measure for a general stochastic differential equation augmented by a counting process whose predictable compensator has shot-noise and Hawkes elements. The techniques of proof are fairly general and may be applied to various extensions of the proposed framework. One possibility may be to generalize the Cox element $\breve{N}$ to a Lévy subordinator, which is a.s. increasing.

Acknowledgements

We are thankful to three anonymous referees for their helpful comments on improving the manuscript. This work was conducted while Y. L. was at Harvard University.

Competing interests

There were no competing interests to declare which arose during the preparation or publication process of this article.

References

Aït-Sahalia, Y., Cacho-Diaz, J. and Laeven, R. J. A. (2015). Modeling financial contagion using mutually exciting jump processes. J. Financial Econom. 117, 585606.CrossRefGoogle Scholar
Ansel, J.-P. and Stricker, C. (1992). Lois de martingale, densités et décomposition de Föllmer Schweizer. Ann. Inst. H. Poincaré Prob. Statist. 28, 375392.Google Scholar
Asmussen, S. (1985). Conjugate processes and the simulation of ruin problems. Stoch. Process. Appl. 20, 213229.CrossRefGoogle Scholar
Barndorff-Nielsen, O. and Shephard, N. (2001). Modelling by Lévy processes for financial econometrics. In Lévy Processes, Birkhäuser, Boston, pp. 283318.CrossRefGoogle Scholar
Boswijk, H., Peter, R. J. A. L. and Lalu, A. (2016). Asset returns with self-exciting jumps: option pricing and estimation with a continuum of moments. Tech. Rep., University of Amsterdam.Google Scholar
Bouleau, N. and Lamberton, D. (1989). Residual risks and hedging strategies in Markovian markets. Stoch. Process. Appl. 33, 131150.CrossRefGoogle Scholar
Boumezoued, A. (2016). Population viewpoint on Hawkes processes. Adv. Appl. Prob. 48, 463480.CrossRefGoogle Scholar
Brémaud, P. and Massoulié, L. (2002). Power spectra of general shot noises and Hawkes point processes with a random excitation. Adv. Appl. Prob. 34, 205222.CrossRefGoogle Scholar
Carr, P. and Geman, H. (2002). The fine structure of asset returns: an empirical investigation. J. Business 75, 305332.CrossRefGoogle Scholar
Chan, T. (1999). Pricing contingent claims on stocks driven by Lévy processes. Ann. Appl. Prob. 9, 504528.CrossRefGoogle Scholar
Choulli, T. and Stricker, C. (2005). Minimal entropy-Hellinger martingale measure in incomplete markets. Math. Finance 15, 465490.CrossRefGoogle Scholar
Clark, P. K. (1973). A subordinated stochastic process model with finite variance for speculative prices. Econometrica 41, 135155.CrossRefGoogle Scholar
Cox, D. R. and Isham, V. (1980). Point Processes. Chapman and Hall/CRC, Boca Raton.Google Scholar
Daley, D. J. and Vere-Jones, D. (2003). An Introduction to the Theory of Point Processes, 2nd edn. Springer, New York.Google Scholar
Dassios, A. and Zhao, H. (2011). A dynamic contagion process. Adv. Appl. Prob. 43, 814846.CrossRefGoogle Scholar
Dassios, A. and Zhao, H. (2012). Ruin by dynamic contagion claims. Insurance Math. Econom. 51, 93106.CrossRefGoogle Scholar
Delbaen, F. et al. (2002). Exponential hedging and entropic penalties. Math. Finance 12, 99123.CrossRefGoogle Scholar
Eberlein, E. and Jacod, J. (1997). On the range of options prices. Finance Stoch. 1, 131140.CrossRefGoogle Scholar
Eberlein, E. and Keller, U. (1995). Hyperbolic distributions in finance. Bernoulli 1, 281299.CrossRefGoogle Scholar
Elliott, R. and Siu, T. (2010). On risk minimizing portfolios under a Markovian regime-switching Black–Scholes economy. Ann. Operat. Res. 176, 271291.CrossRefGoogle Scholar
Gao, X., Zhou, X. and Zhu, L. (2018). Transform analysis for Hawkes processes with applications in dark pool trading. Quant. Finance 18, 265282.CrossRefGoogle Scholar
Gerber, H. and Shiu, E. (1994). Option pricing by Esscher transforms. Trans. Soc. Actuaries 46, 99140.Google Scholar
He, S., Wang, J. and Yan, J. (1998). Semimartingale Theory and Stochastic Calculus. CRC Press, Boca Raton.Google Scholar
Hubalek, F. and Sgarra, C. (2009). On the Esscher transforms and other equivalent martingale measures for Barndorff-Nielsen and Shephard stochastic volatility models with jumps. Stoch. Process. Appl. 119, 21372157.CrossRefGoogle Scholar
Hull, J. and White, A. (1987). The pricing of options on assets with stochastic volatilities. J. Finance 42, 281300.CrossRefGoogle Scholar
Jacod, J. and Shiryaev, A. (2003). Limit Theorems for Stochastic Processes, 2nd edn. Springer, Berlin.CrossRefGoogle Scholar
Kallsen, J. and Shiryaev, A. N. (2002). The cumulant process and Esscher’s change of measure. Finance Stoch. 6, 397428.CrossRefGoogle Scholar
Last, G. and Brandt, A. (1995). Marked Point Processes on the Real Line: The Dynamical Approach. Springer, New York.Google Scholar
Madan, D. B., Carr, P. and Chang, E. C. (1998). The variance gamma process and option pricing. Europ. Finance Rev. 2, 79105.CrossRefGoogle Scholar
Madan, D. B. and Milne, F. (1991). Option pricing with v. g. martingale components. Math. Finance 1, 3955.Google Scholar
Mandelbrot, B. and Taylor, H. M. (1967). On the distribution of stock price differences. Operat. Res. 15, 10571062.CrossRefGoogle Scholar
Masi, G. B. D., Kabanov, Y. and Runggaldier, W. J. (1995). Mean-variance hedging of options on stocks with Markov volatilities. Theory Prob. Appl. 39, 172182.CrossRefGoogle Scholar
Merton, R. C. (1976). Option pricing when underlying stock returns are discontinuous. J. Financial Econom. 3, 125144.CrossRefGoogle Scholar
Oakes, D. (1975). The Markovian self-exciting process. J. Appl. Prob. 12, 6977.CrossRefGoogle Scholar
Ogata, Y. (1998). Space-time point-process models for earthquake occurrences. Ann. Inst. Statist. Math. 50, 379402.CrossRefGoogle Scholar
Omori, F. (1894). On the aftershocks of earthquakes. J. College Sci. Imperial Univ. Tokyo 7, 111120.Google Scholar
Protter, P. (2005). Stochastic Integration and Differential Equations. Springer, Berlin, Heidelberg.CrossRefGoogle Scholar
Protter, P. and Shimbo, K. (2008). No arbitrage and general semimartingales. In Markov Processes and Related Topics: A Festschrift for Thomas G. Kurtz (Inst. Math. Statist. Collect. 4), Institute of Mathematical Statistics, Beachwood, OH, pp. 267283.CrossRefGoogle Scholar
Schmidhuber, J. (2015). Deep learning in neural networks: an overview. Neural Networks 61, 85117.CrossRefGoogle ScholarPubMed
Schoutens, W. (2003). Lévy Processes in Finance: Pricing Financial Derivatives. John Wiley, Chichester.CrossRefGoogle Scholar
Utsu, T., Ogata, Y. and Matsu’ura, R. S. (1995). The centenary of the Omori formula for a decay law of aftershock activity. J. Phys. Earth 43, 133.CrossRefGoogle Scholar
Zell, A. (1997). Simulation neuronaler Netze. Oldenbourg, Munich.Google Scholar
Çetin, U. and Verschuere, M. (2009). Pricing and hedging in carbon emissions markets. Internat. J. Theoret. Appl. Finance 12, 949967.CrossRefGoogle Scholar
Figure 0

Figure 1. An illustration of the existence of $\vartheta^{\sharp}$ solving Equation (4.11), where $m_1$ and $m_2$ are given in Equations (4.12) and (4.13), respectively.