
Hiring and firing – a signaling game

Published online by Cambridge University Press:  12 November 2024

Erik Ekström*
Affiliation:
Uppsala University
Topias Tolonen-Weckström*
Affiliation:
Uppsala University
*Postal address: P.O. Box 256, SE-751 05 Uppsala, Sweden.

Abstract

We study a signaling game between an employer and a potential employee, where the employee has private information regarding their production capacity. At the initial stage, the employee communicates a salary claim, after which the true production capacity is gradually revealed to the employer as the unknown drift of a Brownian motion representing the revenues generated by the employee. Subsequently, the employer has the possibility to choose a time to fire the employee in case the estimated production capacity falls short of the salary. In this setup, we use filtering and optimal stopping theory to derive an equilibrium in which the employee provides a randomized salary claim and the employer uses a threshold strategy in terms of the conditional probability for the high production capacity. The analysis is robust in the sense that various extensions of the basic model can be solved using the same methodology, including cases with positive firing costs, incomplete information about an individual’s own type, as well as an additional interview phase.

Type
Original Article
Creative Commons
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (https://creativecommons.org/licenses/by-nc-sa/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is included and the original work is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of Applied Probability Trust

1. Introduction

Incomplete information is a key ingredient in many hiring processes, where full knowledge about the true capacity of a potential employee is rarely available to the employer at the time of hiring. Instead, if the candidate is hired, such information is gradually revealed to the employer over time. The potential employee, on the other hand, typically possesses more accurate information and uses it when providing their salary claim. Naturally, a high salary is costly for the employer, and thus increases the employee’s risk of being fired. There is therefore a trade-off between a high salary claim, which increases personal income, and a low salary claim, which decreases the risk of being fired.

To model one possible instance of the strategic interaction between an employer and a potential employee, we set up and study a stochastic game with asymmetric information between two players. The game is informally described as follows.

  (i) The capacity $\mu$ of the employee (Player 1) is a random variable with a known two-point distribution.

  (ii) At time $t=0$ , Player 1 learns about the realization of the random variable $\mu$ , and presents to the employer (Player 2) a non-negotiable salary claim C; the salary can only take two values.

  (iii) At time 0, Player 2 observes the salary claim, and subsequently also noisy observations of $\mu$ , based upon which a choice is made of a stopping time $\tau$ to terminate the employment; here, $\tau=0$ corresponds to a case in which the salary claim is not accepted (no hiring), $0<\tau<\infty$ corresponds to an accepted salary claim, but with firing in finite time, and $\tau=\infty$ to an accepted salary claim, with no firing taking place.

  (iv) Up to the termination time $\tau$ , Player 1 receives compensation at the chosen rate C per unit of time. Player 2, on the other hand, earns a net payment stream consisting of increments of a stochastic process $\mu t+ \sigma W_t-Ct$ , where W is a Brownian motion which accounts for random fluctuations about the mean rate $\mu-C$ .

This is a signaling game, with two possible types of Player 1, corresponding to the two possible values of $\mu$ , and where Player 1 sends a signal by choosing the salary level C. As such, there is an incomplete and asymmetric information structure since the players have different knowledge about $\mu$ .

Variants of such signaling games with asymmetric information have a long history in the literature on hiring of staff and salary formation. A classical reference is [Reference Spence23], where an example with a job seeker who can be of two different types is studied. The job seeker knows their own type and chooses an education level, where the cost of education depends on the type, thereby conveying information to the employer. In the setup of [Reference Spence23], all information is conveyed at the initial time, and the signal that consists of the chosen education level does not influence the actual ability of the employee. Extensions are studied in [Reference Alós-Ferrer and Prat1, Reference Daley and Green6] with a type-dependent continuation value for the job seeker, thus allowing for a future change in the salary level. In particular, the setup includes cases of employer learning based on an additional report (such as an interview phase) or based on the employee’s on-the-job performance. For related studies allowing for uncertainty about an individual’s own type, see [Reference Weiss24], and for more possible types of job seeker and competition between employers, see [Reference Dato, Grunewald, Kräkel and Müller7]. Also, a signaling game outside the job market is explored in [Reference Daley and Green5], where an owner of a company and several potential buyers are considered. The seller holds private information about the company type, and buyers learn gradually from noisy observations of the unknown type and from the actions (or lack of actions) of the seller. The hiring problem that we consider is also related to so-called principal-agent problems (see, e.g., [Reference Cvitanić, Possamaï and Touzi4, Reference Holmström and Milgrom15, Reference Sannikov21]), in which a principal seeks to set up a compensation scheme to control the effort of the agent.

The methods we use to derive an equilibrium in the signaling game rely on a combination of stochastic filtering and stochastic control theory. For stochastic filtering of the drift of a diffusion process we refer to [Reference Liptser and Shiryaev19], and for a classical application to sequential hypothesis testing of the drift of Brownian motion, where filtering and optimal stopping theory are combined, see [Reference Shiryaev22, Chapter 4]. For other studies of combined filtering and control, see, e.g., [Reference De Angelis8, Reference Ekström and Vaicenavicius10, Reference Lakner17] for single-agent problems, and [Reference Cardaliaguet and Rainer3, Reference Ekström, Lindensjö and Olofsson9, Reference Grün13] for strategic setups.

In contrast to the setup of [Reference Spence23] and many subsequent papers, the current study does not use education level as a signal. Instead, we equip the employee with the right to provide a salary claim, which is then paid out continuously until a possible firing time. This allows us to study the trade-off between a high personal income and the risk of becoming a burden for the company. In line with the literature on signaling games as above (see also [Reference Fudenberg and Tirole12, Reference Harsanyi14, Reference Osborne and Rubinstein20]) we use the concept of perfect Bayesian equilibrium (PBE) as a solution concept. We show the existence of a semi-separating PBE, in which the strong type always chooses the high salary, whereas the weak type randomizes between the low and the high salary.

In Section 2, a precise formulation of our hiring-and-firing game is presented, and Section 3 recalls some standard results from filtering theory. In Section 4 we argue heuristically to derive a candidate equilibrium of strategies; the candidate equilibrium is then verified to be a PBE. While our Bayesian game setup is rather simplistic, with only two possible types of employee and two possible salary claims, it may serve as a benchmark for more involved problems. This is illustrated in Section 5, where a few such extensions are briefly discussed. For these extensions, the solution of the benchmark case presented in Sections 2–4 is utilized.

2. Setup

To describe the game in further detail, let W be a standard Brownian motion, and let $\mu$ be a modified Bernoulli-distributed random variable independent of W with $\mathbb{P}(\mu=\mu_1)=p=1-\mathbb{P}(\mu=\mu_0)$ , where $\mu_0$ , $\mu_1$ , and p are known constants with $\mu_0<\mu_1$ and $p\in(0,1)$ . We assume that the employee (Player 1) generates a payment stream to the employer (Player 2) modeled as the increments of a process $X_t = \mu t + \sigma W_t$ , where $\sigma$ is a positive constant. The random variable $\mu$ will be referred to as the capacity of the employee.

Player 1 knows their own type, i.e. the realization of their capacity $\mu$ , and gives at the initial time $t=0$ a salary claim C in the set $\{c_0,c_1\}$ , where $0<c_0<c_1$ . More precisely, allowing for randomized strategies, a strategy of Player 1 consists of a pair $a=(a_0,a_1)\in\mathcal P$ , where $\mathcal P=[0,1]^2$ is the unit square. Here, $a_i$ represents the conditional probability of choosing $C=c_1$ given that Player 1 is of type i, $i=0,1$ .

To describe the possible strategies of Player 2, denote by $\mathbb F^{X}=(\mathcal F^{X}_t)_{t\geq 0}$ the augmentation of the filtration generated by the process X, and by $\mathbb F=(\mathcal F_t^{X,C})_{t\geq 0}$ the augmentation of the filtration generated by the process X and the random variable C. Also, let $\mathcal T^X$ be the collection of $\mathbb F^{X}$ -stopping times, and $\mathcal T$ be the collection of $\mathbb F$ -stopping times. Clearly, since C only takes two possible values, any stopping time $\tau\in\mathcal T$ can be decomposed as

\begin{equation*} \tau= \left\{ \begin{array}{l@{\quad}l} \tau_0 & \mbox{on }\{C=c_0\}, \\[4pt] \tau_1 & \mbox{on }\{C=c_1\}, \end{array} \right.\end{equation*}

where $(\tau_0,\tau_1)\in\mathcal T^X\times\mathcal T^X$ . Conversely, defining $\tau$ in this way for a given pair $(\tau_0,\tau_1)\in\mathcal T^X\times\mathcal T^X$ yields that $\tau\in\mathcal T$ . Thus we may identify $\mathcal T$ with $\mathcal T^X\times\mathcal T^X$ , and we therefore write $\tau=(\tau_0,\tau_1)$ .

In addition to a pair $(a,\tau)\in\mathcal P\times \mathcal T$ of strategies, the definition of a perfect Bayesian equilibrium (given below) also requires the specification of a belief system $\Pi_0=\big(\Pi_0^0,\Pi_0^1\big)\in\mathcal P$ . Here, $\Pi_0^i$ represents the probability that Player 2 assigns to the event $\{\mu=\mu_1\}$ conditional on the signal $C=c_i$ , $i=0,1$ .

The payoff structure of the game is now described as follows. Up to the stopping time $\tau$ , Player 1 receives compensation for their work at rate C per unit of time. Player 2, on the other hand, receives increments of the net payment stream

(1) \begin{equation} X_t-Ct=(\mu-C)t + \sigma W_t\end{equation}

per unit of time. Both players seek to maximize their expected discounted future payoff. More precisely, for a given triple $(a,\tau,\Pi_0)\in \mathcal P\times\mathcal T\times \mathcal P$ with $a=(a_0,a_1)$ , $\tau=(\tau_0,\tau_1)$ , and $\Pi_0=\big(\Pi_0^0,\Pi_0^1\big)$ , and for a given discount rate $r>0$ , define

\begin{align*} J_1^0(a,\tau) & = (1-a_0)\mathbb{E}\bigg[\int_0^{\tau_0}{\mathrm{e}}^{-rt}c_0\,{\mathrm{d}} t \mid \mu=\mu_0\bigg] + a_0\mathbb{E}\bigg[\int_0^{\tau_1}{\mathrm{e}}^{-rt}c_1\,{\mathrm{d}} t \mid \mu=\mu_0\bigg], \\ J_1^1(a,\tau) & = (1-a_1)\mathbb{E}\bigg[\int_0^{\tau_0}{\mathrm{e}}^{-rt}c_0\,{\mathrm{d}} t \mid \mu=\mu_1\bigg] + a_1\mathbb{E}\bigg[\int_0^{\tau_1}{\mathrm{e}}^{-rt}c_1\,{\mathrm{d}} t \mid \mu=\mu_1\bigg].\end{align*}

Then $J_1^i$ represents the expected payoff for Player 1 given the capacity $\mu=\mu_i$ , $i=0,1$ . Similarly, define

\begin{align*}J_2^0(\tau,\Pi_0) = \mathbb{E}_{\Pi_0^0}\bigg[\int_0^{\tau_0}{\mathrm{e}}^{-rt}(\mu-c_0)\,{\mathrm{d}} t\bigg], \qquad J_2^1(\tau,\Pi_0) = \mathbb{E}_{\Pi_0^1}\bigg[\int_0^{\tau_1}{\mathrm{e}}^{-rt}(\mu-c_1)\,{\mathrm{d}} t\bigg],\end{align*}

so that $J_2^i$ is the expected payoff for Player 2 given that $C=c_i$ , $i=0,1$ . Here, the subindex in the expected value indicates that the expected value is calculated using the belief system $\Pi_0$ as the initial probability of the type $\mu=\mu_1$ .

Remark 1. Note that Player 1 is equipped with randomized strategies, whereas Player 2 is not. The intuitive reason for this is as follows. Player 1 acts at time 0 (revealing the realization of C), and once this is done, the game collapses to a single-player optimization problem of choosing a non-anticipative stopping strategy for Player 2. In such a Markovian optimal stopping problem (cf. Section 4.1), however, the optimal value is attained for a hitting time (a pure strategy), and there are typically no other strategies (e.g. mixed strategies) that are optimal.

Remark 2. While a key ingredient in our setup is asymmetric information about the capacity $\mu$ , we point out that the setup itself, including the numerical values of all parameters p, $\mu_0$ , $\mu_1$ , $\sigma$ , r, $c_0$ , and $c_1$ , is common knowledge to both players.

We now introduce the solution concept that we will use.

Definition 1. (Perfect Bayesian equilibrium.) We call a triplet $(a^*,\tau^*,\Pi_0)\in\mathcal P\times\mathcal T \times \mathcal P$ a perfect Bayesian equilibrium (PBE) if the following conditions are satisfied.

  (A) Rationality: $J^i_1(a^*,\tau^*)\geq J^i_1(a,\tau^*)$ for $i=0,1$ , and $J^i_2(\tau^*,\Pi_0)\geq J^i_2(\tau,\Pi_0)$ , $i=0,1$ , for all pairs $(a,\tau)\in\mathcal P\times\mathcal T$ .

  (B) Bayesian updating: If $\min \big\{a^*_0,a^*_1\big\}<1$ , then

    \begin{align*}\Pi_0^0=\frac{p(1-a^*_1)}{p(1-a^*_1) + (1-p)(1-a^*_0)},\end{align*}
    and if $\max\big\{a^*_0,a^*_1\big\}>0$ , then
    \begin{align*}\Pi_0^1=\frac{pa^*_1}{pa^*_1+ (1-p)a^*_0}.\end{align*}

Remark 3. In Definition 1, condition (A) requires the strategy triplet to form an equilibrium in the usual sense that neither of the players wants to unilaterally deviate from it. Condition (B) requires Player 2 to update their belief system using Bayes’ rule on events with positive probability. If $a_0^*=a_1^*=1$ then Player 1 always chooses $C=c_1$ , and if $a_0^*=a_1^*=0$ , then Player 1 always chooses $C=c_0$ , and in these cases the belief system can be chosen with no restriction.

Remark 4. If $a_0^*=a_1^*\in\{0,1\}$ , then the PBE is of pooling type; if $a_0^*=1-a_1^*\in\{0,1\}$ , then the PBE is separating; finally, if $a^*_i\in\{0,1\}$ and $a_{1-i}^*\notin\{0,1\}$ , then the PBE is semi-separating.
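
The Bayesian updating rule in condition (B) and the classification in Remark 4 can be illustrated with a short Python sketch. The function names `bayes_beliefs` and `equilibrium_type` are hypothetical helpers introduced only for this illustration, and the label "other" for strategies in which both types randomize is an assumption, since Remark 4 does not classify that case.

```python
from typing import Optional, Tuple


def bayes_beliefs(p: float, a0: float, a1: float) -> Tuple[Optional[float], Optional[float]]:
    """Return (Pi_0^0, Pi_0^1) from condition (B).

    An entry is None when the corresponding signal has probability zero,
    in which case Bayes' rule places no restriction on the belief.
    """
    prob_c0 = p * (1 - a1) + (1 - p) * (1 - a0)  # P(C = c_0)
    prob_c1 = p * a1 + (1 - p) * a0              # P(C = c_1)
    belief_c0 = p * (1 - a1) / prob_c0 if prob_c0 > 0 else None
    belief_c1 = p * a1 / prob_c1 if prob_c1 > 0 else None
    return belief_c0, belief_c1


def equilibrium_type(a0: float, a1: float) -> str:
    """Classify a strategy (a0, a1) following Remark 4."""
    pure0, pure1 = a0 in (0, 1), a1 in (0, 1)
    if pure0 and pure1:
        return "pooling" if a0 == a1 else "separating"
    if pure0 != pure1:
        return "semi-separating"
    return "other"  # both types randomize; not covered by Remark 4
```

For instance, with $p=0.5$ and the strategy $a=(0.5,1)$ , the signal $C=c_0$ reveals the weak type (belief 0), while $C=c_1$ raises the belief in the strong type from $1/2$ to $2/3$ .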

3. Filtering

From the perspective of Player 2, the problem is a two-source learning problem: at time $t=0$ , the salary claim C is observed and the prior distribution of $\mu$ is updated in accordance with the specified belief system; at subsequent times $t>0$ , the posterior distribution is updated using observations of X.

Given $\pi\in[0,1]$ , define the process $\tilde\Pi\,:\!=\,\tilde\Pi^{\pi}$ by $\tilde \Pi_t\,:\!=\,\mathbb{P}_\pi(\mu=\mu_1\mid \mathcal{F}_t^{X})$ , where the index $\pi$ indicates that the conditional probability is calculated using an initial estimate $\pi$ for the event $\{\mu=\mu_1\}$ . Thus, $\tilde\Pi$ is the probability that $\mu=\mu_1$ conditioned merely on observations of X, and calculated with an initial belief $\pi$ . It is well known from filtering theory (see [Reference Liptser and Shiryaev19]) that the conditional probability $\tilde\Pi$ satisfies ${\mathrm{d}}\tilde\Pi_t=\omega\tilde\Pi_t(1-\tilde\Pi_t)\,{\mathrm{d}}\hat W_t$ , where $\omega\,:\!=\,(\mu_1-\mu_0)/\sigma$ is the signal-to-noise ratio and the innovations process

\begin{align*}\hat W_t\,:\!=\,\frac{1}{\sigma}\bigg(X_t-\int_0^t (\mu_0+(\mu_1-\mu_0)\tilde\Pi_s)\,{\mathrm{d}} s\bigg)\end{align*}

is an $\mathcal{F}^X$ -Brownian motion. In particular, the process $\tilde \Pi$ is strong Markov.
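
As a complement to the stochastic differential equation for $\tilde\Pi$ , the posterior can also be written explicitly through the likelihood ratio of the two drifts, which depends on the path of X only through the current value $X_t$ . The following Python sketch uses this standard representation; the function name `posterior_high_type` is a hypothetical helper, not notation from the paper.

```python
import math


def posterior_high_type(prior: float, x_t: float, t: float,
                        mu0: float, mu1: float, sigma: float) -> float:
    """P(mu = mu1 | F_t^X) when X_t = mu*t + sigma*W_t and P(mu = mu1) = prior."""
    if prior <= 0.0 or prior >= 1.0:
        return prior  # degenerate priors are never updated
    # Log-likelihood ratio of drift mu1 against drift mu0 given X_t = x_t.
    log_lr = (mu1 - mu0) / sigma**2 * (x_t - 0.5 * (mu0 + mu1) * t)
    odds = prior / (1.0 - prior) * math.exp(log_lr)
    return odds / (1.0 + odds)
```

In this representation the belief is unchanged whenever the observed average revenue $X_t/t$ equals the midpoint $(\mu_0+\mu_1)/2$ , and it moves towards 1 (towards 0) when the average revenue lies above (below) the midpoint.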

Now, given a belief system $\Pi_0=\big(\Pi_0^0,\Pi_0^1\big)\in\mathcal P$ , we define the conditional probability process

(2) \begin{equation} \Pi_t \,:\!=\, \left\{ \begin{array}{l@{\quad}l} \tilde\Pi_t^{\Pi_0^0} & \mbox{on }\{C=c_0\}, \\ \tilde\Pi_t^{\Pi_0^1} & \mbox{on }\{C=c_1\}. \end{array} \right.\end{equation}

If the Bayesian updating property (B) holds, then $\Pi_t$ coincides with $\mathbb{P}(\mu=\mu_1\mid \mathcal{F}^{X,C}_t)$ on the event $\{C=c_i\}$ provided that $\mathbb{P}(C=c_i)>0$ .

Lemma 1. Let $(\tau,\Pi_0)\in\mathcal T\times\mathcal P$ . Then, for $i=0,1$ ,

\begin{align*}J_2^i(\tau,\Pi_0) = \mathbb{E}_{\Pi_0^i}\bigg[\int_0^{\tau_i}{\mathrm{e}}^{-rt}(\mu_0-c_i+(\mu_1-\mu_0)\Pi_t)\,{\mathrm{d}} t\bigg].\end{align*}

Proof. By conditioning,

\begin{align*}\mathbb{E}_{\Pi_0^i}\bigg[\int_0^{\tau_i}{\mathrm{e}}^{-rt}\mu\,{\mathrm{d}} t\bigg] = \mathbb{E}_{\Pi_0^i}\bigg[\mu\frac{1-{\mathrm{e}}^{-r\tau_i}}{r}\bigg] = \mathbb{E}_{\Pi_0^i}\bigg[(\mu_0 +(\mu_1-\mu_0)\Pi_{\tau_i})\frac{1-{\mathrm{e}}^{-r\tau_i}}{r}\bigg],\end{align*}

where $\Pi_t\,:\!=\,\mathbb{P}_{\Pi_0^i}(\mu=\mu_1\mid\mathcal{F}^X_t)$ . Moreover, by an application of Itô’s formula and optional sampling,

\begin{align*}\mathbb{E}_{\Pi_0^i}\bigg[(\mu_0+(\mu_1-\mu_0)\Pi_{\tau_i})\frac{1-{\mathrm{e}}^{-r\tau_i}}{r}\bigg] = \mathbb{E}_{\Pi_0^i}\bigg[\int_0^{\tau_i}{\mathrm{e}}^{-rt}(\mu_0+(\mu_1-\mu_0)\Pi_t)\,{\mathrm{d}} t\bigg].\end{align*}

Consequently,

\begin{equation*} J_2^i(\tau,\Pi_0) = \mathbb{E}_{\Pi_0^i}\bigg[\int_0^{\tau_i}{\mathrm{e}}^{-rt}(\mu-c_i)\,{\mathrm{d}} t\bigg] = \mathbb{E}_{\Pi_0^i}\bigg[\int_0^{\tau_i}{\mathrm{e}}^{-rt}(\mu_0-c_i+(\mu_1-\mu_0)\Pi_t)\,{\mathrm{d}} t\bigg]. \end{equation*}

4. A semi-separating PBE

Note that if $c_0\geq \mu_1$ , then the net drift $\mu-C$ in (1) is non-positive, and Player 2 should choose immediate firing ( $\tau=0$ ). Similarly, if $\mu_0\geq c_1$ , then $\tau=\infty$ would always be optimal. Thus, to rule out degenerate cases, a minimal assumption is that $\mu_0< c_1<\mu_1$ . Moreover, we will make the additional assumption that $c_0\leq\mu_0$ so that the net drift $\mu-C$ in (1) is non-negative on the event $\{C=c_0\}$ . We thus impose the parameter ordering

(3) \begin{equation} 0<c_0\leq \mu_0<c_1<\mu_1.\end{equation}

It is straightforward to see that there is no PBE of separating type. Indeed, in a separating equilibrium with $a^*=(0,1)$ , the strong type would never be fired, and therefore the weak type would have an incentive to deviate and choose $c_1$ ; similarly, if $a^*=(1,0)$ , then the weak type would be fired immediately, and thus again have an incentive to deviate. The aim of the current section is to derive a perfect Bayesian equilibrium of semi-separating type under the assumption (3). In Sections 4.1 and 4.2 we use intuitive arguments to derive a candidate equilibrium, which is then verified in Section 4.3.

4.1. The employer’s perspective

Under the assumption (3), the lower salary level $c_0$ does not exceed the capacity $\mu$ , so the net drift $\mu-c_0$ is non-negative with probability 1. Thus, if (3) holds and Player 1 chooses $C=c_0$ , then an optimal response for Player 2 should be to choose $\tau_0=\infty$ .

On the other hand, on the event $\{C=c_1\}$ , Player 2 would stop if there is sufficient evidence that $\mu=\mu_0$ . More precisely, we expect a boundary level b such that

(4) \begin{equation} \tau_1\,:\!=\,\inf\{t\geq 0\colon\Pi_t\leq b\}\end{equation}

is an optimal response for Player 2. To determine b, standard optimal stopping theory based on the dynamic programming principle (see, e.g., [Reference Shiryaev22]) suggests that the pair (V, b), where

\begin{equation*} V(\pi) \,:\!=\, \sup_{\tau}\mathbb{E}\bigg[\int_0^\tau{\mathrm{e}}^{-rt}(\mu_0-c_1+(\mu_1-\mu_0)\tilde\Pi^\pi_t)\,{\mathrm{d}} t\bigg]\end{equation*}

solves the free-boundary problem

(5) \begin{equation} \left\{ \begin{array}{l@{\quad}l} \mathcal L V +\mu_0-c_1+(\mu_1-\mu_0)\pi=0, \quad & \pi\in(b,1), \\ V(b)=0, & \\ V_\pi(b)=0, & \\ V(1-)=(\mu_1-c_1)/r, \end{array} \right.\end{equation}

where

\begin{align*}\mathcal L =\frac{1}{2}\omega^2\pi^2(1-\pi)^2\frac{{\mathrm{d}}^2}{{\mathrm{d}}\pi^2}-r.\end{align*}

Here, the two boundary conditions at b constitute the so-called condition of smooth fit, and the boundary condition at $\pi=1$ corresponds to receiving (discounted) payments at rate $\mu_1-c_1$ until time $\tau=\infty$ . Note that the ordinary differential equation (ODE) in (5) is of second order, so there are two degrees of freedom; additionally, the boundary b is unknown. On the other hand, there are three boundary conditions, so we would expect that (5) is well-posed.

To solve the free-boundary problem (5), we readily verify that the general solution of the ODE is given by

\begin{align*}V(\pi) = A_1(1-\pi)\bigg(\frac{\pi}{1-\pi}\bigg)^{\gamma_1} + A_2(1-\pi)\bigg(\frac{\pi}{1-\pi}\bigg)^{\gamma_2} + \frac{\mu_0-c_1+(\mu_1-\mu_0)\pi}{r},\end{align*}

where $\gamma_1<0$ and $\gamma_2>1$ are the solutions of the quadratic equation

(6) \begin{equation} \gamma^2 -\gamma -\frac{2r}{\omega^2} =0,\end{equation}

and $A_1$ and $A_2$ are arbitrary constants. Imposing the boundary condition at $\pi=1$ , we must have $A_2=0$ , and so the two remaining boundary conditions yield

\begin{align*}\left\{ \begin{array}{l} A_1(1-b)\bigg(\dfrac{b}{1-b}\bigg)^{\gamma_1} + \dfrac{\mu_0-c_1+(\mu_1-\mu_0)b}{r}=0, \\[7pt] A_1(\gamma_1-b)\bigg(\dfrac{b}{1-b}\bigg)^{\gamma_1} + \dfrac{(\mu_1-\mu_0)b}{r}=0. \end{array} \right.\end{align*}

Eliminating $A_1$ , we find that

(7) \begin{equation} b=\frac{-(c_1-\mu_0)\gamma_1}{\mu_1-c_1-(\mu_1-\mu_0)\gamma_1}.\end{equation}

Thus, the candidate optimal response for Player 2 when Player 1 chooses the higher salary $C=c_1$ is to stop when the conditional probability process $\Pi$ falls below the constant boundary b in (7). Moreover, the candidate value for the employer is then

\begin{align*} V(\pi) = \left\{ \begin{array}{l@{\quad}l} \dfrac{c_1-\mu_0-(\mu_1-\mu_0)b}{r}\bigg(\dfrac{\pi(1-b)}{b(1-\pi)}\bigg)^{\gamma_1}\dfrac{1-\pi}{1-b} + \dfrac{\mu_0-c_1+(\mu_1-\mu_0)\pi}{r}, & \pi>b, \\ 0, & \pi\leq b. \end{array} \right.\end{align*}

For a graphical illustration of the function V and the threshold b, see Figure 1.

Figure 1. The value function $V(\pi)$ of the employer on the event $\{C=c_1\}$ . The parameter values chosen for this example figure are $c_1=1.5$ , $\mu_0=1.4$ , $\mu_1=1.7$ , $r=0.05$ , and $\sigma=1$ . The value function is positive only above the boundary level $b \approx 0.167$ , and it approaches its maximum value $(\mu_1-c_1)/r$ for $\pi$ close to 1.
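
The formulas (6) and (7) and the candidate value function V are straightforward to evaluate numerically. The Python sketch below does so for the parameter values quoted in the caption of Figure 1; the function names are hypothetical and chosen only for this illustration.

```python
import math


def gamma_roots(r: float, omega: float):
    """Roots gamma_1 < 0 and gamma_2 > 1 of gamma^2 - gamma - 2r/omega^2 = 0, see (6)."""
    disc = math.sqrt(1.0 + 8.0 * r / omega**2)
    return (1.0 - disc) / 2.0, (1.0 + disc) / 2.0


def firing_threshold(mu0: float, mu1: float, c1: float, gamma1: float) -> float:
    """Boundary level b from (7)."""
    return -(c1 - mu0) * gamma1 / (mu1 - c1 - (mu1 - mu0) * gamma1)


def employer_value(pi: float, mu0: float, mu1: float, c1: float,
                   r: float, gamma1: float, b: float) -> float:
    """Candidate value V(pi) of the employer on the event {C = c_1}."""
    if pi <= b:
        return 0.0
    if pi >= 1.0:
        return (mu1 - c1) / r
    linear = (mu0 - c1 + (mu1 - mu0) * pi) / r
    coeff = (c1 - mu0 - (mu1 - mu0) * b) / r
    ratio = pi * (1.0 - b) / (b * (1.0 - pi))
    return coeff * ratio**gamma1 * (1.0 - pi) / (1.0 - b) + linear


# Parameter values from the caption of Figure 1.
mu0, mu1, c1, r, sigma = 1.4, 1.7, 1.5, 0.05, 1.0
omega = (mu1 - mu0) / sigma
gamma1, gamma2 = gamma_roots(r, omega)
b = firing_threshold(mu0, mu1, c1, gamma1)
```

With these inputs the roots are $\gamma_1=-2/3$ and $\gamma_2=5/3$ , and the threshold evaluates to $b=1/6\approx 0.167$ , in agreement with the value stated in the caption.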

4.2. The employee’s perspective

We now take the perspective of Player 1. We will construct an equilibrium in which Player 1 always chooses $C=c_1$ on the event $\{\mu=\mu_1\}$ , and on the event $\{\mu=\mu_0\}$ uses a strategy such that $\mathbb{P}(C=c_1\mid\mu=\mu_0)=a_0=1-\mathbb{P}(C=c_0\mid\mu=\mu_0)$ for some $a_0\in[0,1]$ to be determined. Thus, in the notation of Section 2, we consider the strategy $a=(a_0,1)\in\mathcal P$ .

As noted above, on the event $\{C=c_0\}$ , Player 2 would use $\tau_0=\infty$ . By the indifference principle in game theory (see, e.g., [Reference Ferguson11] or [Reference Lindahl18]), to have an equilibrium with a strategy pair $(a^*,\tau^*)$ in which Player 1 uses a mixed strategy $a^*=(a_0,1)$ with $a_0\in(0,1)$ and Player 2 uses $\tau^*=(\infty,\tau_1)$ with $\tau_1$ as in (4), we need the expected payoffs $J^0_1((0,1),\tau^*)$ and $J^0_1((1,1),\tau^*)$ to coincide. Clearly, choosing $C=c_0$ gives the expected payoff $J^0_1((0,1),(\infty,\tau_1))=c_0/r$ for Player 1.

To determine the expected payoff $J^0_1((1,1),\tau^*)$ for Player 1 on the event $\{C=c_1\}$ , note that on the event $\{C=c_1\}$ , Player 2 would first re-evaluate the probability that Player 1 has the larger capacity $\mu=\mu_1$ according to the specified belief system $\Pi_0$ . Moreover, by the Bayesian updating requirement of the belief system, we have

\begin{align*}\Pi_0^1=\mathbb{P}(\mu=\mu_1\mid C=c_1)=\frac{\mathbb{P}(\mu=\mu_1, C=c_1)}{\mathbb{P}( C=c_1)}=\frac{p}{p+(1-p)a_0}.\end{align*}

Thus, $\Pi_t$ makes an initial jump from $\Pi_{0-}=p$ up to

$$\Pi^1_0=\frac{p}{p+(1-p)a_0}\geq p,$$

and then it diffuses with dynamics ${\mathrm{d}}\Pi_t=\omega\Pi_t(1-\Pi_t)\,{\mathrm{d}}\hat W_t$ . From the perspective of Player 1, however, $\hat W$ is not a Brownian motion since Player 1 knows the true value of $\mu$ . Instead, for the weak type ( $\mu=\mu_0$ ),

\begin{equation*} {\mathrm{d}}\Pi_t = \omega\Pi_t(1-\Pi_t)\,{\mathrm{d}}\hat W_t = -\omega^2\Pi_t^2(1-\Pi_t)\,{\mathrm{d}} t + \omega\Pi_t(1-\Pi_t)\,{\mathrm{d}} W_t.\end{equation*}
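
The drift term in the last display follows from the definition of the innovations process in Section 3. The short derivation below spells out this step on the event $\{\mu=\mu_0\}$ ; the corresponding formula on $\{\mu=\mu_1\}$ is not stated in the text and is included here only for comparison.

```latex
\begin{align*}
% On {mu = mu_0} we have dX_t = mu_0 dt + sigma dW_t, hence
{\mathrm{d}}\hat W_t
  &= \frac{1}{\sigma}\big({\mathrm{d}} X_t-(\mu_0+(\mu_1-\mu_0)\Pi_t)\,{\mathrm{d}} t\big)
   = {\mathrm{d}} W_t-\omega\Pi_t\,{\mathrm{d}} t, \\
{\mathrm{d}}\Pi_t
  &= \omega\Pi_t(1-\Pi_t)\big({\mathrm{d}} W_t-\omega\Pi_t\,{\mathrm{d}} t\big)
   = -\omega^2\Pi_t^2(1-\Pi_t)\,{\mathrm{d}} t+\omega\Pi_t(1-\Pi_t)\,{\mathrm{d}} W_t. \\
% On {mu = mu_1} we have dX_t = mu_1 dt + sigma dW_t, hence
{\mathrm{d}}\hat W_t
  &= {\mathrm{d}} W_t+\omega(1-\Pi_t)\,{\mathrm{d}} t, \qquad
{\mathrm{d}}\Pi_t
   = \omega^2\Pi_t(1-\Pi_t)^2\,{\mathrm{d}} t+\omega\Pi_t(1-\Pi_t)\,{\mathrm{d}} W_t.
\end{align*}
```

In words, the belief of Player 2 drifts downwards when the employee is of the weak type and upwards when the employee is of the strong type.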

Consequently, if Player 1 chooses $C=c_1$ , then their value is $U(\Pi^1_0)$ , where

(8) \begin{equation} U(\pi) \,:\!=\,\mathbb{E}_\pi\bigg[\int_0^{\tau_1}{\mathrm{e}}^{-rt}c_1\,{\mathrm{d}} t \mid \mu=\mu_0\bigg]\end{equation}

solves

\begin{equation*} \left\{ \begin{array}{l@{\quad}l} \dfrac{\omega^2\pi^2(1-\pi)^2}{2}U_{\pi\pi}-\omega^2\pi^2(1-\pi)U_{\pi}-rU +c_1=0, \quad & \pi\in(b,1), \\[8pt] U(b)=0, \\[6pt] U(1-)= c_1/r. \end{array} \right.\end{equation*}

This ODE has the general solution

\begin{align*}U(\pi)=B_1\bigg(\frac{\pi}{1-\pi}\bigg)^{\gamma_1} + B_2\bigg(\frac{\pi}{1-\pi}\bigg)^{\gamma_2}+c_1/r,\end{align*}

where $\gamma_1<0$ and $\gamma_2>1$ are the solutions of (6) as before, and $B_1$ and $B_2$ are arbitrary constants. As before, the boundary condition at $\pi=1$ yields $B_2=0$ , and then the boundary condition at $\pi=b$ gives

\begin{align*}B_1=\frac{-c_1}{r}\bigg(\frac{b}{1-b}\bigg)^{-\gamma_1},\end{align*}

so

\begin{align*}U(\pi) = \left\{ \begin{array}{l@{\quad}l} \dfrac{c_1}{r}\bigg(1-\bigg(\dfrac{\pi(1-b)}{(1-\pi)b}\bigg)^{\gamma_1}\bigg), \quad & \pi>b, \\[9pt] 0, & \pi\leq b. \end{array} \right.\end{align*}
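
The closed-form expression for U can be sanity-checked by simulation. The Python sketch below is only an illustration under stated assumptions: it simulates the log-odds $\log(\Pi_t/(1-\Pi_t))$ , which on the event $\{\mu=\mu_0\}$ is a Brownian motion with drift $-\omega^2/2$ and volatility $\omega$ , and it monitors the threshold b on a discrete time grid. The function names, the step size, the truncation horizon, the number of paths, and the seed are all choices made for this example and do not come from the paper.

```python
import math

import numpy as np


def weak_type_value_closed_form(pi, c1, r, gamma1, b):
    """U(pi): expected discounted salary of the weak type on {C = c_1}."""
    if pi <= b:
        return 0.0
    ratio = pi * (1.0 - b) / ((1.0 - pi) * b)
    return c1 / r * (1.0 - ratio**gamma1)


def weak_type_value_monte_carlo(pi, c1, r, omega, b, n_paths=4000,
                                dt=0.02, horizon=150.0, seed=0):
    """Monte Carlo estimate of U(pi) for pi in (0, 1), with discrete monitoring."""
    rng = np.random.default_rng(seed)
    barrier = math.log(b / (1.0 - b))
    log_odds = np.full(n_paths, math.log(pi / (1.0 - pi)))
    alive = log_odds > barrier          # paths on which the employee is still employed
    payoff = np.zeros(n_paths)
    for k in range(int(round(horizon / dt))):
        n_alive = int(alive.sum())
        if n_alive == 0:
            break
        t = k * dt
        # Discounted salary earned on [t, t + dt] by the employed paths.
        payoff[alive] += c1 * (math.exp(-r * t) - math.exp(-r * (t + dt))) / r
        # Exact Gaussian increment of the log-odds under mu = mu_0.
        noise = rng.standard_normal(n_alive)
        log_odds[alive] += -0.5 * omega**2 * dt + omega * math.sqrt(dt) * noise
        alive &= log_odds > barrier     # fire once the belief reaches b
    return float(payoff.mean())
```

Because firing is only checked on the grid, the simulated employment spells are slightly too long, so the estimate is expected to lie marginally above the closed-form value; the gap shrinks as the step size decreases.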

Now recall that by the indifference principle we are looking for $a_0\in(0,1)$ such that

\begin{align*}\frac{c_0}{r}= U\left(\Pi_0^1\right) = U\bigg(\frac{p}{p+(1-p)a_0}\bigg).\end{align*}

This is possible only if $U(p)<c_0/r$ , i.e. if

\begin{align*}\frac{p}{1-p} < \frac{b}{1-b}\bigg(1-\frac{c_0}{c_1}\bigg)^{1/\gamma_1}.\end{align*}

Equivalently, we need to have

(9) \begin{equation} p<\hat p \,:\!=\, \frac{b(1-{c_0}/{c_1})^{1/\gamma_1}}{1-b+b(1-{c_0}/{c_1})^{1/\gamma_1}},\end{equation}

where $\hat p$ is the indifference point such that $U(\hat p)=c_0/r$ . Moreover, in that case, $a_0$ should be chosen so that

\begin{align*}\frac{p}{p+(1-p)a_0} = \frac{b(1-{c_0}/{c_1})^{1/\gamma_1}}{1-b+b(1-{c_0}/{c_1})^{1/\gamma_1}},\end{align*}

i.e.

\begin{align*}a_0 = \frac{p(1-b)}{(1-p)b(1-{c_0}/{c_1})^{1/\gamma_1}}.\end{align*}

That is, the candidate optimal strategy for Player 1 is as follows. If $\mu=\mu_1$ , then the high salary is chosen with probability 1 ( $a_1=1$ ). On the other hand, if Player 1 is of the weak type ( $\mu=\mu_0$ ), then the high salary $c_1$ is chosen with probability $a_0$ , where

\begin{align*}a_0 = \left\{ \begin{array}{l@{\quad}l} \dfrac{p(1-b)}{(1-p)b(1-{c_0}/{c_1})^{1/\gamma_1}}, \quad & p<\hat p, \\[15pt] 1, & p\geq \hat p. \end{array} \right.\end{align*}

For a graphical illustration of the value function U and the indifference point $\hat p$ , see Figure 2.

Figure 2. The value function $U(\pi)$ for the weak-type employee on the event $\{C = c_1\}$ . The parameter values of $c_1$ , $\mu_0$ , $\mu_1$ , r, and $\sigma$ are the same as in Figure 1, and $c_0=1.2$ . Here, $\hat p$ is the unique value such that $U(\hat p)=c_0/r$ .
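
The indifference point $\hat p$ in (9) and the mixing probability $a_0$ can be computed directly from the formulas above. The Python sketch below uses the values $\gamma_1=-2/3$ and $b=1/6$ implied by the parameters of Figures 1 and 2; the prior $p=0.5$ in the example and the function names are assumptions made for illustration.

```python
def indifference_point(b: float, c0: float, c1: float, gamma1: float) -> float:
    """p_hat from (9), the belief at which U(p_hat) = c0 / r."""
    k = (1.0 - c0 / c1) ** (1.0 / gamma1)
    return b * k / (1.0 - b + b * k)


def weak_type_mixing_probability(p: float, b: float, c0: float,
                                 c1: float, gamma1: float) -> float:
    """Probability a_0 with which the weak type claims the high salary c_1."""
    if p >= indifference_point(b, c0, c1, gamma1):
        return 1.0
    k = (1.0 - c0 / c1) ** (1.0 / gamma1)
    return p * (1.0 - b) / ((1.0 - p) * b * k)


# gamma_1 and b implied by the Figure 1 parameters; c0 and c1 as in Figure 2.
gamma1, b, c0, c1 = -2.0 / 3.0, 1.0 / 6.0, 1.2, 1.5
p = 0.5  # assumed prior probability of the strong type
p_hat = indifference_point(b, c0, c1, gamma1)
a0 = weak_type_mixing_probability(p, b, c0, c1, gamma1)
belief_after_high_claim = p / (p + (1.0 - p) * a0)
```

For these inputs, $\hat p\approx 0.691$ and $a_0\approx 0.447$ , and the belief of Player 2 after observing $C=c_1$ equals $\hat p$ , as required by the indifference principle.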

4.3. Verification of equilibrium

We now summarize the strategies described above in Theorem 1. We then verify that these strategies together constitute a perfect Bayesian equilibrium.

Let b be defined as in (7), and define the strategy $a^*=(a_0^*,a^*_1)$ of Player 1 by

\begin{align*}a^*_0= \left\{ \begin{array}{l@{\quad}l} \dfrac{p(1-b)}{(1-p)b(1-{c_0}/{c_1})^{1/\gamma_1}}, \quad & p<\hat p, \\[8pt] 1, & p\geq\hat p \end{array} \right.\end{align*}

and $a_1^*=1$ , where $\hat p$ is as in (9). Moreover, let $\Pi_0\,:\!=\,\big(\Pi_0^0,\Pi_0^1\big)=(0,\hat p\vee p)$ , and let $\tau^*=(\tau^*_0,\tau^*_1)$ be defined by $\tau^*_0\,:\!=\,\infty$ and $\tau^*_1\,:\!=\,\inf\big\{t\geq 0\colon\tilde\Pi_t^{\Pi_0^1}\leq b\big\}$ .

Theorem 1. Assume that (3) holds. Then the triplet $(a^*,\tau^*, \Pi_0)$ specified above is a perfect Bayesian equilibrium. Moreover, if $p<\hat p$ , the equilibrium is semi-separating; if $p\geq\hat p$ , then the equilibrium is of pooling type.

Proof. We first note that, by construction, the belief system $\Pi_0$ satisfies the Bayesian updating property. The proof of rationality is divided into two parts.

Optimality of $\tau^*$

First note that $\Pi_0^0=0$ yields that

\begin{align*} J_2^0(\tau,\Pi_0) = \mathbb{E}\bigg[\int_0^{\tau_0}{\mathrm{e}}^{-rt}(\mu_0-c_0)\,{\mathrm{d}} t \mid \mu=\mu_0\bigg] \leq \frac{\mu_0-c_0}{r} = J_2^0(\tau^*,\Pi_0) \end{align*}

for any $\tau\in\mathcal T$ , so $\tau_0^*=\infty$ is a rational response to $C=c_0$ .

Next, if the employer observes the event $\{C=c_1\}$ , then the stopping time $\tau^*_1\,:\!=\,\inf\{t\geq 0\colon Z_t\leq b\}$ is used, where $Z_t\,:\!=\,\tilde{\Pi}_t^{\Pi^1_0}=\mathbb{P}_{\Pi_0^1}(\mu=\mu_1\mid\mathcal{F}_t^{X})$ , cf. (2). By Section 3, ${\mathrm{d}} Z_t = \omega Z_t(1-Z_t)\,{\mathrm{d}}\hat W_t$ , so an application of Itô’s formula together with (5) shows that

\begin{align*}Y_t\,:\!=\,{\mathrm{e}}^{-rt}V(Z_t) + \int_0^t{\mathrm{e}}^{-rs}(\mu_0-c_1+(\mu_1-\mu_0)Z_s)\,{\mathrm{d}} s\end{align*}

is a bounded supermartingale. For any stopping time $\tau^{\prime}=(\tau^{\prime}_0,\tau^{\prime}_1)\in\mathcal T$ and any deterministic time $T>0$ , optional sampling therefore gives that

\begin{align*} V\big(\Pi_0^1\big) & \geq \mathbb{E}\bigg[{\mathrm{e}}^{-r(T\wedge\tau^{\prime}_1)}V(Z_{T\wedge\tau^{\prime}_1}) + \int_0^{T\wedge\tau^{\prime}_1}{\mathrm{e}}^{-rt}(\mu_0-c_1+(\mu_1-\mu_0)Z_t)\,{\mathrm{d}} t\bigg] \\ & \geq \mathbb{E}\bigg[\!\int_0^{T\wedge\tau^{\prime}_1}\!{\mathrm{e}}^{-rt}(\mu_0-c_1+(\mu_1-\mu_0)Z_t)\,{\mathrm{d}} t\bigg] \to \mathbb{E}\bigg[\!\int_0^{\tau^{\prime}_1}\!{\mathrm{e}}^{-rt}(\mu_0-c_1+(\mu_1-\mu_0)Z_t)\,{\mathrm{d}} t\bigg] \end{align*}

as $T\to\infty$ by bounded convergence. Since

\begin{align*}\mathbb{E}\bigg[\int_0^{\tau^{\prime}_1}{\mathrm{e}}^{-rt}(\mu_0-c_1+(\mu_1-\mu_0)Z_t)\,{\mathrm{d}} t\bigg] = J_2^1(\tau^{\prime},\Pi_0)\end{align*}

by Lemma 1, we find that

(10) \begin{equation} J_2^1(\tau^{\prime},\Pi_0)\leq V\big(\Pi_0^1\big) \end{equation}

for all $\tau^{\prime}\in\mathcal T$ .

Furthermore, for $\tau^*$ , the stopped process $Y_{t\wedge\tau^*_1}$ is a martingale, so optional sampling and bounded convergence give

\begin{align*} V\big(\Pi_0^1\big) & = \mathbb{E}\bigg[{\mathrm{e}}^{-r\big(T\wedge\tau^*_1\big)}V\big(Z_{T\wedge\tau^*_1}\big) + \int_0^{T\wedge\tau^*_1}{\mathrm{e}}^{-rt}(\mu_0-c_1+(\mu_1-\mu_0)Z_t)\,{\mathrm{d}} t\bigg] \\ & \to \mathbb{E}\bigg[\int_0^{\tau^*_1}{\mathrm{e}}^{-rt}(\mu_0-c_1+(\mu_1-\mu_0)Z_t)\,{\mathrm{d}} t\bigg] = J_2^1(\tau^*,\Pi_0) \end{align*}

as $T\to\infty$ , which together with (10) implies that $\tau^*_1$ is an optimal response to $C=c_1$ .

Optimality of $a^*$

We have that

\begin{align*} J^0_1(a,\tau^*) & = (1-a_0)\frac{c_0}{r} + a_0\mathbb{E}_{\Pi_0^1}\bigg[\int_0^{\tau_1^*}{\mathrm{e}}^{-rt}c_1\,{\mathrm{d}} t\mid\mu=\mu_0\bigg] \\ & = (1-a_0)\frac{c_0}{r} + a_0U(\hat p\vee p) \leq (1-a^*_0)\frac{c_0}{r} + a^*_0U(\hat p\vee p) = J^0_1(a^*,\tau^*), \end{align*}

where the inequality holds since if $p>\hat p$ then we have $U(\hat p\vee p)=U(p)\geq c_0/r$ and $a_0^*=1\geq a_0$ , and if $p\leq \hat p$ then we have $U(\hat p\vee p)=U(\hat p)=c_0/r$ .

Similarly,

\begin{align*} J^1_1(a,\tau^*) & = (1-a_1)\frac{c_0}{r} + a_1\mathbb{E}_{\Pi_0^1}\bigg[\int_0^{\tau_1^*}{\mathrm{e}}^{-rt}c_1\,{\mathrm{d}} t\mid\mu=\mu_1\bigg] \\ & \leq \mathbb{E}_{\Pi_0^1}\bigg[\int_0^{\tau_1^*}{\mathrm{e}}^{-rt}c_1\,{\mathrm{d}} t\mid\mu=\mu_1\bigg] = J^1_1(a^*,\tau^*), \end{align*}

where the inequality follows from the inequalities

\begin{equation*} \mathbb{E}_{\Pi_0^1}\bigg[\int_0^{\tau_1^*}{\mathrm{e}}^{-rt}c_1\,{\mathrm{d}} t\mid\mu=\mu_1\bigg] \geq \mathbb{E}_{\Pi_0^1}\bigg[\int_0^{\tau_1^*}{\mathrm{e}}^{-rt}c_1\,{\mathrm{d}} t\mid\mu=\mu_0\bigg] = U\big(\Pi_0^1\big)\geq c_0/r. \end{equation*}
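The linearity in $a_0$ behind the weak type's inequality can be checked numerically. The following Python sketch is illustrative only. It uses the parameter values from the captions of Figures 1 and 2, and it rests on two assumptions that are not quoted from the text: that the quadratic equation (6) reads $\gamma^2-\gamma-2r\sigma^2/(\mu_1-\mu_0)^2=0$ (this reproduces the value $b\approx 0.167$ of Figure 1), and that $U(\pi)=(c_1/r)\big(1-\big(\tfrac{\pi}{1-\pi}\tfrac{1-b}{b}\big)^{\gamma_1}\big)$ for $\pi>b$, a closed form that is consistent with the expression for $\hat p$. The function names are hypothetical.

```python
import math

# Parameter values from the captions of Figures 1 and 2.
c0, c1, mu0, mu1, r, sigma = 1.2, 1.5, 1.4, 1.7, 0.05, 1.0

# ASSUMPTION: (6) is gamma^2 - gamma - 2 r / omega^2 = 0, omega = (mu1 - mu0) / sigma.
omega = (mu1 - mu0) / sigma
gamma1 = 0.5 - math.sqrt(0.25 + 2.0 * r / omega**2)  # negative root

# Benchmark stopping boundary (the boundary formula without firing cost).
b = -(c1 - mu0) * gamma1 / (mu1 - c1 - (mu1 - mu0) * gamma1)

# Indifference point p_hat, where U(p_hat) = c0 / r.
k = (1.0 - c0 / c1) ** (1.0 / gamma1)
p_hat = b * k / (1.0 - b + b * k)


def U(pi):
    """ASSUMED closed form of the weak type's value on {C = c1}."""
    if pi <= b:
        return 0.0  # the employer stops immediately
    odds_ratio = (pi / (1.0 - pi)) * ((1.0 - b) / b)
    return (c1 / r) * (1.0 - odds_ratio**gamma1)


def weak_payoff(a0, p):
    """Payoff of a weak type who claims c1 with probability a0."""
    return (1.0 - a0) * c0 / r + a0 * U(max(p_hat, p))


# One prior below p_hat (indifference) and one above (a0 = 1 is optimal).
for p in (0.3, 0.8):
    print(p, [round(weak_payoff(a0, p), 4) for a0 in (0.0, 0.5, 1.0)])
```

Under these assumptions the payoff is constant in $a_0$ when $p\leq\hat p$ and strictly increasing in $a_0$ when $p>\hat p$, in line with the two cases of the argument.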

Remark 5. As is often the case for signaling games, there is no uniqueness of PBEs. Indeed, consider the strategy pair $(a,\tau)$ , where $a=(0,0)$ and $\tau=(\infty,0)$ ; in words, Player 1 always chooses $C=c_0$ (regardless of their type) and Player 2 never stops if $C=c_0$ and stops immediately if $C=c_1$ . Then $(a,\tau,\Pi_0)$ with $\Pi_0=(p,\Pi_0^1)$ is also a perfect Bayesian equilibrium (of pooling type) provided the belief $\Pi_0^1$ is chosen small enough (e.g. $\Pi_0^1\leq b$ ).

There have been substantial efforts in the literature to refine the notion of PBE (cf. [2, 16]) in order to rule out some non-intuitive equilibria. Rather than taking that path, however, we simply note that in the pooling equilibrium both types have the same equilibrium value $c_0/r$ , which is dominated by the corresponding equilibrium values in Theorem 1, so the semi-separating PBE is preferred by the first-mover of our game.

Remark 6. We have analyzed the game under the assumption (3) that $c_0\leq\mu_0<c_1<\mu_1$ . In the alternative ordering $\mu_0<c_0<c_1<\mu_1$ , the smaller salary $c_0$ provides a negative running reward for the employer in the case of a weak-type employee, and a semi-separating (or separating) equilibrium is not feasible. As in Remark 5, we can construct a pooling PBE with $a=(0,0)$ , but we also obtain a pooling equilibrium with $a=(1,1)$ , supported by a sufficiently small belief $\Pi_0^0$ . While semi-separating and separating PBEs are not feasible, it remains an open question whether mixing between the two pooling equilibria is possible in this parameter regime.

5. Extensions

In this section we briefly discuss a few extensions of the basic setup we have presented. All of these extensions can be easily solved using the methods of the current article, thus demonstrating the robustness of the benchmark case studied. For the sake of brevity, we merely outline the solutions and leave out the full arguments.

5.1. Firing cost

In this section we consider the specification (i)–(iv) in the introduction, but with the addition that

  (v) At the firing time $\tau$ , Player 2 pays a firing cost $\epsilon$ , where $\epsilon\in(0,({c_1-\mu_0})/{r})$ .

Note that the assumption $\epsilon<({c_1-\mu_0})/{r}$ implies that the firing cost is smaller than the maximal possible loss for Player 2. Adding assumption (v), the expected payoff for Player 2 is now

\begin{align*} J_{2}^{\epsilon, i}(\tau,\Pi_0) & \,:\!=\, \mathbb{E}_{\Pi_0^{i}}\bigg[\int_0^{\tau_i}{\mathrm{e}}^{-rt}(\mu-c_i)\,{\mathrm{d}} t - \epsilon{\mathrm{e}}^{-r\tau_i}\mathbf{1}_{\{0<\tau_i<\infty\}}\bigg] \\ &\ = \mathbb{E}_{\Pi_0^{i}}\bigg[\int_0^{\tau_i}{\mathrm{e}}^{-rt}(\mu_0-c_i+(\mu_1-\mu_0)\Pi_t)\,{\mathrm{d}} t - \epsilon{\mathrm{e}}^{-r\tau_i}\mathbf{1}_{\{0<\tau_i<\infty\}}\bigg]\end{align*}

by Lemma 1. Note that the choice $\tau_i=0$ gives rise to no firing cost, with the interpretation that no hiring takes place.

Replacing the first boundary condition in the free-boundary problem (5) with $V(b)=-\epsilon$ , we obtain a stopping boundary

\begin{align*}b\,:\!=\,b^\epsilon \,:\!=\,\frac{-(c_1-\mu_0-\epsilon r)\gamma_1}{\mu_1-c_1+r\epsilon-(\mu_1-\mu_0)\gamma_1},\end{align*}

where $\gamma_1$ is the negative solution to the quadratic equation (6). Due to the parameter ordering in (3), we can verify that indeed $b^\epsilon\in(0,1)$ when $\epsilon\in(0,({c_1-\mu_0})/{r})$ . Arguing as in Section 4.2, we find that the indifference point $\hat p$ at which $U(\hat p)=c_0/r$ is given by

\begin{align*} \hat p \,:\!=\, \hat{p}^\epsilon \,:\!=\, \frac{b^\epsilon(1-{c_0}/{c_1})^{1/\gamma_1}}{1-b^\epsilon+b^\epsilon(1-{c_0}/{c_1})^{1/\gamma_1}},\end{align*}

which leads to the candidate strategy $a^*=(a^*_0,1)$ with

\begin{align*}a^*_0= \left\{ \begin{array}{l@{\quad}l} \dfrac{p(1-b^\epsilon)}{(1-p)b^\epsilon(1-{c_0}/{c_1})^{1/\gamma_1}}, \quad & p<\hat p, \\ 1, & p\geq\hat p. \end{array} \right.\end{align*}

Thus we specify a triplet $(a^{*},\tau^{*}, \Pi_0)\in\mathcal P\times\mathcal T\times\mathcal P$ by $a^{*}=(a_0^{*},1)$ , $\Pi_0=(0,\hat p\vee p)$ , and $\tau^{*}=(\infty, \tau^{*}_1)$ , with $\tau^{*}_1=\inf\big\{t\geq 0\colon\tilde\Pi_t^{\Pi_0^1}\leq b^\epsilon\big\}$ . It is then straightforward to verify that if $V(\hat p\vee p)> 0$ , then $(a^{*},\tau^{*}, \Pi_0)$ is a PBE. (On the other hand, if $V(\hat p\vee p)\leq 0$ , then ((0, 0), (0, 0), (p, p)), corresponding to always choosing $C=c_0$ and no hiring, is an equilibrium.)

Thus, for a firing cost $\epsilon\in(0,({c_1-\mu_0})/{r})$ with $V(\hat p\vee p)> 0$ , we obtain a PBE with a lower stopping boundary $b^\epsilon$ for Player 2 and a higher randomizing probability $a_0^*$ for a low-type Player 1 than the corresponding values in the benchmark case of Section 4.
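To illustrate how the firing cost moves the boundary and the mixing probability, the following Python sketch evaluates the formulas for $b^\epsilon$, $\hat p^\epsilon$, and $a_0^*$ given above. It uses the parameter values from the captions of Figures 1 and 2, a hypothetical prior $p=0.3$, and the assumption (not quoted from the text) that the quadratic equation (6) reads $\gamma^2-\gamma-2r\sigma^2/(\mu_1-\mu_0)^2=0$, which reproduces $b\approx 0.167$ at $\epsilon=0$. The function names are hypothetical, and the condition $V(\hat p\vee p)>0$ is not checked here.

```python
import math

# Parameter values from the captions of Figures 1 and 2.
c0, c1, mu0, mu1, r, sigma = 1.2, 1.5, 1.4, 1.7, 0.05, 1.0

# ASSUMPTION: (6) is gamma^2 - gamma - 2 r / omega^2 = 0, omega = (mu1 - mu0) / sigma.
omega = (mu1 - mu0) / sigma
gamma1 = 0.5 - math.sqrt(0.25 + 2.0 * r / omega**2)  # negative root

# Factor (1 - c0/c1)^(1/gamma1) shared by p_hat and a0*.
k = (1.0 - c0 / c1) ** (1.0 / gamma1)


def b_eps(eps):
    """Stopping boundary b^eps; eps = 0 gives the benchmark boundary."""
    return -(c1 - mu0 - eps * r) * gamma1 / (mu1 - c1 + r * eps - (mu1 - mu0) * gamma1)


def p_hat_eps(eps):
    """Indifference point for the weak type under firing cost eps."""
    b = b_eps(eps)
    return b * k / (1.0 - b + b * k)


def a0_star(p, eps):
    """Weak type's probability of claiming c1 for prior p."""
    if p >= p_hat_eps(eps):
        return 1.0
    b = b_eps(eps)
    return p * (1.0 - b) / ((1.0 - p) * b * k)


p = 0.3  # hypothetical prior
for eps in (0.0, 0.5, 1.0, 1.5):  # admissible range is (0, (c1 - mu0) / r) = (0, 2)
    print(eps, round(b_eps(eps), 4), round(p_hat_eps(eps), 4), round(a0_star(p, eps), 4))
```

For these values the boundary decreases and the mixing probability increases as $\epsilon$ grows, which matches the comparison with the benchmark case.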

5.2. Uncertainty about type

In this section we consider the same setup as specified in (i)–(iv), but where (ii) is replaced by

  (iiʹ) At time $t=0$ , Player 1 first receives a noisy observation of their capacity $\mu$ , and then presents to Player 2 a salary claim $C\in\{c_0,c_1\}$ .

In this way, Player 1 also has incomplete information about their own capacity (cf. [24]). More precisely, assume that the noisy signal is either of the two events ‘strong belief’ and ‘weak belief’, where $p_1\,:\!=\,\mathbb{P}(\mbox{strong belief})$ , $q\,:\!=\,\mathbb{P}(\mu=\mu_1\mid\mbox{strong belief})$ , and

(11) \begin{equation} \mathbb{P}(\mu=\mu_1\mid\mbox{weak belief})=0.\end{equation}

With this notation, the probability (denoted p in the previous sections) that Player 1 has the large capacity is $p=p_1q$ . Note that when $q=1$ , this extension collapses to our original benchmark model. Also note that a consequence of (11) is that Player 1 has a tendency to overestimate their capacity: if Player 1 has weak belief, then they are always of the weak type, whereas if they have strong belief, then they may still be of the weak type. We adapt the strategy $a=(a_0,a_1)$ so that $a_0$ and $a_1$ now denote the conditional probabilities that Player 1 chooses $C=c_1$ given ‘weak belief’ and ‘strong belief’, respectively.

For simplicity, assume that $q>\hat p$ , where $\hat p$ is defined in (9). Now specify a triplet $(a^*,\tau^*, \Pi_0)\in\mathcal P\times\mathcal T\times\mathcal P$ by $a^*=(a_0^*,1)$ , $\Pi_0=(0,\hat p\vee (p_1q))$ , and $\tau^*=(\infty,\tau_1^*)$ , where

\begin{align*} a_0^* & = \left\{ \begin{array}{l@{\quad}l} \dfrac{p_1(q-\hat p)}{\hat p(1-p_1)}, \quad & p_1q<\hat p, \\[10pt] 1, & p_1q\geq\hat p, \end{array}\right. \\[3pt] \tau^*_1 & = \inf\Big\{t\geq 0\colon\tilde\Pi_t^{\Pi_0^1}\leq b\Big\},\end{align*}

with b as in (7). It is then straightforward to check that $(a^*,\tau^*, \Pi_0)$ constitutes a PBE.
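The consistency of the belief $\Pi_0^1=\hat p\vee(p_1q)$ with the strategy $a^*$ can be illustrated by a short Bayes computation. The Python sketch below is a hedged illustration rather than part of the argument: the posterior formula $\mathbb{P}(\mu=\mu_1\mid C=c_1)=p_1q/(p_1+(1-p_1)a_0)$ is our own application of Bayes' rule under (11) with $a_1=1$, the numerical values of $\hat p$, $q$, and $p_1$ are hypothetical, and the function names are not from the text.

```python
def a0_star(p1, q, p_hat):
    """Probability that a 'weak belief' Player 1 claims c1."""
    if p1 * q >= p_hat:
        return 1.0
    return p1 * (q - p_hat) / (p_hat * (1.0 - p1))


def posterior_given_c1(p1, q, a0):
    """P(mu = mu_1 | C = c1) when 'strong belief' always claims c1.

    By (11), only 'strong belief' players can be of the strong type,
    so the numerator is p1 * q; the denominator is P(C = c1).
    """
    return p1 * q / (p1 + (1.0 - p1) * a0)


p_hat, q = 0.69, 0.9  # hypothetical values with q > p_hat
for p1 in (0.5, 0.9):  # p1 * q below and above p_hat
    a0 = a0_star(p1, q, p_hat)
    print(p1, round(a0, 4), round(posterior_given_c1(p1, q, a0), 4))
```

In both cases the posterior equals $\hat p\vee(p_1q)$: it is held at $\hat p$ by mixing when $p_1q<\hat p$, and it equals $p_1q$ when both belief types claim $c_1$.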

5.3. Adding an interview phase

As a last extension of the setup in (i)–(iv), consider a situation where (iii) is replaced with

  (iiiʹ) At time 0, Player 2 observes the salary claim together with the result ‘weak result’ or ‘strong result’ of a test, and subsequently also noisy observations of $\mu$ , based upon which a choice is made of a stopping time $\tau$ to terminate the employment.

Thus, in addition to the salary claim C, Player 2 also receives information from an additional test result (cf. [1, 6]) with two possible outcomes. For definiteness, we assume that $\mathbb{P}(\mbox{strong result}\mid \mu=\mu_0)=q\in({c_0}/{c_1},1)$ and

(12) \begin{equation} \mathbb{P}(\mbox{strong result}\mid \mu=\mu_1)=1,\end{equation}

which corresponds to a situation in which strong types always perform well in the interview, and weak types sometimes do. Note that the outcome of the interview test is not available to Player 1, which is similar to the situation studied in [1, 6].

A belief system and strategy for Player 2 can now be described as the quadruples $\Pi_0=\big(\Pi_0^{0,\mathrm{weak}},\Pi_0^{0,\mathrm{strong}},\Pi_0^{1,\mathrm{weak}},\Pi_0^{1,\mathrm{strong}}\big)$ and $\tau=\big(\tau_0^{\mathrm{weak}},\tau_0^{\mathrm{strong}},\tau_1^{\mathrm{weak}},\tau_1^{\mathrm{strong}}\big)$ , where for $i=0,1$ , $\Pi_0^{i,\mathrm{weak}}$ and $\tau_i^{\mathrm{weak}}$ (respectively, $\Pi_0^{i,\mathrm{strong}}$ and $\tau_i^{\mathrm{strong}}$ ) are the belief and stopping time used provided $C=c_i$ and the test result ‘weak result’ (respectively, ‘strong result’) is observed.

By the assumption in (12), if Player 2 observes ‘weak result’ in the test, then Player 1 is automatically of the weak type ( $\Pi_0^{i,\mathrm{weak}}=0$ ), and immediate firing ( $\tau_1^{\mathrm{weak}}=0$ ) would then be optimal in the case $C=c_1$ . Therefore, choosing $C=c_1$ is risky if Player 1 is of the weak type, and the obtained value from choosing $C=c_1$ (for the weak-type player) is $qU\big(\Pi_0^{1,\mathrm{strong}}\big)$ , with U as in (8). We thus define the indifference point $\hat P$ via the indifference principle (cf. Section 4.2) so that $qU(\hat P)={c_0}/{r}$ (note that $\hat P$ is well-defined since $q>c_0/c_1$ by assumption), and let

\begin{align*}a^*=\bigg(\frac{p(1-\hat P)}{\hat P(1-p)q}\wedge 1,1\bigg), \qquad \Pi_0=\big(0,0,0,\Pi_0^{1,\mathrm{strong}}\big),\end{align*}

with

\begin{align*} \Pi_0^{1,\mathrm{strong}} = \hat P\vee\frac{p}{p+(1-p)q}, \qquad \tau^*=(\infty,\infty,0,\tau_b), \qquad \tau_b=\inf\Big\{t\geq 0\colon\tilde\Pi_t^{\Pi_0^{1,\mathrm{strong}}}\leq b\Big\}.\end{align*}

It is then straightforward to check that $(a^*,\tau^*,\Pi_0)$ is a PBE.
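As a numerical illustration of the indifference principle with an interview, the following Python sketch solves $qU(\hat P)=c_0/r$ and checks that the weak type's mixing probability reproduces the belief $\Pi_0^{1,\mathrm{strong}}$. It is a sketch under stated assumptions: the parameter values are those of Figures 1 and 2, the values $q=0.9$ and $p=0.3$ are hypothetical, the quadratic equation (6) is assumed to read $\gamma^2-\gamma-2r\sigma^2/(\mu_1-\mu_0)^2=0$, and $U$ is assumed to have the closed form $U(\pi)=(c_1/r)\big(1-\big(\tfrac{\pi}{1-\pi}\tfrac{1-b}{b}\big)^{\gamma_1}\big)$ for $\pi>b$, which is not quoted from (8). The posterior formula is our own application of Bayes' rule under (12).

```python
import math

# Parameter values from the captions of Figures 1 and 2.
c0, c1, mu0, mu1, r, sigma = 1.2, 1.5, 1.4, 1.7, 0.05, 1.0
q = 0.9  # hypothetical P(strong result | mu = mu_0), must lie in (c0/c1, 1)

# ASSUMPTION: (6) is gamma^2 - gamma - 2 r / omega^2 = 0, omega = (mu1 - mu0) / sigma.
omega = (mu1 - mu0) / sigma
gamma1 = 0.5 - math.sqrt(0.25 + 2.0 * r / omega**2)  # negative root
b = -(c1 - mu0) * gamma1 / (mu1 - c1 - (mu1 - mu0) * gamma1)


def U(pi):
    """ASSUMED closed form of the weak type's value on {C = c1}."""
    if pi <= b:
        return 0.0
    odds_ratio = (pi / (1.0 - pi)) * ((1.0 - b) / b)
    return (c1 / r) * (1.0 - odds_ratio**gamma1)


# Solving q * U(P_hat) = c0 / r in closed form under the assumed U.
k_q = (1.0 - c0 / (q * c1)) ** (1.0 / gamma1)
P_hat = b * k_q / (1.0 - b + b * k_q)


def weak_mixing(p):
    """Probability that the weak type claims c1 for prior p."""
    return min(1.0, p * (1.0 - P_hat) / (P_hat * (1.0 - p) * q))


def posterior_strong(p, a_weak):
    """P(mu = mu_1 | C = c1, strong result), strong type always claiming c1."""
    return p / (p + (1.0 - p) * a_weak * q)


p = 0.3  # hypothetical prior
print(round(P_hat, 5), round(weak_mixing(p), 5), round(posterior_strong(p, weak_mixing(p)), 5))
```

With these values the posterior after a strong result coincides with $\hat P$, so the weak type is indifferent between the two salary claims.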

Acknowledgements

We wish to thank Kristoffer Glover for enlightening discussions, and for sharing with us preliminary notes on a problem in which Bayesian updating of a fund manager’s skill plays a key role. We also thank Ola Andersson for stimulating discussions on the game-theoretical aspects of the current work.

Funding information

Funding from the Swedish Research Council is gratefully acknowledged.

Competing interests

There were no competing interests to declare which arose during the preparation or publication process of this article.

References

[1] Alós-Ferrer, C. and Prat, J. (2012). Job market signaling and employer learning. J. Econom. Theory 147, 1787–1817.
[2] Banks, J. and Sobel, J. (1987). Equilibrium selection in signaling games. Econometrica 55, 647–661.
[3] Cardaliaguet, P. and Rainer, C. (2009). Stochastic differential games with asymmetric information. Appl. Math. Optim. 59, 1–36.
[4] Cvitanić, J., Possamaï, D. and Touzi, N. (2018). Dynamic programming approach to principal-agent problems. Finance Stoch. 22, 1–37.
[5] Daley, B. and Green, B. (2012). Waiting for news in the market for lemons. Econometrica 80, 1433–1504.
[6] Daley, B. and Green, B. (2014). Market signaling with grades. J. Econom. Theory 151, 114–145.
[7] Dato, S., Grunewald, A., Kräkel, M. and Müller, D. (2016). Asymmetric employer information, promotions, and the wage policy of firms. Games Econom. Behavior 100, 273–300.
[8] De Angelis, T. (2020). Optimal dividends with partial information and stopping of a degenerate reflecting diffusion. Finance Stoch. 24, 71–123.
[9] Ekström, E., Lindensjö, K. and Olofsson, M. (2022). How to detect a salami slicer: A stochastic controller-and-stopper game with unknown competition. SIAM J. Control Optim. 60, 545–574.
[10] Ekström, E. and Vaicenavicius, J. (2020). Optimal stopping of a Brownian bridge with an unknown pinning point. Stoch. Process. Appl. 130, 806–823.
[11] Ferguson, T. (2020). A Course in Game Theory. World Scientific, Hackensack, NJ.
[12] Fudenberg, D. and Tirole, J. (1991). Perfect Bayesian equilibrium and sequential equilibrium. J. Econom. Theory 53, 236–250.
[13] Grün, C. (2013). On Dynkin games with incomplete information. SIAM J. Control Optim. 51, 4039–4065.
[14] Harsanyi, J. (1967). Games with incomplete information played by ‘Bayesian’ players. I. The basic model. Manag. Sci. 14, 159–182.
[15] Holmström, B. and Milgrom, P. (1987). Aggregation and linearity in the provision of intertemporal incentives. Econometrica 55, 303–328.
[16] Kreps, D. and Wilson, R. (1982). Sequential equilibria. Econometrica 50, 863–894.
[17] Lakner, P. (1995). Utility maximization with partial information. Stoch. Process. Appl. 56, 247–273.
[18] Lindahl, L.-Å. (2017). Non-Cooperative Games: An Introduction to Game Theory – Part I. Available at: https://bookboon.com/en/non-cooperative-games-introduction-ebook.
[19] Liptser, R. and Shiryaev, A. (2001). Statistics of Random Processes. I. General Theory, 2nd edn. Springer, Berlin.
[20] Osborne, M. and Rubinstein, A. (1994). A Course in Game Theory. MIT Press, Cambridge, MA.
[21] Sannikov, Y. (2008). A continuous-time version of the principal-agent problem. Rev. Econom. Stud. 75, 957–984.
[22] Shiryaev, A. (1973). Optimal Stopping Rules (Appl. Math. 8). Springer, New York.
[23] Spence, M. (1973). Job market signaling. Quart. J. Econom. 87, 355–374.
[24] Weiss, A. (1983). A sorting-cum-learning model of education. J. Political Econom. 91, 420–442.

Figure 1. The value function $V(\pi)$ of the employer on the event $\{C=c_1\}$. The parameter values chosen for this example figure are $c_1=1.5$, $\mu_0=1.4$, $\mu_1=1.7$, $r=0.05$, and $\sigma=1$. The value function attains positive values only after the boundary level $b \approx 0.167$, and it approaches its maximum value $(\mu_1-c_1)/r$ for $\pi$ close to 1.

Figure 2. The value function $U(\pi)$ for the weak-type employee on the event $\{C = c_1\}$. The parameter values of $c_1$, $\mu_0$, $\mu_1$, r, and $\sigma$ are the same as in Figure 1, and $c_0=1.2$. Here, $\hat p$ is the unique value such that $U(\hat p)=c_0/r$.