Abstract
We consider a gradual-impulse control problem of continuous-time Markov decision processes, where the system performance is measured by the expectation of the exponential utility of the total cost. We show, under natural conditions on the system primitives, the existence of a deterministic stationary optimal policy out of a more general class of policies that allow multiple simultaneous impulses, randomized selection of impulses with random effects, and accumulation of jumps. After characterizing the value function using the optimality equation, we reduce the gradual-impulse control problem to an equivalent simple discrete-time Markov decision process, whose action space is the union of the sets of gradual and impulsive actions.
| Original language | English |
|---|---|
| Pages (from-to) | 301-334 |
| Number of pages | 34 |
| Journal | Advances in Applied Probability |
| Volume | 53 |
| Issue number | 2 |
| DOIs | |
| Publication status | Published - 1 Jul 2021 |
Bibliographical note
Funding Information: We thank the editors and referees for comments and remarks that significantly improved the readability of this paper. This work was supported by the Royal Society (grant number IE160503) and the Daiwa Anglo-Japanese Foundation (UK) (grant reference 4530/12801).
Publisher Copyright:
© 2021 Cambridge University Press. All rights reserved.
Keywords
- Continuous-time Markov decision processes
- dynamic programming
- gradual-impulse control
- optimality equation
ASJC Scopus subject areas
- Statistics and Probability
- Applied Mathematics