Learning to reflect: A unifying approach for data-driven stochastic control strategies

Sören Christensen, Claudia Strauch, Lukas Trottner

Research output: Contribution to journal/Conference contribution in journal/Contribution to newspaperJournal articleResearchpeer-review

2 Citations (Scopus)

Abstract

Stochastic optimal control problems have a long tradition in applied probability, with the questions addressed being of high relevance in a multitude of fields. Even though theoretical solutions are well understood in many scenarios, their practicability suffers from the assumption of known dynamics of the underlying stochastic process, raising the statistical challenge of developing purely data-driven strategies. For the mathematically separated classes of continuous diffusion processes and Lévy processes, we show that developing efficient strategies for related singular stochastic control problems can essentially be reduced to finding rate-optimal estimators with respect to the sup-norm risk of objects associated to the invariant distribution of ergodic processes which determine the theoretical solution of the control problem. From a statistical perspective, we exploit the exponential β-mixing property as the common factor of both scenarios to drive the convergence analysis, indicating that relying on general stability properties of Markov processes is a sufficiently powerful and flexible approach to treat complex applications re-quiring statistical methods. We show moreover that in the Lévy case—even though per se jump processes are more difficult to handle both in statistics and control theory—a fully data-driven strategy with regret of significantly better order than in the diffusion case can be constructed utilizing spatial ergodicity of a path-time transformation of the Lévy process in form of its overshoots.

Original languageEnglish
JournalBernoulli
Volume30
Issue3
Pages (from-to)2074-2101
Number of pages28
ISSN1350-7265
DOIs
Publication statusPublished - Aug 2024

Keywords

  • Data-driven singular control
  • Lévy processes
  • diffusion processes
  • exploration vs. exploitation
  • nonparametric statistics
  • overshoots
  • reinforcement learning
  • sup-norm risk

Fingerprint

Dive into the research topics of 'Learning to reflect: A unifying approach for data-driven stochastic control strategies'. Together they form a unique fingerprint.

Cite this