Thompson Sampling Algorithm for Stochastic Differential Games

Name: Thompson Sampling Algorithm for Stochastic Differential Games
Start: 2026-01-21T16:00:00-05:00
End: 2026-01-21T17:00:00-05:00
Location: East Hall

Ruolan He, UM

We study a stochastic differential game with $N$ competitive players in a linear-quadratic framework with ergodic cost, where $d$-dimensional diffusion processes govern the state dynamics and share an unknown common drift matrix. Assuming a Gaussian prior on the drift, we use filtering techniques to update its posterior estimates. Based on these estimates, we propose a Thompson-sampling-based algorithm with dynamic episode lengths to approximate strategies. We show that the Bayesian regret of each player has an upper bound of order $O(\sqrt{T\log(T)})$, where $T$ is the time horizon, independent of the number of players. This implies that the average regret per unit time goes to zero. Finally, we prove that the algorithm results in a Nash equilibrium.
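
To make the ingredients named in the abstract concrete (Gaussian prior on the drift, posterior updates, Thompson sampling, episodes of varying length), here is a minimal toy sketch. It is not the speaker's algorithm. It assumes a single controller rather than $N$ players, a scalar state $dX_t = (aX_t + u_t)\,dt + dW_t$ with unknown $a$, running cost $x^2 + u^2$, an Euler discretization, and a hypothetical episode rule (resample once the posterior precision has doubled). The function name and all parameters are illustrative.

```python
import numpy as np

def thompson_lq_scalar(a_true=0.5, T=50.0, dt=0.01, m0=0.0, v0=1.0, seed=0):
    """Toy Thompson sampling for dX = (a X + u) dt + dW with unknown scalar a.

    Assumptions (not from the talk): one controller, cost x^2 + u^2,
    Euler steps of size dt, prior a ~ N(m0, v0).
    """
    rng = np.random.default_rng(seed)
    precision = 1.0 / v0      # posterior precision of a
    info = m0 / v0            # precision-weighted posterior mean of a
    x = 0.0
    gain = 0.0
    precision_at_start = None
    episode_starts = []
    for k in range(int(T / dt)):
        # Hypothetical episode rule: start a new episode at time 0 and
        # whenever the posterior precision has doubled since the last start.
        if precision_at_start is None or precision >= 2.0 * precision_at_start:
            a_sample = rng.normal(info / precision, np.sqrt(1.0 / precision))
            # Ergodic scalar Riccati equation 2*a*P - P^2 + 1 = 0, positive root.
            gain = a_sample + np.sqrt(a_sample**2 + 1.0)
            precision_at_start = precision
            episode_starts.append(k * dt)
        u = -gain * x
        dx = (a_true * x + u) * dt + np.sqrt(dt) * rng.normal()
        # Conjugate Gaussian update for a from the increment dx
        # (a discretized form of the filtering equations for a constant drift).
        precision += x * x * dt
        info += x * (dx - u * dt)
        x += dx
    return info / precision, 1.0 / precision, episode_starts

# Example call: posterior mean, posterior variance, and episode start times.
post_mean, post_var, starts = thompson_lq_scalar()
```

In this sketch the posterior precision grows with the accumulated $\int X_t^2\,dt$, so episodes start whenever enough new information about the drift has been collected rather than on a fixed schedule.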

Co-Sponsored By

Department of Mathematics


Keywords

Mathematics


Contact Event Organizers: Financial/Actuarial Mathematics Seminar - Department of Mathematics

When and Where

East Hall - 1360
