All occurrences of this event have passed.
This listing is displayed for historical purposes.

Free Workshop / Seminar

Presented By: Department of Mathematics

Financial/Actuarial Mathematics Seminar

Name: Financial/Actuarial Mathematics Seminar
Start: 2022-03-16T15:00:00-04:00
End: 2022-03-16T16:00:00-04:00
Location: Virtual

Exploration-exploitation trade-off for continuous-time episodic reinforcement learning with linear-convex models

We develop a probabilistic framework for analysing model-based reinforcement learning in the episodic setting. We then apply it to finite-time-horizon stochastic control problems with linear dynamics but unknown coefficients, and with a convex but possibly irregular objective function. Using probabilistic representations, we study the regularity of the associated cost functions and establish precise estimates for the performance gap between applying the optimal feedback control derived from the estimated model parameters and that derived from the true ones. Next, we propose a phase-based learning algorithm and show how to optimise its exploration-exploitation trade-off. The algorithm achieves sublinear (or even logarithmic) regret, both in high probability and in expectation, matching the best possible results from the literature.

Speaker(s): Yufei Zhang (LSE)

Co-Sponsored By

Financial/Actuarial Mathematics Seminar - Department of Mathematics

Keywords

Mathematics

Happening @ Michigan

The University of Michigan Events Calendar

Contact Event Organizers: Department of Mathematics

When and Where

Virtual - Zoom

March 2022
