Presented By: Industrial & Operations Engineering
IOE 899: High-dimensional Optimization with Applications to Compute-Optimal Neural Scaling Laws
Courtney Paquette, McGill University
About the speaker: Courtney Paquette is an assistant professor at McGill University and a CIFAR Canada AI Chair at Mila. She was awarded a Sloan Research Fellowship in Computer Science in 2024. Paquette’s research broadly focuses on designing and analyzing algorithms for large-scale optimization problems, motivated by applications in data science. She is also interested in scaling limits of stochastic learning algorithms. She received her PhD from the mathematics department at the University of Washington (2017), held postdoctoral positions at Lehigh University (2017-2018) and the University of Waterloo (NSF postdoctoral fellowship, 2018-2019), and spends 20% of her time as a research scientist at Google DeepMind, Montreal.
Abstract: Given the massive scale of modern ML models, we now only get a single shot to train them effectively. This restricts our ability to test multiple architectures and hyper-parameter configurations. Instead, we need to understand how these models scale, allowing us to experiment with smaller problems and then apply those insights to larger-scale models. In this talk, I will present a framework for analyzing scaling laws in stochastic learning algorithms using a power-law random features model, leveraging high-dimensional probability and random matrix theory. I will then use this scaling law to address the compute-optimal question: How should we choose model size and hyper-parameters to achieve the best possible performance in the most compute-efficient manner?
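For readers unfamiliar with the setup named in the abstract, the sketch below shows one common way a power-law random features least-squares problem is set up and trained with one-pass SGD. This is not the speaker's code or model; the exponents `alpha` and `beta`, the dimensions, the step size, and the Gaussian feature map are all illustrative assumptions.

```python
# Minimal sketch (assumed setup, not from the talk): power-law random
# features regression trained with one-pass SGD.
import numpy as np

rng = np.random.default_rng(0)

d = 2000        # ambient data dimension (assumed)
v = 400         # number of random features, i.e. model size (assumed)
alpha = 1.2     # power-law decay of the data spectrum (assumed)
beta = 0.6      # power-law decay of the target coefficients (assumed)

# Data covariance with power-law spectrum: eigenvalue_j ~ j^{-alpha}
eigs = np.arange(1, d + 1, dtype=float) ** (-alpha)
# Target coefficients with power-law decay: b_j ~ j^{-beta}
b = np.arange(1, d + 1, dtype=float) ** (-beta)

# Random features map x -> W x with a Gaussian matrix W of shape (v, d)
W = rng.standard_normal((v, d)) / np.sqrt(d)

def sample_batch(n):
    """Draw n points x ~ N(0, diag(eigs)) and noiseless labels y = <b, x>."""
    x = rng.standard_normal((n, d)) * np.sqrt(eigs)
    return x, x @ b

# One-pass SGD on the random-features least-squares objective
theta = np.zeros(v)
lr = 0.5 / v            # illustrative step size
batch, steps = 8, 5000
for _ in range(steps):
    x, y = sample_batch(batch)
    feats = x @ W.T                     # (batch, v) random features
    resid = feats @ theta - y
    theta -= lr * feats.T @ resid / batch

# Estimate the population risk on a large held-out sample
x_test, y_test = sample_batch(20000)
risk = np.mean((x_test @ W.T @ theta - y_test) ** 2)
print(f"model size v={v}, steps={steps}, estimated risk {risk:.4f}")
```

Sweeping the model size `v` and the number of SGD steps at a fixed compute budget in a toy setup like this is one way to visualize the compute-optimal trade-off the abstract asks about.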