Presented By: Department of Statistics
Department Seminar Series: Banghua Zhu, PhD Candidate, Department of Electrical Engineering and Computer Sciences, University of California, Berkeley
"Towards Principled Post-Training of Large Language Models"
Abstract: Reinforcement Learning from Human Feedback (RLHF) is a pivotal technique for aligning large language models (LLMs) with human values, and it has been used to create several leading LLMs, including GPT-4, Claude, and Llama 2. The first step of RLHF is to learn human preferences by fitting a reward model to ranking data. In practice, the reward model's performance degrades after one epoch of training, and over-optimizing the language model against the learned proxy reward model hurts the true objective. This talk examines these issues, leveraging theoretical insights from statistical decision theory to design improved reward learning algorithms. We also introduce advanced prompting techniques that generate high-quality synthetic ranking datasets for RLHF. By combining this high-quality RLHF dataset with our improved reward learning algorithms, we built the open-source language model Starling-7B, which ranks first among all 7B models by human evaluation in Chatbot Arena.
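For readers unfamiliar with the first step the abstract describes, the sketch below illustrates what "learning a reward model from ranking data" typically means: fitting a scalar reward function so that chosen responses score higher than rejected ones under a pairwise (Bradley-Terry-style) loss. This is a minimal illustration, not the speaker's code; the ToyRewardModel, the fixed-size embedding inputs, and the synthetic data are assumptions made for brevity, whereas real RLHF pipelines use an LLM backbone over (prompt, response) text.

# Minimal sketch: pairwise reward learning from ranking data (PyTorch).
import torch
import torch.nn as nn

class ToyRewardModel(nn.Module):
    def __init__(self, dim: int = 128):
        super().__init__()
        # Maps a (prompt, response) embedding to a scalar reward.
        self.net = nn.Sequential(nn.Linear(dim, 64), nn.ReLU(), nn.Linear(64, 1))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x).squeeze(-1)

def pairwise_ranking_loss(r_chosen: torch.Tensor, r_rejected: torch.Tensor) -> torch.Tensor:
    # Negative log-likelihood that the chosen response beats the rejected one.
    return -nn.functional.logsigmoid(r_chosen - r_rejected).mean()

if __name__ == "__main__":
    torch.manual_seed(0)
    model = ToyRewardModel()
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    # Synthetic ranking data: each row stands in for an embedding of a chosen / rejected response.
    chosen, rejected = torch.randn(256, 128), torch.randn(256, 128)
    for step in range(100):
        loss = pairwise_ranking_loss(model(chosen), model(rejected))
        opt.zero_grad()
        loss.backward()
        opt.step()
    print(f"final pairwise loss: {loss.item():.4f}")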