All occurrences of this event have passed.
This listing is displayed for historical purposes.

Free Lecture / Discussion

Presented By: Department of Statistics

Ask a Question About This Event

Statistics Department Seminar Series: Linjun Zhang, Associate Professor, Department of Statistics, Rutgers University

Name: Statistics Department Seminar Series: Linjun Zhang, Associate Professor, Department of Statistics, Rutgers University
Start: 2025-11-07T10:00:00-05:00
End: 2025-11-07T11:00:00-05:00
Location: West Hall

"A Statistical Hypothesis Testing Framework for Data Misappropriation Detection in Large Language Models"

Abstract: Large Language Models (LLMs) are rapidly gaining enormous popularity in recent years. However, the training of LLMs has raised significant privacy and legal concerns, particularly regarding the inclusion of copyrighted materials in their training data without proper attribution or licensing, which falls under the broader issue of data misappropriation. In this article, we focus on a specific problem of data misappropriation detection, namely, to determine whether a given LLM has incorporated data generated by another LLM. To address this issue, we propose embedding watermarks into the copyrighted training data and formulating the detection of data misappropriation as a hypothesis testing problem. We develop a general statistical testing framework, construct a pivotal statistic, determine the optimal rejection threshold, and explicitly control the type I and type II errors. Furthermore, we establish the asymptotic optimality properties of the proposed tests, and demonstrate its empirical effectiveness through intensive numerical experiments.

Co-Sponsored By

Department of Statistics Seminar Series

Explore Similar Events

Loading Similar Events...

Keywords

Seminar

0 upcoming occurrence
0 expired occurrence

Happening @ Michigan

The University of Michigan Events Calendar

Sponsors

Keywords

Types

Search Results

Events

Statistics Department Seminar Series: Linjun Zhang, Associate Professor, Department of Statistics, Rutgers University

"A Statistical Hypothesis Testing Framework for Data Misappropriation Detection in Large Language Models"

Related Links

Co-Sponsored By

Explore Similar Events

Keywords

Contact Event Organizers: Department of Statistics

When and Where

Map West Hall - 340

November 2025

Contact Us

Happening @ Michigan

The University of Michigan Events Calendar

Sponsors

Keywords

Types

Search Results

Events

Statistics Department Seminar Series: Linjun Zhang, Associate Professor, Department of Statistics, Rutgers University

"A Statistical Hypothesis Testing Framework for Data Misappropriation Detection in Large Language Models"

Related Links

Co-Sponsored By

Share Event

Explore Similar Events

Keywords

Contact Event Organizers: Department of Statistics

When and Where

Map West Hall - 340

November 2025

Contact Us