All occurrences of this event have passed.
This listing is displayed for historical purposes.

Free Workshop / Seminar

Presented By: Department of Statistics

Statistics Department Seminar Series: Xianyang Zhang, Professor, Department of Statistics, Texas A&M University.

Start: 2025-02-21T10:00:00-05:00
End: 2025-02-21T11:00:00-05:00
Location: West Hall

"Detecting and Segmenting Watermarked Texts from Language Models"

Abstract: The rapid adoption of large language models (LLMs), such as GPT-4 and Claude 3.5, underscores the need to distinguish LLM-generated text from human-written content to mitigate the spread of misinformation, misuse in education, and LLM training data contamination. One promising approach to address this issue is the watermark technique, which embeds subtle statistical signals into LLM-generated text to enable reliable identification. In this work, we enhance watermark detection using adaptive methods that assign higher weights to tokens with smaller next-token probabilities (NTPs), where NTPs quantify the likelihood of a token appearing based on its preceding context. We rigorously analyze the Type I and Type II errors of the proposed method and demonstrate its superior detection power through numerical experiments. Because the true prompts, and thus the true NTPs, are unavailable, we introduce a prompt estimation method that identifies the most likely prompt from an instruction set to estimate NTPs. Furthermore, we develop a statistical framework for segmenting text into watermarked and non-watermarked substrings by framing it as a change point detection problem. Extensive experiments validate the proposed methods, demonstrating their effectiveness in detection, segmentation, and robustness.

https://zhangxiany-tamu.github.io/

Co-Sponsored By

Department of Statistics Seminar Series

Keywords

Seminar

Happening @ Michigan

The University of Michigan Events Calendar

Contact Event Organizers: Department of Statistics

When and Where

West Hall - 340