All occurrences of this event have passed.
This listing is displayed for historical purposes.

Free Careers / Jobs

Presented By: University Career Center

Flag As Inappropriate

Salesforce Research Speaker Series: Robustness Gym: Evaluating NLPModels Using Data Slices

Name: Salesforce Research Speaker Series: Robustness Gym: Evaluating NLPModels Using Data Slices
Start: 2020-10-23T10:00:00-04:00
End: 2020-10-23T11:30:00-04:00

Join Nazneen Rajani in this week's speaker series focused on Evaluating NLP Models Using Data Slices.

Abstract:
Deep neural networkshave proven powerful for various NLP tasks on (large) datasets, but are often not robust to (adversarial) data corruptions, distribution shifts,and other harmful data manipulations. In practice, this can lead to severe vulnerabilities, limited successful generalization to unseen data, and hinder safe deployment. While practitioners are increasingly aware of these problems, the common paradigm of reporting model performance continues to rely on the train-val-test i.i.d. splits.

In this talk, I will introduce the Robustness Gym: an easy-to-use and extendable framework to concisely evaluate the robustness and diagnose model vulnerabilities using data slices, i.e., user-defined splits or transformation of evaluation data, on a wide range of NLP tasks in both classification and generation. Data slices are generic and can represent, for example, semantic sub-populations (gender, age splits), augmentations (sub-word substitutions), adversarial attacks, or statistical splits (e.g., by word frequency).

Our framework is flexible and enables users to programmatically define and evaluate on data slices.

The Robustness Gym provides an overview of model performance on data slices and supports a wide variety of evaluation metrics, including accuracy, precision-recall, and fairness metrics. Our framework can also evaluate model robustness under dataset manipulations,including a range of adversarial attacks, data-augmentations, and different evaluation sets such as stress tests and contrast sets.

RSVP here:https://salesforce.recsolu.com/external/events/AhRCRs2BDJ8W7gVJLaUtdw

Explore Similar Events

Deep Learning Methods for Autonomous Underwater Survey, Reacquisition, and Close-Range Inspection
- 4/21/2025 12:00pm
- Presentation
- Ford Robotics Building
CSEAS Friday Lecture Series. Fact Checking in Low-Resource Languages: A New Dataset and Transformer Model for the Burmese Language
- 4/18/2025 12:00pm
- Lecture / Discussion
- Weiser Hall
CAV Pilot Development and Deployment in Midwest Winter
- 4/15/2025 2:00pm
- Lecture / Discussion
- Virtual
Frontiers in Scientific Machine Learning Seminar 12: Operator Networks Based on Numerical Analysis
- 4/11/2025 12:00pm
- Workshop / Seminar
- GG Brown Laboratory
Sydney Gable Dissertation Defense
- 4/8/2025 1:00pm
- Lecture / Discussion
- 1100 North University Building

0 upcoming occurrence
1 expired occurrence

October 2020

Selected 2020/10/23
1 expired occurrence

10:00am - 11:30am

Happening @ Michigan

The University of Michigan Events Calendar

Sponsors

Keywords

Types

Search Results

Events

Salesforce Research Speaker Series: Robustness Gym: Evaluating NLPModels Using Data Slices

Explore Similar Events

Deep Learning Methods for Autonomous Underwater Survey, Reacquisition, and Close-Range Inspection

CSEAS Friday Lecture Series. Fact Checking in Low-Resource Languages: A New Dataset and Transformer Model for the Burmese Language

CAV Pilot Development and Deployment in Midwest Winter

Frontiers in Scientific Machine Learning Seminar 12: Operator Networks Based on Numerical Analysis

Sydney Gable Dissertation Defense

Selected 2020/10/23
1 expired occurrence

Selected 2020/10/23
1 expired occurrence

Contact Us

Happening @ Michigan

The University of Michigan Events Calendar

Sponsors

Keywords

Types

Search Results

Events

Salesforce Research Speaker Series: Robustness Gym: Evaluating NLPModels Using Data Slices

Share Event

Explore Similar Events

Deep Learning Methods for Autonomous Underwater Survey, Reacquisition, and Close-Range Inspection

CSEAS Friday Lecture Series. Fact Checking in Low-Resource Languages: A New Dataset and Transformer Model for the Burmese Language

CAV Pilot Development and Deployment in Midwest Winter

Frontiers in Scientific Machine Learning Seminar 12: Operator Networks Based on Numerical Analysis

Sydney Gable Dissertation Defense

Selected 2020/10/23 1 expired occurrence

Selected 2020/10/23 1 expired occurrence

Contact Us

Selected 2020/10/23
1 expired occurrence

Selected 2020/10/23
1 expired occurrence