Skip to Content


No results


No results


No results

Search Results


No results
Search events using: keywords, sponsors, locations or event type
When / Where
All occurrences of this event have passed.
This listing is displayed for historical purposes.

Institute for Social Research pres.


Rebecca C. Steorts

poster image poster image
poster image
An Introduction to Entity Resolution, a half-day workshop geared toward statisticians, data scientists, population researchers, and computational social scientists of all experience levels. This hands-on workshop will cover both the theory and practice of probabilistic entity resolution, while demonstrating state of the art techniques using R software and Apache Spark.

Topics include:

• Overview and introduction to entity resolution

• Entity resolution fundamentals (record linkage, de-duplication, blocking, and computational gains)

• Entity resolution evaluation metrics (including precision, reduction ratio, and robustness to tuning parameters)

• Bayesian entity resolution models (including both parametric and nonparametric Bayesian mixture models)

• Hands-on demonstration of state of the art R packages (using blink) and computational gains (using Apache Spark)


Dr. Rebecca C. Steorts is Assistant Professor of the Department of Statistical Science at Duke University and affiliated faculty in Computer Science, Biostatistics and Bioinformatics, the information initiative at Duke (iiD), and the Social Science Research Institute. She also holds a Schedule A appointment at the U.S. Census Bureau.

Steorts main research focus is on entity resolution (record linkage or de-duplication), where the goal is to remove duplicated information from large, noisy databases in the absence of unique identifiers.
poster image poster image
poster image

Explore Similar Events

  •  Loading Similar Events...
Report Event As Inappropriate Contact Event Organizers
Back to Main Content