Presented By: Michigan Institute for Data Science
Scrubbing and Cleaning of Sensitive Data
Lead Presenter: Jonathan Reader, Programmer/Data Analyst, Neurology, University of Michigan Co-Presenters: Nicolas May, Data Systems Manager, Neurology, University of Michigan Kelly Bakulski, Research Assistant Professor, School of Public Health, Univers
Before analysis, data must be retrieved, scrubbed of identifiable information, cleaned (e.g., addressed missing data, reshaped appropriately), and delivered. Using biomedical and transportation datasets as examples of how this generalizable process works, this workshop will walk attendees through a real-world pipeline used to process and deliver datasets. Documentation and code will be made available through GitLab to allow for coding along with the demonstration. As a result of this workshop, attendees will leave with a practical template for implementing their own a data science pipeline.
Explore Similar Events
-
Loading Similar Events...