[AI Alliance] Workshop: Preparing High Quality Datasets with Data Prep Kit
Thursday, March 27th, 2025: 12:00 PM to 1:00 PM
Join us for an interactive, hands-on session where you will learn to clean up data and prepare high-quality datasets.
Virtual
When building machine learning and data applications, a significant portion of your time will be dedicated to data wrangling, from content extraction to filtering out problematic and low-quality data. In this hands-on session we will explore Data Prep Kit, an open source toolkit designed to streamline these essential tasks.
Attendees will learn firsthand how to use Data Prep Kit to improve overall data quality by removing spam and low-quality documents, HAP (Hate, Abuse, Profanity) speech, and PII (Personally Identifiable Information), leading to higher-quality datasets.
Hosted by Data, Cloud and AI in Miami