- Prepare (due M 1/31)
- Content below
- Sakai quizzes
- Peer Instructions – See on the class forum
- Homework (due Su 2/6)
- Worked Example
Content (Slides in the Box folder)
4.A – What is Wrangling
- Data sources, formats, and importing (26 min.)
- Common data cleaning problems (16 min.)
- Read Section 3.4 Handling Missing Data from Python Data Science Handbook
4.B – Wrangling Text
- Python string operations (16 min.)
- Introduction to regular expressions (18 min.)
- Read Section 3.10 Vectorized String Operations from Python Data Science Handbook