Module 06: Combining Data

  1. Prepare (due Mon 2/17)
    1. Content below
    2. Canvas quizzes
  2. Class engagement – See on the class forum
  3. Homework (due Sun 2/23) [Link]
  4. Worked Example [Link]

Content (Slides in the Box Folder)

06.A – Summarizing Data

  1. Read Section 3.8 Aggregating and Grouping from Python Data Science Handbook.
  2. Read Section 3.9 Pivot Tables from Python Data Science Handbook.

06.B – Merging Data

  1. Read Section 3.6 Concat and Append from Python Data Science Handbook. Please note that the join_axes optional parameter mentioned in this section has been deprecated from the Pandas library, you can skip over the details on this parameter.
  2. Read Section 3.7 Merge and Join from Python Data Science Handbook
  3. Table Relationships (4 min.)
  4. Which Join to Use (4 min.)
  5. Record Linkage (8 min.)
  6. Fuzzy Matching (21 min.)

Optional Supplements

Leave a Reply

Your email address will not be published. Required fields are marked *