Data is the new currency! In every walk of life we leave digital traces, which are stored and analyzed at both individual and population levels, by businesses for improving products and services, by governments for policy-making and national security, and by scientists for advancing the frontiers of human knowledge.
This course is intended as an introduction to various aspects of working with data—from acquisition, integration to querying, analysis, and visualization—for different types of data—from unstructured text to structured databases. Through lectures and hands-on labs with real, interesting datasets, we will learn both fundamental concepts and computational tools for working with data. The course ends with a capstone project, which applies what we learn to a dataset of your choice.
This course is open to students from both inside and outside computer science. Dealing with data requires more than just computer programming: What do we know about the processes underlying the data? What are the interesting questions to ask about data? What practical impacts can arise from the data? What constitute ethical uses? Therefore, we also welcome students with analytical backgrounds (e.g., statistics, math) or knowledge in fields that would benefit from data analysis (e.g., social science, public policy).
Prerequisites: Equivalent of CompSci 101 or consent of the instructor.