homeworks

Assignment Out Due
Getting Started May 14 May 25
SQL May 25 June 1
Web Crawling June 1 June 8
MapReduce June 8 June 17
Stats June 17 June 24
Machine Learning July 6 July 13
Data Visualization July 23 July 30



labs

Labs are designed to help with assignments and the deliverables for the final project.

Every student will be assigned a lab slot during shopping period. Details for lab checkoff and attendance can be found in the syllabus.

All labs will happen on Gather.town here, password being "pandas". Please use Signmeup to ask questions and get checked off for labs. Please refer to our Lab Guide if you want more information on how to open Google Colab.

Week Topic
May 20 – May 26 Intro to SQL
May 27 – June 2 Pandas/Numpy Lab
June 3 – June 9 Intro to MapReduce
June 15 – June 21 Stats
July 2 – July 15 Supervised Machine Learning
July 16 – July 26 Python Data Visualization
N/A NLP & Matrix Factorization (Optional)
N/A D3.js Data Visualization (Optional)
N/A ML Fairness (Optional)



final project

Project groups will work closely with a TA Advisor throughout the semester. The dates for checkins are approximate, and may vary depending on your group's schedule and that of your advisor.

Project Checkpoint Out Due
Data Deliverable June 25 July 13
Exploratory Analysis Deliverable July 13 July 27
Confirmatory Analysis Deliverable & Presentation July 27 August 10

Please check out highlighted example projects from the handout. For more example project ideas, please check out our archive! Additionally, you can take a look at Awesome Public Datasets.