Assignment | Out | Due |
---|---|---|
Getting Started | May 14 | May 25 |
SQL | May 25 | June 1 |
Web Crawling | June 1 | June 8 |
MapReduce | June 8 | June 17 |
Stats | June 17 | June 24 |
Machine Learning | July 6 | July 13 |
Data Visualization | July 23 | July 30 |
Labs are designed to help with assignments and the deliverables for the final project.
Every student will be assigned a lab slot during shopping period. Details for lab checkoff and attendance can be found in the syllabus.
All labs will happen on Gather.town here, password being "pandas". Please use Signmeup to ask questions and get checked off for labs. Please refer to our Lab Guide if you want more information on how to open Google Colab.
Week | Topic |
---|---|
May 20 – May 26 | Intro to SQL |
May 27 – June 2 | Pandas/Numpy Lab |
June 3 – June 9 | Intro to MapReduce |
June 15 – June 21 | Stats |
July 2 – July 15 | Supervised Machine Learning |
July 16 – July 26 | Python Data Visualization |
N/A | NLP & Matrix Factorization (Optional) |
N/A | D3.js Data Visualization (Optional) |
N/A | ML Fairness (Optional) |
Project groups will work closely with a TA Advisor throughout the semester. The dates for checkins are approximate, and may vary depending on your group's schedule and that of your advisor.
Project Checkpoint | Out | Due |
---|---|---|
Data Deliverable | June 25 | July 13 |
Exploratory Analysis Deliverable | July 13 | July 27 |
Confirmatory Analysis Deliverable & Presentation | July 27 | August 10 |
Please check out highlighted example projects from the handout. For more example project ideas, please check out our archive! Additionally, you can take a look at Awesome Public Datasets.