Tentative Schedule: Slides, Readings, References
· (DM) Data Mining: Concepts and Techniques, 3rd
edition, by Jiawei Han, Micheline Kamber, and Jian Pei. Morgan Kaufmann, 2011
· (IR) Introduction to information retrieval, by
Christopher Manning, Prabhakar Raghavan, and Hinrich Schutze. Cambridge
University Press, 2008
https://nlp.stanford.edu/IR-book/pdf/irbookonlinereading.pdf
· (PY) Introduction to Machine Learning with Python: A
Guide for Data Scientists, by Andreas C. Muller and Sarah Guido. O’Reilly 2016
|
Date |
Slides |
Chapters |
additional materials |
|
Week 1 |
Syllabus
and Introduction |
DM Chapter 1,2 |
|
|
|
|
||
|
Week2 |
|
|
|
|
|
|||
|
Week3 |
|
linear
algebra quick review, probability
quick review, statistics
basics |
|
|
Week4 |
numpy
and pandas tutorials. linear algebra examples, pandas examples with simple
datasets : sample
datasets |
|
|
|
|
DM Chapter 3.1, 3.2, IR 2.1, 2.2 |
||
|
Week 5 |
IR 6.2,6.3,6.4 |
|
|
|
week 5, 6 |
|
Project 1 announced, |
|
|
week 6 |
DM 8.1 |
|
|
|
Week 7 |
|
||
|
Week 8 |
DM 7.5 |
||
|
Week 9 |
DM8.5 |
Project 2 announced |
|
|
Week 10 |
Decision tree models |
DM 8.2, 8.6 |
|
|
Week 11 |
|
|
|
|
Week 12 |
IR 13.5 |
decision tree exercise |
|
|
week 12 |
|||
|
|
DM 10.1-10.3 |
||
|
|
DM 6.1, 6.2 |
||
|
|
|
|
|
|
|
|
|