Fridays 4:30 pm - 7:10 pm at Robinson Hall B124 (Jan 21, 2020 - May 13, 2020)
Grade composition: No in-class examination. The grade is based entirely on class participation, homework assignments, a take-home midterm, and a final project.
Diez, Barr and Cetinkaya-Rundel, OpenIntro Statistics, OpenIntro, 2015.
James, Witten, Hastie and Tibshirani, An Introduction to Statistical Learning with Applications in R, Springer, 2013.
Kuhn and Johnson, Applied Predictive Modeling, Springer, 2013.
Hyndman and Athanasopoulos, Forecasting: Principles and Practice, OTexts, 2013.
Airbnb (Random Forest)
Facebook (decision trees and logistic regression)
YouTube (deep learning)
Uber (time series)
Debby Kermer (data services): contact info
UCI ML Repository (Lots of datasets, each with a description)
KDnuggets (Lots of links to datasets relevant to data mining)
EconData (Links to plenty of regional, state, and local economic data)
3stages (Searchable listing of 363 Internet sites with social science data)
data.gov (A repository for information collected by the federal government)
Chicago (Urban Analytics)
The following courses cover different aspects of data science:
Statistical modeling (STAT 250, SYST 664, OR 719, STAT 554)
Data management (AIT 614)
Optimization (OR 604)