This course gives students solid practical skills in implementing basic statistical and machine learning techniques for predictive analytics. Throughout the course, many real-world case studies motivate each method and illustrate its strengths and the settings in which it is appropriate. In those case studies, students learn to apply data cleaning, visualization, and other exploratory data analysis tools to a variety of complex real-world data sets. Students also gain experience with reproducibility and documentation of computational projects and with developing basic data products for predictive analytics. The following techniques are implemented and then evaluated with cross-validation: regularization in linear models, regression and smoothing splines, k-nearest neighbors, and tree-based methods, including random forests.
This course is not currently scheduled. Please contact the concierge to learn more.