Miaomiao Zhang

CSE 426 / 326: Fundamentals of Machine Learning

Description: This introductory course offers a broad overview of the main techniques in machine learning. Students will study the basic concepts of advanced machine learning methods as well as their theoretical background. Topics covered include learning theory (bias/variance tradeoffs, VC theory); supervised learning (parametric/nonparametric methods, Bayesian models, support vector machines, neural networks); unsupervised learning (dimensionality reduction, kernel methods, clustering); and reinforcement learning.

  • Class meetings: MW 2:35pm-3:50pm @ Neville Hall 003

  • Instructor: Miaomiao Zhang (miaomiao-at-cse.lehigh.edu)

  • Office hours: MW 4:00pm-5:00pm @ PL514A

  • Teaching Assistant: Yuan Xue (yux715-at-lehigh.edu), TTh 9:00am-11:00am @ PL114, or by appointment

Lecture materials, including slides and notes, will be posted on CourseSite.

Prerequisites by Topic

  • Linear algebra: matrices, vectors, vector spaces, basic matrix transformations, eigenvalues and linear differential equations

  • Probability and statistics: distributions of random variables, Bayes' rule, random sampling, hypothesis testing, correlation and regression

  • Programming: algorithm design and implementation in Matlab, Python, or C/C++

Textbooks

Shai Shalev-Shwartz and Shai Ben-David, “Understanding Machine Learning: From Theory to Algorithms”, 1st Edition, Cambridge University Press, 2014, ISBN 978-1107057135

Mehryar Mohri, Afshin Rostamizadeh and Ameet Talwalkar, “Foundations of Machine Learning (Adaptive Computation and Machine Learning series)”, The MIT Press, 2012, ISBN 978-0262018258

Christopher Bishop, “Pattern Recognition and Machine Learning”, Springer, 2006, ISBN 978-0-387-31073-2

Grading

  • Midterm (15%)

  • Problem sets and projects (65%)

  • Final (15%)

  • Participation (5%)

All programming will be in Matlab, which is available to Lehigh students at no cost: https://software.lehigh.edu/install/. Reports must be written in LaTeX and submitted as PDF.
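For reference, a minimal LaTeX skeleton for a problem-set report is sketched below; the file name, title, and package choices are only illustrative assumptions, not a required course template.

    % ps_report.tex - a minimal sketch, not an official course template
    \documentclass[11pt]{article}
    \usepackage{amsmath}    % math environments for derivations
    \usepackage{graphicx}   % \includegraphics for result figures
    \title{CSE 426/326 Problem Set 1}
    \author{Your Name}
    \date{\today}
    \begin{document}
    \maketitle
    \section*{Problem 1}
    % Write derivations in display math, e.g. the least-squares solution:
    \[ \hat{w} = (X^\top X)^{-1} X^\top y \]
    \end{document}

Compiling with pdflatex (e.g., pdflatex ps_report.tex) produces the PDF to submit.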

Homework is due by midnight (11:59:59 PM) on the due date. The late penalty is 15% per day, and assignments more than three days late will not be accepted.

Cheating

Discussion with classmates is strongly encouraged. However, every assignment must be your own work, based on your own understanding of the questions. Both copying someone else's solution and letting someone copy yours are considered cheating.

You must work on exams independently and within the allotted time. Students caught cheating will fail the course. Please don't take that chance.

Schedule

Date    Topics                                              Problem sets
Week 1
Jan 22  Course Introduction                                 PS0, due on Jan 24
Jan 24  Basic Concepts                                      PS0 due
Week 2
Jan 29  Linear Regression Methods
Jan 31  The Perceptron                                      PS1, due on Feb 12
Week 3
Feb 5   Support Vector Machines
Feb 7   Learning Theory
Week 4
Feb 12  Clustering Algorithms - Nearest Neighbors, K-means  PS1 due
Feb 14  Probabilistic Modeling, MLE                         PS2, due on Feb 25
Week 5
Feb 19  Gaussian Mixture Model and Graphical Models
Feb 21  The EM Algorithm
Week 6
Feb 26  Midterm Review by TA (instructor traveling)
Feb 28  Midterm Exam
Week 7
Mar 5   Data Dimensionality Reduction - PCA                 PS3, due on Mar 19
Mar 7   Kernel Methods
Week 8
Mar 12  Spring break
Mar 14  Spring break
Week 9
Mar 19  Neural Networks I                                   PS3 due
Mar 21  Neural Networks II                                  PS4, due on Apr 2
Week 10
Mar 26  Stochastic Gradient Descent
Mar 28  Bayesian Models, MAP
Week 11
Apr 2   Markov Chain Monte Carlo Methods                    PS4 due
Apr 4   Random Sampling                                     PS5, due on Apr 16
Week 12
Apr 9   Boosting
Apr 11  Model Selection
Week 13
Apr 16  Random Forest                                       PS5 due
Apr 18  Structured Prediction I
Week 14
Apr 23  Structured Prediction II
Apr 25  Basics of Reinforcement Learning
Week 15
Apr 30  Final project presentation
May 2   Final project presentation
May 9   Final project due

Disclaimer

The instructor reserves the right to make changes to the course schedule, syllabus, and project deadlines. Any changes will be announced well in advance.