The course Web page is http://www.cse.lehigh.edu/~brian/course/webmining/. Announcements, assignments, and some lecture materials will be available there.
The course will be taught by Prof. Brian D. Davison. My email is davison (at) cse.lehigh.edu. My homepage is http://www.cse.lehigh.edu/~brian/. My office is in Packard 380, and my office hours and contact information are posted on my home page. All other meetings should be by appointment.
Class meetings will be held MWF 11:10-noon in Maginnes 113.
Required: Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data, by Bing Liu (Springer, 2007). Recommended: Data Mining: Practical Machine Learning Tools and Techniques, 2nd Ed, by Witten and Frank (Morgan Kaufmann, 2005).
A schedule of topics and papers to read will be published on the course website.Prerequisites
CSE345/445 WWW Search Engines or CSE430 Textual Data Mining or CSE 347/447 Data Mining or CSE326/426 Pattern Recognition or permission of the instructor.Expected Work
One midterm is expected. Attendance is required. Coursework will primarily consist of paper critiques and presentations along with a semester-long web mining project (from which results will be presented to the class and in a technical report).Grading Components
Grades will be determined through a combination of midterm, class participation, presentation quality and preparedness, paper critiques, and course project. (Percentages still to be determined.)Topics to be CoveredA late project or homework will be docked 10% of its total value for each 24 hour period for which it is late. No work will be accepted more than five days late, nor for assignments for which a solution has been posted or presented in class.
Policy on Academic Integrity and CollaborationWeb Content Mining Web Structure Mining Web Usage Mining
All work, unless explicitly stated in the problem definition, is to be an individual effort. You are encouraged to discuss assignments with one another, your friends, and with the instructors and graders of the course. Indeed, this may be the most effective method of learning. You may share concepts, approaches and strategies for producing a solution. However all work submitted in your name must be your own. If necessary, violations will be considered as cases of academic dishonesty.Policy on DisabilitiesIt is sometimes difficult to know where to draw the line between educationally useful sharing of ideas and the educationally destructive copying of ideas. Please refer to the "Collaboration Policy" statement for more examples of what is and what is not unfair collaboration.
If you have a disability for which you are or may be requesting accommodations, please contact both your instructor and the Office of Academic Support Services, University Center C212 (610-758-4152) as early as possible in the semester. You must have documentation from the Academic Support Services office before accommodations can be granted.Other Relevant University Policies
Religious Holidays Lehigh Computer Usage Academic Integrity