Web Mining Seminar
CSE 450: Spring 2008 Syllabus

Course Web page
The course Web page is http://www.cse.lehigh.edu/~brian/course/webmining/. Announcements, assignments, and some lecture materials will be available there.
Instructor
The course will be taught by Prof. Brian D. Davison. My email is davison (at) cse.lehigh.edu. My homepage is http://www.cse.lehigh.edu/~brian/. My office is in Packard 380, and my office hours and contact information are posted on my home page. All other meetings should be by appointment.
Time/Location
Class meetings will be held MWF 11:10-noon in Maginnes 113.
Textbooks
Required: Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data, by Bing Liu (Springer, 2007). Recommended: Data Mining: Practical Machine Learning Tools and Techniques, 2nd Ed, by Witten and Frank (Morgan Kaufmann, 2005).

Reading List
A schedule of topics and papers to read will be published on the course website.

Prerequisites
CSE345/445 WWW Search Engines or CSE430 Textual Data Mining or CSE 347/447 Data Mining or CSE326/426 Pattern Recognition or permission of the instructor.
Expected Work
One midterm is expected. Attendance is required. Coursework will primarily consist of paper critiques and presentations along with a semester-long web mining project (from which results will be presented to the class and in a technical report).
Grading Components
Grades will be determined through a combination of midterm, class participation, presentation quality and preparedness, paper critiques, and course project. (Percentages still to be determined.)

A late project or homework will be docked 10% of its total value for each 24 hour period for which it is late. No work will be accepted more than five days late, nor for assignments for which a solution has been posted or presented in class.

Topics to be Covered
  • Web Content Mining
  • Web Structure Mining
  • Web Usage Mining
  • Policy on Academic Integrity and Collaboration
    All work, unless explicitly stated in the problem definition, is to be an individual effort. You are encouraged to discuss assignments with one another, your friends, and with the instructors and graders of the course. Indeed, this may be the most effective method of learning. You may share concepts, approaches and strategies for producing a solution. However all work submitted in your name must be your own. If necessary, violations will be considered as cases of academic dishonesty.

    It is sometimes difficult to know where to draw the line between educationally useful sharing of ideas and the educationally destructive copying of ideas. Please refer to the "Collaboration Policy" statement for more examples of what is and what is not unfair collaboration.

    Policy on Disabilities
    If you have a disability for which you are or may be requesting accommodations, please contact both your instructor and the Office of Academic Support Services, University Center C212 (610-758-4152) as early as possible in the semester. You must have documentation from the Academic Support Services office before accommodations can be granted.
    Other Relevant University Policies
  • Religious Holidays
  • Lehigh Computer Usage
  • Academic Integrity

  • This page is http://www.cse.lehigh.edu/~brian/course/2008/webmining/syllabus.html
    Last revised: 11 November 2007, Prof. Davison.