Instructors: Prof. Brian D. Davison
Prof. Roger N. Nagel
Time/Location: W 4:10-7pm in MG 103. Introduction: The web has forever changed access to knowledge, and the development of Web 2.0 tools enabled new methods of communication and interaction. The contexts of and relationships among information (and people) are now becoming explicit, enabling new kinds of automated (and semi-automated) analyses. In this seminar, we will explore how to extract and discover knowledge within information, communication and relationship networks and how people use them. Expected topics will include search, usage mining, social networks, information extraction, link analysis, and more.
This course will focus on reading and presenting papers related to mining networks of information, including the world-wide web, social and professional networks, and will include a semester-long project. Paper and presentation critiques will be required, and course participation will be evaluated.
Objectives: To become proficient at reading technical papers; to gain knowledge of important current social network and web mining research; to gain experience presenting technical material; to learn to write critical reviews of research papers; to explore a research project in some depth and write a technical paper summarizing that work. Prerequisite: One or more of CSE345/445 WWW Search Engines, CSE347/447 Data Mining, CSE430 Textual Data Mining, or CSE326/426 Pattern Recognition. Undergraduates with such background are encouraged to petition for entry. Textbook(s): Recommended: Reshaping Your Business with Web 2.0, by Casarez, Cripe, Sini, and Weckerle (McGraw-Hill, 2009); Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data, by Bing Liu (Springer, 2007); Data Mining: Practical Machine Learning Tools and Techniques, 2nd Ed, by Witten and Frank (Morgan Kaufmann, 2005). We will read and discuss significant and recent papers. Useful Links: Syllabus, CourseSite, WEKA