Fall 2017 - CMPT 741 G100
Data Mining (3)
Class Number: 7115
Delivery Method: In Person
Overview
-
Course Times + Location:
Sep 5 – Dec 4, 2017: Mon, 11:30 a.m.–12:20 p.m.
BurnabySep 5 – Dec 4, 2017: Wed, Fri, 11:30 a.m.–12:20 p.m.
Burnaby
-
Instructor:
Ke Wang
wangk@sfu.ca
1 778 782-4667
Description
CALENDAR DESCRIPTION:
The student will learn basic concepts and techniques of data mining. Unlike data management required in traditional database applications, data analysis aims to extract useful patterns, trends and knowledge from raw data for decision support. Such information are implicit in the data and must be mined to be useful.
COURSE DETAILS:
Data mining aims to extract useful patterns, trends and previously unknown knowledge from raw data for decision support. This course has two focuses: basic concepts and techniques, and recent technologies and developments in dealing with very large data sets. For the first focus, we will study the classic data mining techniques including association, classification, and clustering; for the second focus, we will study the dominant software systems and algorithms for coping with Big Data. Topics include large-scale non-traditional data storage frameworks including graph, recommendation algorithms; and data security. The course will involve hands-on programming assignments and projects.
Topics
- 1. Introduction
- 2. Association Rule Mining
- 3. Classification and Supervised Learning
- 4. Clustering and Unsupervised Learning
- 7. Finding Similar Items
- 9. Link Analysis
- 10. Recommendation Systems
- 11. Dimensionality Reduction
- 12. Data Security
Grading
NOTES:
Assignments/Projects (40%), Midterm (20%), and Final exam (40%)
Materials
REQUIRED READING:
Introduction to Data Mining
Pang-Ning Tan
Addison Wesley, 2006
Available online
ISBN: 9780321321367
Lecture notes: a combination of the notes provided by the authors in item 1, the slides of the course “CS345A: data mining” at Stanford University, and the slides of the instructor.,
Mining of Massive Datasets, Anand Rajaraman, Jure Leskovec, and Jeffrey Ullman
Cambridge University Press, 2012
Available free online: http://proquest.safaribooksonline.com/9781316147047?uicode=simonfraser
ISBN: 9781107077232
Data Mining: Concepts and Techniques, 3rd Edition
Han, Kamber, Pei
Morgan Kaufmann, 22 Jun 2011
Available online
ISBN: 9780123814791
Graduate Studies Notes:
Important dates and deadlines for graduate students are found here: http://www.sfu.ca/dean-gradstudies/current/important_dates/guidelines.html. The deadline to drop a course with a 100% refund is the end of week 2. The deadline to drop with no notation on your transcript is the end of week 3.
Registrar Notes:
SFU’s Academic Integrity web site http://students.sfu.ca/academicintegrity.html is filled with information on what is meant by academic dishonesty, where you can find resources to help with your studies and the consequences of cheating. Check out the site for more information and videos that help explain the issues in plain English.
Each student is responsible for his or her conduct as it affects the University community. Academic dishonesty, in whatever form, is ultimately destructive of the values of the University. Furthermore, it is unfair and discouraging to the majority of students who pursue their studies honestly. Scholarly integrity is required of all members of the University. http://www.sfu.ca/policies/gazette/student/s10-01.html
ACADEMIC INTEGRITY: YOUR WORK, YOUR SUCCESS