Data Mining

Data Mining is the process of extracting knowledge from data. The main topics covered in this course include pre-processing, sorting, grouping, membership rules, anomaly, and the data mining process itself. The discipline aims to provide students with the fundamental skills needed to conduct their own research in data mining.


  1.  Mohammed J. Zaki and Wagner Meira Jr. Data Mining and Analysis: Fundamental Concepts and Algorithms. Cambridge University Press, May 2014.
  2. Ian H. Witten, Eibe Frank, and Mark A. Hall. Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann, Burlington, MA, 3 edition, January 2011.
  3. Jiawei Han, Micheline Kamber, and Jian Pei. Data Mining: Concepts and Techniques. Morgan Kaufmann, Waltham, Mass., 3 edition, July 2011.
  4.  Gareth James, Daniela Witten, Trevor Hastie, and Robert Tibshirani. An Introduction to Statistical Learning: with Applications in R . Springer, 1st edition, August 2013.
  5. Trevor Hastie, Robert Tibshirani, and Jerome Friedman. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, 2nd edition, April 2011.
  6. Bing Liu. Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data. Springer, softcover reprint of hardcover 2nd ed. 2011 edition, August 2013.

This course is regularly offered once a year at CEFET/RJ for graduate students (PPCIC and PPPRO).

Slides and schedule available at Moodle.


About Eduardo Ogasawara
I am a Professor of the Computer Science Department of the Federal Center for Technological Education of Rio de Janeiro (CEFET / RJ) since 2010. I hold a PhD in Systems Engineering and Computer Science at COPPE / UFRJ. Between 2000 and 2007 I worked in the Information Technology (IT) field where I acquired extensive experience in workflows and project management. I have solid background in the Databases and my primary interest is Data Science. He currently studies space-time series, parallel and distributed processing, and data preprocessing methods. I am a member of the IEEE, ACM, INNS, and SBC. Throughout my career I have been presenting consistent number of published articles and projects approved by the funding agencies, such as CNPq and FAPERJ. I am also reviewer of several international journals, such as VLDB Journal, IEEE Transactions on Service Computing and The Journal of Systems and Software. Currently, I am heading the Post-Graduate Program in Computer Science (PPCIC) of CEFET / RJ.

Comments are closed.