Data Mining Course – 2018 – T2

Study of data mining techniques, i.e., knowledge discovery from data (KDD). The KDD process includes exploratory data analysis, preprocessing, outlier identification, clustering, prediction, frequent pattern mining, and data warehouses.

  1. G. James, D. Witten, T. Hastie, and R. Tibshirani, 2013, An Introduction to Statistical Learning: with Applications in R. 1 ed. Springer.


  2. J. Han, M. Kamber, and J. Pei, 2011, Data Mining: Concepts and Techniques. 3 ed. Haryana, India; Burlington, MA, Morgan Kaufmann.


  3. S. García, J. Luengo, and F. Herrera, 2015, Data Preprocessing in Data Mining.


  4. C. Aggarwal and J. Han, eds., 2014, Frequent Pattern Mining. New York, Springer.


  5. K.J. Keen, 2018, Graphics for Statistics and Data Analysis with R. 2 ed. Boca Raton, Chapman and Hall/CRC.


  6. B. Lantz, 2013, Machine Learning with R. Birmingham, Packt Publishing.


  7. J.P. Lander, 2017, R for Everyone: Advanced Analytics and Graphics. 2 ed. Boston, MA, Addison-Wesley Professional.


  8. R.H. Shumway and D.S. Stoffer, 2017, Time Series Analysis and Its Applications: With R Examples. 4 ed. New York, NY, Springer.

Slides and schedule are available on Moodle.

Fill out this form to get access to the course.

About Eduardo Ogasawara
I have been a Professor in the Computer Science Department of the Federal Center for Technological Education of Rio de Janeiro (CEFET / RJ) since 2010. I hold a PhD in Systems Engineering and Computer Science from COPPE / UFRJ. Between 2000 and 2007 I worked in the Information Technology (IT) field, where I acquired extensive experience in workflows and project management. I have a solid background in databases, and my primary interest is Data Science. I currently study spatio-temporal series, parallel and distributed processing, and data preprocessing methods. I am a member of the IEEE, ACM, INNS, and SBC. Throughout my career I have maintained a consistent record of published articles and of projects approved by funding agencies such as CNPq and FAPERJ. I am also a reviewer for several international journals, such as VLDB Journal, IEEE Transactions on Services Computing, and The Journal of Systems and Software. Currently, I head the Post-Graduate Program in Computer Science (PPCIC) of CEFET / RJ.
