Short course on Data Analytics

Study of data mining techniques, i.e., extraction of knowledge from large volumes of data. The knowledge extraction process includes exploratory data analysis, preprocessing, identification of outliers, grouping, classification, frequent patterns and data warehouses.

  1. Han, M. Kamber, and Pei J. Data Mining: Concepts and Techniques, Morgan Kaufmann Publisher, Burlington, MA, USA, 3rd Edition, 2011.
  2. Zaki, M.J. and Jr., W.M. Data Mining and Analysis: Fundamental Concepts and Algorithms, Cambridge University Press, Cambridge, United Kingdom, 1st Edition, 2014.
  3. Witten, I.H., Frank, E. and Hall M.A., Data Mining: Practical Machine Learning Tools and Techniques, Morgan Kaufmann Publishers, Burlington, MA, USA, 3rd Edition, 2011.
  4. Hastie, T., Tibshirani, R., Friedman, J., (2011), The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer Publishing, New York, USA, 2 edition: 2013.
  5. James, G., Witten, D., Hastie, T., Tibshirani, R., (2013), An Introduction to Statistical Learning: with Applications in R., Springer Publishing, New York, USA, 1 edition: 2013.
  6. Lantz, B., (2013), Machine Learning with R. Packt Publishing Publishing, United Kingdom, 1st Edition, 2013.
  7. Leskovec, J., Rajaraman, A., Ullman, J.D., (2015), Mining of Massive Datasets. Cambridge University Press, Cambridge, United Kingdom, 2nd Edition, 2015.
  8. Shumway, R.H., Stoffer, D. S., (2010), Time Series Analysis and Its Applications: With Examples. Publisher Springer, New York, USA, 3 edition: 2010.

This short-course is regularly offered once a year at LNCC under the collaboration between CEFET/RJ and LNCC.

Slides and schedule available at Moodle.


About Eduardo Ogasawara
I am a Professor of the Computer Science Department of the Federal Center for Technological Education of Rio de Janeiro (CEFET / RJ) since 2010. I hold a PhD in Systems Engineering and Computer Science at COPPE / UFRJ. Between 2000 and 2007 I worked in the Information Technology (IT) field where I acquired extensive experience in workflows and project management. I have solid background in the Databases and my primary interest is Data Science. He currently studies space-time series, parallel and distributed processing, and data preprocessing methods. I am a member of the IEEE, ACM, INNS, and SBC. Throughout my career I have been presenting consistent number of published articles and projects approved by the funding agencies, such as CNPq and FAPERJ. I am also reviewer of several international journals, such as VLDB Journal, IEEE Transactions on Service Computing and The Journal of Systems and Software. Currently, I am heading the Post-Graduate Program in Computer Science (PPCIC) of CEFET / RJ.

Comments are closed.