{"id":1587,"date":"2023-07-18T09:59:31","date_gmt":"2023-07-18T12:59:31","guid":{"rendered":"https:\/\/eic.cefet-rj.br\/~eogasawara\/?p=1587"},"modified":"2026-03-15T13:44:18","modified_gmt":"2026-03-15T16:44:18","slug":"data-mining","status":"publish","type":"page","link":"https:\/\/eic.cefet-rj.br\/~eogasawara\/data-mining\/","title":{"rendered":"Minera\u00e7\u00e3o de Dados"},"content":{"rendered":"<p><strong>Ementa<\/strong><\/p>\n<p>Minera\u00e7\u00e3o de Dados. Processo de Descoberta de Conhecimento em Bases de Dados (KDD). An\u00e1lise explorat\u00f3ria de dados. Pr\u00e9-processamento e qualidade de dados. Minera\u00e7\u00e3o de padr\u00f5es frequentes e sequenciais. T\u00e9cnicas de agrupamento. Modelagem preditiva por classifica\u00e7\u00e3o e regress\u00e3o. Arquiteturas anal\u00edticas para suporte \u00e0 minera\u00e7\u00e3o de dados, incluindo data warehouses, data lakes e OLAP. Aspectos \u00e9ticos, de privacidade e responsabilidade no uso de t\u00e9cnicas de minera\u00e7\u00e3o de dados..<\/p>\n<p><strong>Objetivos<\/strong><\/p>\n<p>Fundamentar os conhecimentos indispens\u00e1veis \u00e0 extra\u00e7\u00e3o sistem\u00e1tica de conhecimento a partir de grandes volumes de dados, com foco no processo de Descoberta de Conhecimento em Bases de Dados. Para isso, \u00e9 realizado um estudo detalhado das etapas do KDD, desde a compreens\u00e3o do dom\u00ednio e o pr\u00e9-processamento dos dados at\u00e9 a aplica\u00e7\u00e3o, avalia\u00e7\u00e3o e interpreta\u00e7\u00e3o de modelos de minera\u00e7\u00e3o. O curso visa proporcionar um s\u00f3lido embasamento te\u00f3rico aliado \u00e0 pr\u00e1tica com ferramentas computacionais modernas, capacitando o aluno a selecionar, aplicar e analisar t\u00e9cnicas de minera\u00e7\u00e3o de dados em diferentes contextos, bem como a compreender limita\u00e7\u00f5es, impactos e implica\u00e7\u00f5es \u00e9ticas associadas ao uso dessas t\u00e9cnicas em cen\u00e1rios reais e multidisciplinares.<\/p>\n<p>&nbsp;<\/p>\n<hr \/>\n<p>&nbsp;<\/p>\n<p><strong>Slides<\/strong><\/p>\n<ol>\n<li>Introdu\u00e7\u00e3o \u00e0 Minera\u00e7\u00e3o de Dados &#8211; Vis\u00e3o geral do curso e do papel da metodologia cient\u00edfica na minera\u00e7\u00e3o de dados. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/01-Introduction.pdf\">PDF<\/a><\/li>\n<li>Fundamentos da Linguagem R &#8211; Fundamentos da linguagem R e ferramentas necess\u00e1rias para an\u00e1lises reprodut\u00edveis. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/02-R-Basics.pdf\">PDF<\/a> | C\u00f3digo: <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/examples\/02-R-Basics.md\">examples\/02-R-Basics.md<\/a><\/li>\n<li>Visualiza\u00e7\u00e3o de Dados &#8211; Princ\u00edpios e exemplos de visualiza\u00e7\u00e3o de dados para explorar padr\u00f5es iniciais. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/03-DataVisualization.pdf\">PDF<\/a> | C\u00f3digo: <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/examples\/03-DataVisualization.md\">examples\/03-DataVisualization.md<\/a><\/li>\n<li>An\u00e1lise Explorat\u00f3ria de Dados &#8211; Estrat\u00e9gias de an\u00e1lise explorat\u00f3ria para entender distribui\u00e7\u00f5es, correla\u00e7\u00f5es e outliers. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/04-ExploratoryAnalysis.pdf\">PDF<\/a> | C\u00f3digo: <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/examples\/04-ExploratoryAnalysis.md\">examples\/04-ExploratoryAnalysis.md<\/a><\/li>\n<li>Pr\u00e9-processamento de Dados Fundamentos &#8211; T\u00e9cnicas de limpeza, normaliza\u00e7\u00e3o e prepara\u00e7\u00e3o dos dados antes da modelagem. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/05-DataPreprocessing.pdf\">PDF<\/a> | C\u00f3digo: <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/examples\/05-DataPreprocessing.md\">examples\/05-DataPreprocessing.md<\/a><\/li>\n<li>Minera\u00e7\u00e3o de Padr\u00f5es &#8211; Descoberta de padr\u00f5es frequentes e regras de associa\u00e7\u00e3o em conjuntos de dados. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/06-PatternMining.pdf\">PDF<\/a> | C\u00f3digo: <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/examples\/06-PatternMining.md\">examples\/06-PatternMining.md<\/a><\/li>\n<li>Classifica\u00e7\u00e3o \u2013 Introdu\u00e7\u00e3o e Fundamentos &#8211; Modelos supervisionados para classifica\u00e7\u00e3o e avalia\u00e7\u00e3o de desempenho preditivo. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/07-Classification.pdf\">PDF<\/a> | C\u00f3digo: <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/examples\/07-Classification.md\">examples\/07-Classification.md<\/a><\/li>\n<li>Classifica\u00e7\u00e3o \u2013 Conceitos Avan\u00e7ados &#8211; Uma explora\u00e7\u00e3o abrangente dos conceitos fundamentais de classifica\u00e7\u00e3o em ci\u00eancia de dados e aprendizado de m\u00e1quina. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/08-Classification-Advanced.pdf\">PDF<\/a> | C\u00f3digo: <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/examples\/08-Classification-Advanced.md\">examples\/08-Classification-Advanced.md<\/a><\/li>\n<li>Regress\u00e3o &#8211; Fundamentos &#8211; Uma introdu\u00e7\u00e3o clara aos fundamentos da an\u00e1lise de regress\u00e3o, explorando os conceitos essenciais, tipos de modelos e aplica\u00e7\u00f5es pr\u00e1ticas na ci\u00eancia de dados. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/09-Regression.pdf\">PDF<\/a> | C\u00f3digo: <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/examples\/09-Regression.md\">examples\/09-Regression.md<\/a><\/li>\n<li>Clustering &#8211; Fundamentos &#8211; M\u00e9todos de agrupamento n\u00e3o supervisionado para encontrar estruturas naturais nos dados. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/10-Clustering.pdf\">PDF<\/a> | C\u00f3digo: <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/examples\/10-Clustering.md\">examples\/10-Clustering.md<\/a><\/li>\n<li>Outliers &#8211; Uma introdu\u00e7\u00e3o abrangente aos conceitos, tipos e desafios na detec\u00e7\u00e3o de anomalias em dados. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/11-Outliers.pdf\">PDF<\/a><\/li>\n<li>Gest\u00e3o de Dados &#8211; Bem-vindo ao mundo da gest\u00e3o de dados empresariais, onde informa\u00e7\u00f5es estrat\u00e9gicas impulsionam decis\u00f5es inteligentes e transformam neg\u00f3cios. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/12-DataManagement.pdf\">PDF<\/a><\/li>\n<li>Deep Learning: Vis\u00e3o Geral &#8211; Deep Learning representa o aprendizado com redes neurais profundas, onde a caracter\u00edstica central \u00e9 o aprendizado autom\u00e1tico de representa\u00e7\u00f5es. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/13-DeepLearning.pdf\">PDF<\/a><\/li>\n<\/ol>\n<p><strong>DAL Toolbox (vis\u00e3o pr\u00e1tica)<\/strong><\/p>\n<ol>\n<li>d01. DAL Toolbox &#8211; Vis\u00e3o pr\u00e1tica das ferramentas do DAL Toolbox. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/d01-daltoolbox.pdf\">PDF<\/a> | C\u00f3digo: <a href=\"https:\/\/github.com\/cefet-rj-dal\/daltoolbox\/tree\/main\/examples\/transf\">examples\/transf<\/a><\/li>\n<li>d02. DAL Toolbox &#8211; Tutorial pr\u00e1tico do DAL Toolbox. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/d02-tutorial.pdf\">PDF<\/a> | C\u00f3digo: <a href=\"https:\/\/github.com\/cefet-rj-dal\/daltoolbox\/tree\/main\/examples\/tutorial\">examples\/tutorial<\/a><\/li>\n<li>d03. DAL Toolbox &#8211; Casos de pr\u00e9-processamento com DAL Toolbox. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/d03-data-preprocessing.pdf\">PDF<\/a> | C\u00f3digo: <a href=\"https:\/\/github.com\/cefet-rj-dal\/daltoolbox\/tree\/main\/examples\/transf\">examples\/transf<\/a><\/li>\n<li>d04. DAL Toolbox &#8211; Classifica\u00e7\u00e3o. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/d04-classification.pdf\">PDF<\/a> | C\u00f3digo: <a href=\"https:\/\/github.com\/cefet-rj-dal\/daltoolbox\/tree\/main\/examples\/classification\">examples\/classification<\/a><\/li>\n<li>d05. DAL Toolbox &#8211; Regress\u00e3o. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/d05-regression.pdf\">PDF<\/a> | C\u00f3digo: <a href=\"https:\/\/github.com\/cefet-rj-dal\/daltoolbox\/tree\/main\/examples\/regression\">examples\/regression<\/a><\/li>\n<li>d06. DAL Toolbox &#8211; Clustering. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/d06-clustering.pdf\">PDF<\/a> | C\u00f3digo: <a href=\"https:\/\/github.com\/cefet-rj-dal\/daltoolbox\/tree\/main\/examples\/clustering\">examples\/clustering<\/a><\/li>\n<li>d07. DAL Toolbox &#8211; Gr\u00e1ficos. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/d07-graphics.pdf\">PDF<\/a> | C\u00f3digo: <a href=\"https:\/\/github.com\/cefet-rj-dal\/daltoolbox\/tree\/main\/examples\/graphics\">examples\/graphics<\/a><\/li>\n<li>d08. DAL Toolbox &#8211; Exemplos customizados. <a href=\"https:\/\/github.com\/eogasawara\/datamining\/blob\/main\/d08-custom.pdf\">PDF<\/a> | C\u00f3digo: <a href=\"https:\/\/github.com\/cefet-rj-dal\/daltoolbox\/tree\/main\/examples\/custom\">examples\/custom<\/a><\/li>\n<\/ol>\n<p><strong>Reposit\u00f3rio<\/strong><\/p>\n<p><a href=\"https:\/\/github.com\/eogasawara\/datamining\">https:\/\/github.com\/eogasawara\/datamining<\/a><\/p>\n<p>&nbsp;<\/p>\n<hr \/>\n<p><strong>Playlist<\/strong><\/p>\n<p class=\"responsive-video-wrap clr\"><iframe loading=\"lazy\" title=\"\ud83d\udcca Minera\u00e7\u00e3o de Dados \u2013 Fundamentos e Algoritmos\" width=\"1200\" height=\"675\" src=\"https:\/\/www.youtube.com\/embed\/videoseries?list=PLTy3TWJeueGw1eeP1cvaIiTzJrIljcVAq\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<hr \/>\n<p>&nbsp;<\/p>\n<p><strong>Ferramentas<\/strong><\/p>\n<p>DAL Toolbox: <a href=\"https:\/\/cefet-rj-dal.github.io\/daltoolbox\/\">https:\/\/cefet-rj-dal.github.io\/daltoolbox\/<\/a><\/p>\n<p>Harbinger: <a href=\"https:\/\/cefet-rj-dal.github.io\/harbinger\/\">https:\/\/cefet-rj-dal.github.io\/harbinger\/<\/a><\/p>\n<p>TSPredIT: <a href=\"https:\/\/cefet-rj-dal.github.io\/tspredit\/\">https:\/\/cefet-rj-dal.github.io\/tspredit\/<\/a><\/p>\n<p>&nbsp;<\/p>\n<hr \/>\n<p>&nbsp;<\/p>\n<p><strong>Bibliografia\u00a0<\/strong><strong>B\u00e1sica<\/strong><\/p>\n<ol>\n<li>HAN, Jiawei; KAMBER, Micheline; PEI, Jian. Data mining: concepts and techniques. 4. ed. Cambridge, MA: Morgan Kaufmann, 2022.<\/li>\n<li>JAMES, Gareth; WITTEN, Daniela; HASTIE, Trevor; TIBSHIRANI, Robert. An introduction to statistical learning: with applications in R. 2. ed. New York: Springer, 2021.<\/li>\n<li>Escovedo, T.; Koshiyama, A. Introdu\u00e7\u00e3o a Data Science: Algoritmos de Machine Learning e m\u00e9todos de an\u00e1lise. Casa do C\u00f3digo, 2020.<\/li>\n<\/ol>\n<p><strong>Bibliografia\u00a0Complementar<\/strong><\/p>\n<ol>\n<li>BISHOP, C. M.; Bishop, H. Deep Learning: Foundations and Concepts. Springer Nature, 2023.<\/li>\n<li>BRAMER, M. Principles of Data Mining. Springer London, 2020.<\/li>\n<li>GARCIA, S.; LUENGO, J.; HERRERA, F. Data Preprocessing in Data Mining. Springer, 2014.<\/li>\n<li>GOLDSCHMIDT, R.; PASSOS, E.; BEZERRA, E. Data Mining. Elsevier Brasil, 2015.<\/li>\n<\/ol>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Ementa Minera\u00e7\u00e3o de Dados. Processo de Descoberta de Conhecimento em Bases de Dados (KDD). An\u00e1lise explorat\u00f3ria de dados. Pr\u00e9-processamento e qualidade de dados. Minera\u00e7\u00e3o de padr\u00f5es frequentes e sequenciais. T\u00e9cnicas de agrupamento. Modelagem preditiva por classifica\u00e7\u00e3o e regress\u00e3o. Arquiteturas anal\u00edticas para suporte \u00e0 minera\u00e7\u00e3o de dados, incluindo data warehouses, data lakes e OLAP. Aspectos \u00e9ticos, [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"class_list":["post-1587","page","type-page","status-publish","hentry","entry"],"_links":{"self":[{"href":"https:\/\/eic.cefet-rj.br\/~eogasawara\/wp-json\/wp\/v2\/pages\/1587","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/eic.cefet-rj.br\/~eogasawara\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/eic.cefet-rj.br\/~eogasawara\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/eic.cefet-rj.br\/~eogasawara\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/eic.cefet-rj.br\/~eogasawara\/wp-json\/wp\/v2\/comments?post=1587"}],"version-history":[{"count":34,"href":"https:\/\/eic.cefet-rj.br\/~eogasawara\/wp-json\/wp\/v2\/pages\/1587\/revisions"}],"predecessor-version":[{"id":2685,"href":"https:\/\/eic.cefet-rj.br\/~eogasawara\/wp-json\/wp\/v2\/pages\/1587\/revisions\/2685"}],"wp:attachment":[{"href":"https:\/\/eic.cefet-rj.br\/~eogasawara\/wp-json\/wp\/v2\/media?parent=1587"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}