Systematic Review Data
The systematic review data for the paper On the Relevancy of Data Science for Flight Delay Research can be found at survey-analysis.xls.
The possibility for the reader to be able to reproduce all the results presented in papers is significant for the scientific method. Initiatives that publishes methods and experimental evaluation using active documents (such as Jupyter notebook) are relevant for support reproducibility. We have provided an example (analytics-example.ipynb) of a reproducible code that enables the comprehension of some data analytics methods presented in the paper.
Goal: The Data Analysis workshop encompasses a set of data mining techniques aimed at extracting knowledge from data. The process of knowledge extraction includes exploratory data analysis, preprocessing and prediction. This short course is contextualized using the R language. Proposal: Introduction Basics of R Exploratory data analysis Data Preprocessing Regression Slides Code examples: Basics of […]Continue reading →
Study of data mining techniques, i.e., extraction of knowledge from large volumes of data. The knowledge extraction process includes exploratory analysis, data preprocessing, clustering, and prediction. This short-course is regularly offered once a year at LNCC under the collaboration between CEFET/RJ and LNCC. Fill this form to request access to the course. Slides and schedule available […]Continue reading →
Authors: Heraldo Borges, Murillo Dutra, Rafaelli~Coutinho, Fábio Perosi, Amin Bazaz, Florent~Masseglia, Esther Pacitti, Fábio Porto, Eduardo Ogasawara Abstract: Discovering motifs in time series data has been widely explored. Various techniques have been developed to tackle this problem. However, when it comes to spatial-time series, a clear gap can be observed according to the literature review. […]Continue reading →
Título: Oportunidades na Ciência da Computação: Uma visão naperspectiva de Ciência de Dados Fórum: Escola Municipal Victor Hugo Data: December / 2018 Local: Rio de Janeiro, RJ Resumo: O Brasil atualmente ocupa o sexto maior mercado mundial de tecnologia da informação e comunicação (TIC) (ABES 2016). Estima-se que o setor de TIC tenha movimentado US$ 152 […]Continue reading →
Title: Comparing Motif Discovery Techniques with Sequence Mining in the Context of Space-Time Series Venue: INRIA / LIRMM / University of Montpellier Date: November / 2018 Location: Montpellier, France Abstract: A relevant area that is being explored in time series analysis community is finding patterns. Patterns are sub-sequences of time series that are related to some special […]Continue reading →
Student: Heraldo Pimenta Borges Filho (firstname.lastname@example.org) Advisor: Eduardo Ogasawara (email@example.com) Description: The package STMotif allows performing research of motif in spatial-time series. A motif is a previously unknown subsequence of a (spatial) time series with a relevant number of occurrences. The main purpose is to find a way to handle the issue of large amounts […]Continue reading →
The LADaS 2018 Workshop (Latin America Data Science Workshop) was organized in conjunction with the VLDB 2018 (Very Large Data Bases) at Rio de Janeiro on August 27th. Scope: Dealing with the data deluge produced nowadays in different areas, ranging from basic sciences to billions of users of Global Internet services, emerges as one of […]Continue reading →
Title: Detecção de Anomalias Frequentes no Transporte Rodoviário Urbano Venue: SBBD 2018 Date: August / 2018 Location: Rio de Janeiro, RJ – Brasil Abstract: The growth of urban population and, consequently, the number of vehicles causes the increase of traffic jams and emission of polluting gases. In this context, we observe the intensification of papers that aim […]Continue reading →
Title: Rumo à Otimização de Operadores sobre UDF no Spark Venue: SBBD 2018 Date: August / 2018 Location: Rio de Janeiro, RJ – Brasil Abstract: Workflows emerged as a basic abstraction for structuring data analysis experiments in the current Data Intensive Scalable Computing (DISC) scenario. In many situations, these workflows are intensive, either computationally or in relation […]Continue reading →