{"id":39,"date":"2018-02-22T17:55:12","date_gmt":"2018-02-22T17:55:12","guid":{"rendered":"http:\/\/eic.cefet-rj.br\/ladas2018\/?page_id=39"},"modified":"2018-07-08T20:14:32","modified_gmt":"2018-07-08T23:14:32","slug":"program","status":"publish","type":"page","link":"https:\/\/eic.cefet-rj.br\/ladas2018\/program\/","title":{"rendered":"Program"},"content":{"rendered":"<blockquote><p><strong>August 27th, 2018<br \/>\n<\/strong><strong>Room: Seg\u00f3via 1<\/strong><\/p><\/blockquote>\n<table>\n<tbody>\n<tr>\n<td width=\"101\"><strong>09:00-09:15<\/strong><\/td>\n<td width=\"516\"><strong>Opening <\/strong><\/td>\n<\/tr>\n<tr>\n<td width=\"101\"><strong>09:15-10:30<\/strong><\/td>\n<td width=\"516\"><strong>Keynote Speaker:\u00a0<\/strong><strong>Claudia Bauzer Medeiros<br \/>\n<\/strong><strong>Data Science, Open Science &#8211; when and how shall the twain meet?<\/strong><\/td>\n<\/tr>\n<tr>\n<td width=\"101\"><strong>10:30-11:00<\/strong><\/td>\n<td width=\"516\">Coffee Break<\/td>\n<\/tr>\n<tr>\n<td width=\"101\"><strong>11:00-12:30<br \/>\n<\/strong><\/p>\n<p style=\"text-align: center;\"><strong>Technical Session 1<br \/>\n<\/strong><\/p>\n<\/td>\n<td width=\"516\"><strong>1.\u00a0\u00a0\u00a0\u00a0 <\/strong><strong>Scientific Data Analysis Using Data-Intensive Scalable Computing: the SciDISC Project (invited paper)<br \/>\n<\/strong>Patrick Valduriez, Marta Mattoso, Reza Akbarinia, Heraldo Borges, Jos\u00e9 Camata, Alvaro Coutinho, Daniel Gaspar, Noel Lemus, Ji Liu, Hermano Lustosa, Florent Masseglia, Fabricio Nogueira Da Silva, V\u00edtor Silva, Renan Souza, Kary Oca\u00f1a, Eduardo Ogasawara, Daniel de Oliveira, Esther Pacitti, Fabio Porto and Dennis Shasha<br \/>\n<strong>2.\u00a0\u00a0\u00a0\u00a0 <\/strong><strong>An I\/O Performance Evaluation Tool for Distributed Data-Intensive Scientific Applications<br \/>\n<\/strong>Eduardo Camilo Inacio and Mario Antonio Ribeiro Dantas<br \/>\n<strong>3.\u00a0\u00a0\u00a0\u00a0 <\/strong><strong>A Comparative Study on Streaming Frameworks for Big Data<br \/>\n<\/strong>Wissem Inoubli, Sabeur Aridhi, Haithem Mezni, Mondher Maddouri and Engelbert Mephu Nguifo<br \/>\n<strong>4.\u00a0\u00a0\u00a0\u00a0 Big Data Analytics Technologies and Platforms: a brief review<\/strong><strong><br \/>\n<\/strong>Ticiana Coelho Da Silva, Regis Pires, Igo Ramalho Brilhante, Jose Macedo, David Ara\u00fajo, Paulo Rego and Aloisio Vieira Lira Neto<\/td>\n<\/tr>\n<tr>\n<td width=\"101\"><strong>12:30-14:00<\/strong><\/td>\n<td width=\"516\">Lunch Break<\/td>\n<\/tr>\n<tr>\n<td width=\"101\"><strong>14:00-15:30<\/strong><\/p>\n<p style=\"text-align: center;\"><strong>Technical Session 2<\/strong><\/p>\n<\/td>\n<td width=\"516\"><strong>5.\u00a0\u00a0\u00a0\u00a0 <\/strong><strong>Urban Data Consistency in RDF: A Case Study of Curitiba Transportation System<br \/>\n<\/strong>Mirian Halfeld Ferrari, Carmem Hara, Nadia Kozievitch and Flavio Uber<br \/>\n<strong>6.\u00a0\u00a0\u00a0\u00a0 <\/strong><strong>Business Activity Clustering: A Use Case in Curitiba<br \/>\n<\/strong>Yuri Bichibichi, N\u00e1dia Kozievitch, Ricardo Dutra and Artur Ziviani<br \/>\n<strong>7.\u00a0\u00a0\u00a0\u00a0 <\/strong><strong>Influence of Virtual Road Traffic Sensors of Oporto for Origin-Destination Matrix Estimation<br \/>\n<\/strong>Luciano Urgal Pando, Ricardo L\u00fcders, Keiko Veronica Ono Fonseca and Marcelo de Oliveira Rosa<br \/>\n<strong>8.\u00a0\u00a0\u00a0\u00a0 <\/strong><strong>A Multilayer and Time-varying Structural Analysis of the Brazilian Air Transportation Network<br \/>\n<\/strong>Klaus Wehmuth, Bernardo Costa, Jo\u00e3o Victor Bechara and Artur Ziviani<\/td>\n<\/tr>\n<tr>\n<td width=\"101\"><strong>15:30-16:00<\/strong><\/td>\n<td width=\"516\">Coffee Break<\/td>\n<\/tr>\n<tr>\n<td width=\"101\"><strong>16:00-17:10<\/strong><\/p>\n<p style=\"text-align: center;\"><strong>Technical Session 3<\/strong><\/p>\n<\/td>\n<td width=\"516\"><strong>9.\u00a0\u00a0\u00a0\u00a0 <\/strong><strong>Applying term frequency-based indexing to improve scalability and accuracy of probabilistic data linkage<br \/>\n<\/strong>Robespierre Pita, Luan Menezes and Marcos Barreto<br \/>\n<strong>10.\u00a0 <\/strong><strong>ATAnalysis &#8211; Toward a psycholinguistic method to analyze video textual information<br \/>\n<\/strong>Helder Yukio Okuno, Flavio Carvalho, Gustavo Paiva Guedes and Marcelle Torres Alves Okuno<br \/>\n<strong>Short papers:<\/strong><br \/>\n<strong>11.\u00a0 <\/strong><strong>Computation of PDFs on Big Spatial Data: Problem &amp; Architecture<br \/>\n<\/strong>Ji Liu, Noel M. Lemus, Esther Pacitti, Fabio Porto and Patrick Valduriez<br \/>\n<strong>12.\u00a0 <\/strong><strong>Towards a Human-in-the-Loop Library for Tracking Hyperparameter Tuning in Deep Learning Development<br \/>\n<\/strong>Renan Souza, Liliane Neves, Leonardo Azeredo, Ricardo Luiz, Elaine Tady, Paulo Cavalin and Marta Mattoso<br \/>\n<strong>13.\u00a0 <\/strong><strong>A Method to build a Geolocalized Food Price Time Series Knowledge Base analyzable by Everyone<br \/>\n<\/strong>Johyn Papin, Frederic Andres and Laurent D&#8217;Orazio<\/td>\n<\/tr>\n<tr>\n<td width=\"101\"><strong>17:15-17:30<\/strong><\/td>\n<td width=\"516\"><strong>Closing remarks<\/strong><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<hr \/>\n<blockquote><p><strong>Detailed Program<\/strong><\/p><\/blockquote>\n<p><strong>09:00-09:15 \u2013 Opening<\/strong><\/p>\n<p><strong>09:15-10:30:\u00a0<\/strong><strong>Keynote Speaker:\u00a0Claudia Bauzer Medeiros<\/strong><\/p>\n<p style=\"padding-left: 30px;\"><strong>Data Science, Open Science &#8211; when and how shall the twain meet?<\/strong><\/p>\n<p style=\"padding-left: 30px;\">Unicamp &#8211; Campinas, SP, Brazil<\/p>\n<p><strong>11:00-12:30: Technical Session 1<\/strong><\/p>\n<p><strong>11-00-11:20 &#8211; Scientific Data Analysis Using Data-Intensive Scalable Computing: the SciDISC Project (invited paper)<\/strong><\/p>\n<p style=\"padding-left: 30px;\">Patrick Valduriez<sup>1<\/sup>, Marta Mattoso<sup>2<\/sup>, Reza Akbarinia<sup>1<\/sup>, Heraldo Borges<sup>3<\/sup>, Jos\u00e9 Camata<sup>2<\/sup>, Alvaro Coutinho<sup>2<\/sup>, Daniel Gaspar<sup>4<\/sup>, Noel Lemus<sup>4<\/sup>, Ji Liu<sup>1<\/sup>, Hermano Lustosa<sup>4<\/sup>, Florent Masseglia<sup>1<\/sup>, Fabricio Nogueira Da Silva<sup>5<\/sup>, V\u00edtor Silva<sup>2<\/sup>, Renan Souza<sup>2<\/sup>, Kary Oca\u00f1a<sup>4<\/sup>, Eduardo Ogasawara<sup>3<\/sup>, Daniel de Oliveira<sup>5<\/sup>, Esther Pacitti<sup>1<\/sup>, Fabio Porto<sup>4<\/sup> and Dennis Shasha<sup>6<\/sup><\/p>\n<p style=\"padding-left: 30px;\"><sup>1<\/sup>Inria, LIRMM and University Montpellier &#8211; France<br \/>\n<sup>2<\/sup>COPPE\/UFRJ, Rio de Janeiro \u2013 RJ \u2013 Brazil<br \/>\n<sup>3<\/sup>CEFET\/RJ, Rio de Janeiro \u2013 RJ \u2013 Brazil<br \/>\n<sup>4<\/sup>LNCC, Rio de Janeiro \u2013 RJ \u2013 Brazil<br \/>\n<sup>5<\/sup>UFF, Rio de Janeiro \u2013 RJ \u2013 Brazil<br \/>\n<sup>6<\/sup>NYU, New York \u2013 NY \u2013 USA<\/p>\n<p style=\"padding-left: 30px;\"><strong>Abstract<\/strong>: Data-intensive science requires the integration of two fairly different paradigms: high-performance computing (HPC) and data-intensive scalable computing (DISC), as exemplified by frameworks such as Hadoop and Spark. In this context, the SciDISC project addresses the grand challenge of scientific data analysis using DISC, by developing architectures and methods to combine simulation and data analysis. SciDISC is an ongoing project between Inria, several research institutions in Rio de Janeiro and NYU. This paper introduces the motivations and objectives of the project, and reports on the first results achieved so far.<\/p>\n<p><strong>11-20-11:40 &#8211; An I\/O Performance Evaluation Tool for Distributed Data-Intensive Scientific Applications<\/strong><\/p>\n<p style=\"padding-left: 30px;\">Eduardo Camilo Inacio<sup>1<\/sup> and Mario Antonio Ribeiro Dantas<sup>1<\/sup><\/p>\n<p style=\"padding-left: 30px;\"><sup>1<\/sup>UFSC &#8211; Universidade Federal de Santa Catarina, Florian\u00f3polis, SC &#8211; Brazil<\/p>\n<p style=\"padding-left: 30px;\"><strong>Abstract<\/strong>: I\/O performance arises as a major bottleneck in nowadays data-intensive scientific applications. In order to identify execution parameters that provide an improved I\/O performance, experimental efforts relying on synthetic I\/O workload generators are widely employed. Focusing on addressing limitations of currently workload generators, and to provide a more flexible, unified, and user-friendly approach for parallel I\/O performance analysis and optimization, we have proposed a differentiated tool, called IORE. In this paper, we demonstrate IORE applicability for I\/O performance analysis of a dataset-based workload derived from a real-world scientific application. Beyond giving insight on best performing configurations for the referred application workload, our results indicate the potential of IORE as a parallel I\/O experimental tool.<\/p>\n<p><strong>11-40-12:00 &#8211; A Comparative Study on Streaming Frameworks for Big Data<\/strong><\/p>\n<p style=\"padding-left: 30px;\">Wissem Inoubli<sup>1<\/sup>, Sabeur Aridhi<sup>2<\/sup>, Haithem Mezni<sup>3<\/sup>, Mondher Maddouri<sup>4<\/sup> and Engelbert Mephu Nguifo<sup>5<\/sup><\/p>\n<p style=\"padding-left: 30px;\"><sup>1<\/sup>University of Tunis El Manar, Faculty of Sciences of Tunis, LIPAH<br \/>\n<sup>2<\/sup>University of Lorraine, CNRS, Inria, LORIA<br \/>\n<sup>3<\/sup>University of Jendouba, SMART Lab<br \/>\n<sup>4<\/sup>College Of Buisness, University of Jeddah<br \/>\n<sup>5<\/sup>Clermont University, Blaise Pascal University, LIMOS<\/p>\n<p style=\"padding-left: 30px;\"><strong>Abstract: <\/strong>Recently, increasingly large amounts of data are generated from a variety of data sources. Existing data processing technologies are not suitable to cope with the huge amounts of generated data. Yet, many research works focus on streaming in Big Data, a task referring to the processing of massive volumes of structured and\/or unstructured streaming data. Recently proposed streaming frameworks for Big Data applications help to store, analyze and process the continuously captured data. In this paper, we discuss the challenges of streaming Big Data and we survey existing streaming frameworks for Big Data. We also present an experimental evaluation and a comparative study of the most popular streaming frameworks<\/p>\n<p><strong>12-00-12:20 &#8211; Big Data Analytics Technologies and Platforms: a brief review<\/strong><\/p>\n<p style=\"padding-left: 30px;\">Ticiana Coelho Da Silva<sup>1<\/sup>, Regis Pires<sup>1<\/sup>, Igo Ramalho Brilhante<sup>1<\/sup>, Jose Macedo<sup>1<\/sup>, David Ara\u00fajo<sup>1<\/sup>, Paulo Rego<sup>1<\/sup> and Aloisio Vieira Lira Neto<sup>2<\/sup><\/p>\n<p style=\"padding-left: 30px;\"><sup>1<\/sup>Federal University of Ceara, Brazil<br \/>\n<sup>2<\/sup>Brazilian Federal Highway Police<\/p>\n<p style=\"padding-left: 30px;\"><strong>Abstract<\/strong>: A plethora of Big Data Analytics technologies and platforms have been proposed in the last years. However, in 2017, only 53% of companies are adopting such tools. It seems that the industry is not convinced about Big Data promises or maybe choosing the right technology\/platform requires in-depth knowledge about the capabilities of all these tools. Before deciding the right technology or platform to choose from, the organizations have to investigate the application\/algorithm needs and the advantages and drawbacks of each technology\/platform. In this paper, we aim at helping organizations in the selection of technologies\/platforms more appropriate to their analytic processes by offering a short-review according to some categories of Big Data problems as processing (streaming and batch), storage, data integration, analytics, data governance, and monitoring.<\/p>\n<p><strong>14:00-15:30: Technical Session 2<\/strong><\/p>\n<p><strong>14-00-14:20 &#8211; Urban Data Consistency in RDF: A Case Study of Curitiba Transportation System<\/strong><\/p>\n<p style=\"padding-left: 30px;\">Mirian Halfeld Ferrari, Carmem Hara, Nadia Kozievitch and Flavio Uber<\/p>\n<p style=\"padding-left: 30px;\"><sup>1<\/sup>Universite d\u2019Orleans, INSA CVL, Orleans, France<br \/>\n<sup>2<\/sup>Universidade Federal do Parana, Curitiba, PR, Brazil<br \/>\n<sup>3<\/sup>Universidade Tecnologica Federal do Parana, Curitiba, PR, Brazil<br \/>\n<sup>4<\/sup>Universidade Estadual de Maringa, Maringa, PR, Brazil<\/p>\n<p style=\"padding-left: 30px;\"><strong>Abstract<\/strong>:\u00a0Urban Computing has an important role in providing new tools for\u00a0urban mobility. In this paper, integrity constraints and blank nodes are used in\u00a0an RDF database to minimize extra updates (called side effects) to guarantee\u00a0consistency during required updates. A case study using a real scenario on Curitiba\/Brazil\u00a0 transportation database is presented. Experimental results showed\u00a0that our approach performs better and produces more meaningful results when\u00a0compared to a similar strategy.<\/p>\n<p><strong>14-20-14:40 &#8211; Business Activity Clustering: A Use Case in Curitiba<\/strong><\/p>\n<p style=\"padding-left: 30px;\">Yuri Bichibichi<sup>1<\/sup>, N\u00e1dia Kozievitch<sup>1<\/sup>, Ricardo Dutra<sup>1<\/sup> and Artur Ziviani<sup>2<\/sup><\/p>\n<p style=\"padding-left: 30px;\"><sup>1<\/sup>Universidade Tecnol\u00f3gica Federal do Parana (UTFPR), Curitiba, PR, Brazil<br \/>\n<sup>2<\/sup>National Laboratory for Scientific Computing (LNCC), Petropolis, RJ, Brazil<\/p>\n<p style=\"padding-left: 30px;\"><strong>Abstract<\/strong>: In the context of smart cities, the information of businesses licenses has the potential to discriminate economics characteristics of the observed urban environment. This work performs an initial analysis on business activity clustering using the k-means algorithm with data from the granting of business licenses (from 1980 to 2016) in the city of Curitiba &#8211; Brazil.<\/p>\n<p><strong>14-40-15:00 &#8211; Influence of Virtual Road Traffic Sensors of Oporto for Origin-Destination Matrix Estimation<\/strong><\/p>\n<p style=\"padding-left: 30px;\">Luciano Urgal Pando<sup>1<\/sup>, Ricardo L\u00fcders<sup>1<\/sup>, Keiko Veronica Ono Fonseca<sup>1<\/sup> and Marcelo de Oliveira Rosa<sup>1<\/sup><\/p>\n<p style=\"padding-left: 30px;\"><sup>1<\/sup>Federal University of Technology &#8211; Parana (UTFPR)<\/p>\n<p style=\"padding-left: 30px;\"><strong>Abstract<\/strong>: The knowledge of urban mobility patterns is important to maintain good public services as well as to improve city planning. These mobility patterns can be characterized by using expensive fieldwork or analyzing the huge amount of data available from services and environmental monitoring in smart cities. The origin-destination matrix estimation (ODME) aims to estimate the traffic of vehicles between two particular origin and destination areas in the city from traffic observed from sensors installed at roads. This estimation is stated as an optimization problem and solved here by linear programming. The results obtained for sensor data of Porto in Portugal have shown that the number and location of sensors are important issues to be considered.<\/p>\n<p><strong>15-00-15:20 &#8211; A Multilayer and Time-varying Structural Analysis of the Brazilian Air Transportation Network<\/strong><\/p>\n<p style=\"padding-left: 30px;\">Klaus Wehmuth, Bernardo Costa, Jo\u00e3o Victor Bechara and Artur Ziviani<\/p>\n<p style=\"padding-left: 30px;\"><sup>1<\/sup> LNCC &#8211; National Laboratory for Scientific Computing<\/p>\n<p style=\"padding-left: 30px;\"><strong>Abstract<\/strong>: This paper provides a multilayer and time-varying structural analysis of one air transportation network, having the Brazilian air transportation network as a case study. Using a single mathematical object called MultiAspect Graph (MAG) for this analysis, the multi-layer perspective enables the unveiling of the particular strategies of each airline to both establish and adapt in a moment of crisis its specific flight network.<\/p>\n<p><strong>16:00-17:30: Technical Session 3<\/strong><\/p>\n<p><strong>16-00-16:20 &#8211; Applying term frequency-based indexing to improve scalability and accuracy of probabilistic data linkage<\/strong><\/p>\n<p style=\"padding-left: 30px;\">Robespierre Pita<sup>1,2<\/sup>, Luan Menezes<sup>1,2<\/sup> and Marcos Barreto<sup>1,2<\/sup><\/p>\n<p style=\"padding-left: 30px;\"><sup>1<\/sup>UFBA &#8211; Federal University of Bahia, Salvador, BA, Brazil<br \/>\n<sup>2<\/sup>FIOCRUZ &#8211; Oswaldo Cruz Foundation, Salvador, BA, Brazil<\/p>\n<p style=\"padding-left: 30px;\"><strong>Abstract<\/strong>: Record or data linkage is a technique frequently used in diverse domains to aggregate data stored in different sources that presumably pertain to the same real-world entity. Deterministic (key-based) or probabilistic (rule-based) linkage methods can be used to implement data linkage, being the second approach suitable when no common link attributes exist amongst the data sources involved. Depending on the volume of data being linked, indexing (or blocking) techniques should be used to reduce the number of pairwise comparisons that need to be executed to decide if a given pair of records match or not. In this paper, we discuss a new indexing scheme, based on term-frequency counts, deployed in our data linkage tool (AtyImo). We present our algorithm design and some metrics related to accuracy and efficiency (reduction ratio achieved during blocking construction), as well a comparative analysis with a predicate-based technique also used in AtyImo. Our results show a very high level of accuracy and reduction in terms of pairwise comparison tasks.<\/p>\n<p><strong>16-20-16:40 &#8211; ATAnalysis \u2013 Toward a psycholinguistic method to analyze video textual information<\/strong><\/p>\n<p style=\"padding-left: 30px;\">Helder Yukio Okuno, Flavio Carvalho, Gustavo Paiva Guedes and Marcelle Torres Alves Okuno<\/p>\n<p style=\"padding-left: 30px;\"><sup>1<\/sup>CEFET\/RJ, Rio de Janeiro \u2013 RJ \u2013 Brazil<br \/>\n<sup>2<\/sup>EGN &#8211; Escola de Guerra Naval, Rio de Janeiro \u2013 RJ \u2013 Brazil<\/p>\n<p style=\"padding-left: 30px;\"><strong>Abstract<\/strong>: Political statements of world leaders may affect many lives, so it is important to study what they express through language. We propose a method to do psycholinguistic analysis of statements extracted from videos. To show the relevance and some interesting information, we conducted some experiments in video subtitles of world leaders Donald Trump and Kim Jong-un amid imminent agreement that could lead to peace in the Korean peninsula. Results suggest less security in statements of the North Korean leader while threatening to unleash an &#8220;unimaginable strike&#8221; at the US territory. Moreover, the US president shows less honesty by saying he hopes never to use the nuclear arsenal. This approach may be useful in future studies to reveal what the language used by candidates can show.<\/p>\n<p><strong>16-40-16:50 &#8211; Computation of PDFs on Big Spatial Data: Problem &amp; Architecture<\/strong><\/p>\n<p style=\"padding-left: 30px;\">Ji Liu<sup>1<\/sup>, Noel M. Lemus<sup>2<\/sup>, Esther Pacitti<sup>1<\/sup>, Fabio Porto<sup>2<\/sup> and Patrick Valduriez<sup>1<\/sup><\/p>\n<p style=\"padding-left: 30px;\"><sup>1<\/sup>Inria and LIRMM, Univ. of Montpelier, France<br \/>\n<sup>2<\/sup>LNCC Petropolis, Brazil<\/p>\n<p style=\"padding-left: 30px;\"><strong>Abstract<\/strong>: Big spatial data can be produced by observation or numerical simulation programs and correspond to points that represent a 3D soil cube area. However, errors in signal processing and modeling create some uncertainty, and thus a lack of accuracy in identifying geological or seismic phenomenons. To analyze uncertainty, the main solution is to compute a Probability Density Function (PDF) of each point in the spatial cube area, which can be very time consuming. In this paper, we analyze the problem and discuss the use of Spark to efficiently compute PDFs.<\/p>\n<p><strong>16-50-17:00 &#8211; Towards a Human-in-the-Loop Library for Tracking Hyperparameter Tuning in Deep Learning Development<\/strong><\/p>\n<p style=\"padding-left: 30px;\">Renan Souza<sup>1,2<\/sup>, Liliane Neves<sup>1<\/sup>, Leonardo Azeredo<sup>1<\/sup>, Ricardo Luiz, Elaine Tady<sup>1<\/sup>, Paulo Cavalin<sup>2<\/sup> and Marta Mattoso<sup>1<\/sup><\/p>\n<p style=\"padding-left: 30px;\"><sup>1<\/sup>COPPE\/Federal University of Rio de Janeiro<br \/>\n<sup>2<\/sup>IBM Research<\/p>\n<p style=\"padding-left: 30px;\"><strong>Abstract<\/strong>: The development lifecycle of Deep Learning (DL) models requires humans (the model trainers) to analyze and steer the training evolution. They analyze intermediate data, fine-tune hyperparameters, and stop when a resulting model is satisfying. The problem is that existing solutions for DL do not track the trainer actions. There are no explicit data relationship between trainer action with the input data and hyperparameter to the output performance results, throughout the training process. This jeopardizes online training data analyses and post-hoc results reproducibility, reusability, and understanding. This paper presents DL-Steer, our first prototype to aid trainers to fine-tune hyperparameters and for tracking trainer steering actions. Tracked data are stored in a relational database for online and post-hoc data analyses.<\/p>\n<p><strong>17-00-17:10 &#8211; A Method to build a Geolocalized Food Price Time Series Knowledge Base analyzable by Everyone<\/strong><\/p>\n<p style=\"padding-left: 30px;\">Johyn Papin<sup>1<\/sup>, Frederic Andres<sup>2<\/sup> and Laurent D\u2019Orazio<sup>1<\/sup><\/p>\n<p style=\"padding-left: 30px;\"><sup>1<\/sup>Univ Rennes, France<br \/>\n<sup>2<\/sup>NII &#8211; National Institute of Informatics, Japan<\/p>\n<p style=\"padding-left: 30px;\"><strong>Abstract<\/strong>: Time-series analysis is a very challenging concept in Data Science for companies and industries. Harvesting prices of agricultural production (e.g. vegetable, fruit, milk&#8230;) as time series is key to operating reliable dish cost prediction at scale to ensure for example that the market price is valid. In this paper, we describe initial stakeholder needs, the application and engineering contexts in which the challenge of time-serie harvesting and management arose, and theoretical and architectural choices we made to implement a solution of historical food prices to demonstrate the feasibility. For this, we use scrappers through the TOR network. We also propose the knowledge map approach to make the data accessible to any type of users.<\/p>\n<p><strong>17-15-17:30 \u2013 Closing remarks<\/strong><\/p>\n","protected":false},"excerpt":{"rendered":"<p>August 27th, 2018 Room: Seg\u00f3via 1 09:00-09:15 Opening 09:15-10:30 Keynote Speaker:\u00a0Claudia Bauzer Medeiros Data Science, Open Science &#8211; when and&hellip; <\/p>\n","protected":false},"author":0,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-39","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/eic.cefet-rj.br\/ladas2018\/wp-json\/wp\/v2\/pages\/39","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/eic.cefet-rj.br\/ladas2018\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/eic.cefet-rj.br\/ladas2018\/wp-json\/wp\/v2\/types\/page"}],"replies":[{"embeddable":true,"href":"https:\/\/eic.cefet-rj.br\/ladas2018\/wp-json\/wp\/v2\/comments?post=39"}],"version-history":[{"count":22,"href":"https:\/\/eic.cefet-rj.br\/ladas2018\/wp-json\/wp\/v2\/pages\/39\/revisions"}],"predecessor-version":[{"id":140,"href":"https:\/\/eic.cefet-rj.br\/ladas2018\/wp-json\/wp\/v2\/pages\/39\/revisions\/140"}],"wp:attachment":[{"href":"https:\/\/eic.cefet-rj.br\/ladas2018\/wp-json\/wp\/v2\/media?parent=39"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}