gstsm: Generalized Spatial-Time Sequence Miner

Spatio-temporal patterns bring knowledge about sequences of events, the location, and the moment they occur. Finding such patterns is a complex task and of great value for different domains. However, not all patterns are frequent across an entire dataset, occurring instead with frequency in restricted space and time. This work formalizes the Mining of Restricted Sequences in Space and Time, without the use of threshold restrictions for time and space. This allows sequences of different lengths, time intervals, and three-dimensional space to present such patterns. It also provides validation with an implementation tested on a real seismic dataset, resulting in a sensitivity analysis and evaluation of resource usage that indicate the validity and feasibility of the solution.

Implementations of the algorithms are presented in the article Generalized Spatial-Time Sequence Miner, original title: Generalização de Mineração de Sequências Restritas no Espaço e no Tempo. In: XXXVI SBBD – Brazilian Symposium on Databases, 2021 <doi:10.5753/sbbd.2021.17891>).

Available at CRAN: https://CRAN.R-project.org/package=gstsm

Code repository at GitHub: https://github.com/cefet-rj-dal/gstsm

Authors: Castro, Antonio; Borges, Heraldo; Pacitti, Esther; Porto, Fabio; Coutinho, Rafaelli; Ogasawara, Eduardo

Eduardo Ogasawara

Eduardo Ogasawara has been a professor at the Department of Computer Science at the Federal Center for Technological Education of Rio de Janeiro (CEFET/RJ) since 2010. He holds a D.Sc. in Systems and Computer Engineering from COPPE/UFRJ. Between 2000 and 2007, he worked in the Information Technology (IT) sector, gaining extensive experience in workflows and project management. With a strong background in Data Science, he is currently focused on Data Mining and Time Series Analysis. He is a member of IEEE, ACM, and SBC. Throughout his career, he has authored numerous published articles and led projects funded by agencies such as CNPq and FAPERJ. Currently, he heads the Data Analytics Lab (DAL) at CEFET/RJ, where he continues to advance research in Data Science.