Author: Rafaelli Coutinho

Dissertation (December 15, 2025): Vanessa Santos Soares

Student: Vanessa Santos Soares

Title: Avaliação de modelos de aprendizado de máquina para a correção automática de redações segundo as competências do ENEM

Advisors: Eduardo Bezerra da Silva (advisor) and Gustavo Paiva Guedes e Silva (co-advisor)

Committee: Eduardo Bezerra da Silva (Cefet/RJ), Gustavo Paiva Guedes e Silva (Cefet/RJ), Diego Moreira de Araújo Carvalho (Cefet/RJ) e Geraldo Bonorino Xexéo (UFRJ)

Day/Hour: December 15, 2025 / 10 a.m.

Room: Auditório V

Abstract: With the growth of remote education and the implementation of large-scale exams such as ENEM, the automation of essay grading has become an increasing necessity. This work investigates different machine learning strategies for the automatic evaluation of essays written in Portuguese, based on the five assessment competencies defined by ENEM. A total of 9,599 essays were analyzed, collected from the Vestibular Brasil Escola portal, covering 102 topics published between 2009 and 2024. Two main approaches are compared: (i) traditional methods based on TF-IDF and linguistically engineered features extracted from the texts, and (ii) pre-trained language models with fine-tuning (XLM-RoBERTa with LoRA). Model performance is evaluated using the Quadratic Weighted Kappa (QWK) metric, which measures agreement with human raters. The study aims to demonstrate that pre-trained models provide significant improvements in robustness and reliability, outperforming feature-engineering-based approaches. This research contributes to the advancement of Automatic Essay Scoring (AES) in Portuguese by offering a benchmark and comparative analysis that can support future studies and educational applications.

Dissertation (December 8, 2025): Fernando Henrique de Jesus Fraga da Silva

Student: Fernando Henrique de Jesus Fraga da Silva

Title: Aprendizado por Reforço Profundo Aplicado à Negociação Intradiária de Múltiplas Ações

Advisors: Eduardo Bezerra da Silva (advisor) and Pedro Henrique González Silva (co-advisor)

Committee: Eduardo Bezerra da Silva (Cefet/RJ), Pedro Henrique González Silva (UFRJ), Aline Marins Paes Carvalho (UFF) e Glauco Fiorott Amorim (Cefet/RJ)

Day/Hour: December 8, 2025 / 3 p.m.

Room: https://teams.microsoft.com/v2/?meetingjoin=true#/l/meetup-join/19:PKOJTuK7mfHSDE6QkCWQCYp71f0xOMNoRgSUj4wjMKc1@thread.tacv2/1763760050816?context=%7b%22Tid%22%3a%228eeca404-a47d-4555-a2d4-0f3619041c9c%22%2c%22Oid%22%3a%22c03d6068-4733-48a6-bbb4-aa78f351d9cf%22%7d&anon=true&deeplinkId=91733be2-9804-4f09-ac6a-f1a362e67de8

Abstract: The stock market is a dynamic and volatile environment in which publicly traded companies negotiate fractions of their value, subject to continuous price fluctuations influenced by economic, political, and social factors. Anticipating these fluctuations is a complex task, especially in the context of intraday trading, where buy and sell decisions must be made within very short time intervals based on rapidly changing data. In this scenario, Reinforcement Learning (RL) emerges as a promising paradigm capable of developing adaptive strategies through the continuous interaction between agent and environment. This dissertation investigates the use of Deep Reinforcement Learning (DRL) techniques in financial trading, focusing on intraday scenarios involving multiple stocks. It proposes a DRL-based approach to estimate buy and sell actions simultaneously across various assets, using high-granularity market data to better approximate real trading conditions. Experimental analyses were conducted using the Proximal Policy Optimization (PPO) algorithm. The results indicate that the proposed agent outperformed traditional benchmark strategies, achieving gains exceeding 10 percentage points in certain cases.

Algorithms and Graph Based Models

The field of Graph Theory studies the relationships between elements, called nodes, and their connections, known as edges. This area encompasses models ranging from technological networks to social and air transportation networks. Its main subfields include Network Science, which analyzes interactions in complex systems, and Computer Networks, which provide the technological infrastructure for global communication.

Network Science investigates how the structure and dynamics of connections influence the global behavior of a network. Topics such as centrality, robustness, and structural patterns are analyzed to better understand social, economic, and biological networks. The growth of technology and the explosion of data in recent decades have further increased the relevance of this field.

In Computer Networks, defining the network topology is essential for efficient monitoring. This process can be modeled as an optimization problem or analyzed as a Complex Network, using graph-theoretic concepts to study its properties and performance. Moreover, infrastructure management and data communication rely on specific protocols tailored to different applications, such as environmental monitoring, mobile networks, and biomedical systems. The efficiency of these protocols is evaluated using metrics such as packet delivery rate, network throughput, and energy consumption.

This project aims to develop graph-based applications across various domains, combining computational simulation with practical experiments. It also seeks to improve the design and communication within these graph structures, exploring new protocols to make information transmission more efficient and resilient.

Faculty Members Involved:

  • Diego Nunes Brandão (coordinator) 
  • Felipe da Rocha Henriques 
  • Glauco Fiorott Amorim 
  • Helga Dolorico Balbi
  • Laura Silva de Assis 

Smart Applications

Intelligent Applications have become essential for optimizing processes and enabling informed decision-making. Their integration with Robotics, Multimedia, and the Internet of Things (IoT) drives significant innovation across multiple domains.

In Robotics, intelligent applications enhance machine autonomy and interaction, enabling solutions that range from personal assistant robots to advanced surgical systems. A special focus is placed on educational robotics, which combines state-of-the-art technology with playful, interactive approaches to develop intelligent embedded systems and perception algorithms. These solutions are often tested in technology competitions to refine their performance before being applied in educational contexts.

Multimedia has transformed the way information is consumed by integrating video, audio, images, and text with intelligent algorithms. This enables personalized user experiences, speech and image recognition, and immersive virtual reality environments, resulting in more intuitive and multisensory interactions.

In IoT, Artificial Intelligence allows everyday objects to collect and analyze data to create more efficient and secure environments. The convergence of IoT and AI gives rise to AIoT (Artificial Intelligence of Things), which incorporates advanced learning and decision-making capabilities into connected devices.

This research project explores how these technologies can transform teaching and learning, synchronize multisensory effects, and support environmental monitoring, enabling the development of more autonomous and efficient systems.

Faculty Members Involved:

  • Joel Andre Ferreira dos Santos (coordinator) 
  • João Roberto de Toledo Quadros 
  • Glauco Fiorott Amorim 
  • Diego Nunes Brandão

Data Analysis

Data Analysis is a multidisciplinary field focused on interpreting large volumes of information to support decision-making, strategy development, and innovation. Statistical and machine learning techniques are employed to identify patterns and forecast future events, encompassing structured, semi-structured, and unstructured data.

For structured data, the main challenges involve analyzing time series and spatiotemporal data, including prediction, pattern discovery, and adaptation to data drift. Methods such as filtering and decomposition are used to build robust models for forecasting. The detection of events in time series, such as anomalies and regime changes, is relevant for both retrospective and real-time analysis.

When dealing with semi-structured and unstructured data, challenges include text mining and natural language processing (NLP). Text mining aims to uncover patterns and trends through statistical learning and text vectorization, supporting applications such as sentiment analysis and affective computing, which studies emotions in texts and human interactions. In this project, text mining is closely linked to affective computing and behavioral analysis, also encompassing image and video processing.

Behavioral analysis examines individuals within social networks, using graph-based models to identify communities and understand interaction dynamics. Applications include targeted marketing and information diffusion, providing insights into collective and emotional patterns within human interactions.

Faculty Members Involved:

  • Eduardo Soares Ogasawara (coordinator) 
  • Eduardo Bezerra da Silva 
  • Gustavo Paiva Guedes e Silva 
  • Jorge de Abreu Soares 
  • Kele Teixeira Belloze

Software Engineering

Software Engineering is the field that studies and applies scientific and technological methods to the software life cycle, ensuring systematic and disciplined approaches to development. With the growing reliance on software in smartphones, computers, and wearable devices, the quality and security of these systems have become fundamental. Furthermore, emerging technologies such as Artificial Intelligence, the Internet of Things (IoT), Blockchain, and Virtual Reality impose new challenges on software engineering.

This research project investigates how software engineering can be applied to these technologies to maximize their societal benefits. In the context of Blockchain, for instance, smart contracts enable innovative services, but code vulnerabilities can lead to million-dollar losses, making security a critical concern. In IoT, security is equally essential, as failures can compromise hardware or even endanger human lives. Developing secure, scalable, and reliable systems thus becomes a central challenge within Software Engineering.

Educational games are another important application, supporting learning through exploration within the game environment. The use of data provenance makes it possible to analyze player actions, revealing their behavior and strategies.

This project also welcomes additional investigations into emerging technologies and their societal impact, exploring innovative approaches to software development.

Faculty Members Involved: 

  • Diogo Silveira Mendonça (coordinator) 
  • Joel André Ferreira dos Santos 

Machine Learning and Optimization

Machine Learning (ML) is a branch of Artificial Intelligence dedicated to developing new algorithms and methodologies capable of identifying patterns and making decisions without explicit programming. Beyond practical applications, progress in this field depends on creating novel theoretical and computational approaches that enhance the efficiency, interpretability, and generalization capacity of models.

This research project investigates advanced ML methods, spanning traditional techniques, such as deep neural networks and probabilistic models, to emerging approaches including self-supervised learning, generative models, federated learning, and reinforcement learning. Additionally, the project aims to improve strategies for explainability and interpretability to make models more transparent and trustworthy, especially in critical applications.

A second fundamental pillar of this project is Optimization, a field that integrates with ML to improve model performance and solve complex problems across different domains. The project focuses on the design and application of methods for solving problems using linear, nonlinear, integer, and mixed-integer programming (through exact and/or heuristic methods), as well as bio-inspired metaheuristics such as ant colony optimization, genetic algorithms, and particle swarm optimization. Optimization techniques are applied to tasks such as tuning machine learning model parameters, feature selection, and neural network architecture design.

Finally, Affective Computing explores how ML algorithms can interpret, process, and respond to human emotional states. This includes investigating new methods for fusing physiological and emotional signals. The goal is to advance the development of systems capable of adapting their responses in more natural and empathetic ways, with applications ranging from conversational interfaces to interactive robotics.

Faculty Members Involved: 

  • Eduardo Bezerra da Silva (coordinator) 
  • Gustavo Paiva Guedes e Silva 
  • Diogo Silveira Mendonça 
  • Diego Moreira de Araújo Carvalho 
  • Laura Silva de Assis

Database Management and Administration

The growing volume of data requires organizations to develop strategies for extracting valuable insights and gaining competitive advantage. This process involves the collection, storage, integration, and analysis of structured, semi-structured, and unstructured data. The research investigates methodologies for managing and transforming these data into useful knowledge to support decision-making.

The focus lies on data-centric artificial intelligence (Data-Centric AI) for data preparation and on large-scale processing techniques. One of the challenges addressed is the parallel and distributed processing of massive volumes of heterogeneous data, common in fields such as bioinformatics, astronomy, and engineering. Scientific workflows are essential for these experiments and are frequently executed on clusters, supercomputers, and cloud environments.

The project also explores frameworks such as Apache Spark, optimizing workflows for large-scale data analysis and management. In addition, it investigates conceptual modeling techniques, ontologies, preprocessing, indexing, and querying in Big Data systems. The research considers approaches based on distributed storage (HDFS), NoSQL databases, NewSQL systems, and object-relational databases, aiming to enhance the efficiency of data handling and analysis.

Faculty Members Involved:

  • Rafaelli de Carvalho Coutinho (coordinator)  
  • Eduardo Soares Ogasawara 
  • Diego Moreira de Araújo Carvalho 
  • Jorge de Abreu Soares 
  • Kele Teixeira Belloze