Chiron workflow engine

Chiron is a workflow execution engine designed to execute workflows in parallel in High Performance Computing (HPC) environment. A major goal of Chiron is to take a workflow specification and provide for data parallelism automatically with runtime query provenance support. Data is fragmented from a set of parameter sweep combinations or input dataset. Parallel processing is obtained in a MapReduce (Hadoop) style, however, Chiron engine is supported by a workflow algebra, which allows for optimization, dynamic scheduling and runtime workflow steering. Some additional libraries are necessary to execute Chiron such as the JDBC drivers to connect with PostgreSQL database and MPJ libraries.

The Chiron workflow engine was first created at the COPPE Institute from The Federal University of Rio de Janeiro in the NACAD Lab with Marta Mattoso‘s research group in March 2012. Recent contributions have been made by CEFET/RJ and students from the Polytechnic School of the Federal University of Rio de Janeiro, in addition to developers from Marta Mattoso’s research Chiron team. The Chiron developers welcome contributions in the form of patches and bug reports (preferably with a minimal test case that reliably reproduces the error) to the official mailing lists. Many thanks to SourceForge and SVN for hosting the project. You can find out what is currently happening in the development branch by checking out the homepage, and you can see how many people are downloading the library on the statistics page.

Authors: Eduardo Ogasawara, Jonas Dias, Vitor Silva, Fernando Chirigati, Daniel de Oliveira, Fabio Porto, Patrick Valduriez, and Marta Mattoso

Main paper: https://dx.doi.org/10.1002/cpe.3032

Project homepage: http://chironengine.sourceforge.net/index.php/home

 

Eduardo Ogasawara

I am a Professor of the Computer Science Department of the Federal Center for Technological Education of Rio de Janeiro (CEFET / RJ) since 2010. I hold a PhD in Systems Engineering and Computer Science at COPPE / UFRJ. Between 2000 and 2007 I worked in the Information Technology (IT) field where I acquired extensive experience in workflows and project management. I have solid background in the Databases and my primary interest is Data Science. He currently studies space-time series, parallel and distributed processing, and data preprocessing methods. I am a member of the IEEE, ACM, INNS, and SBC. Throughout my career I have been presenting consistent number of published articles and projects approved by the funding agencies, such as CNPq and FAPERJ. I am also reviewer of several international journals, such as VLDB Journal, IEEE Transactions on Service Computing and The Journal of Systems and Software. Currently, I am heading the Post-Graduate Program in Computer Science (PPCIC) of CEFET / RJ.