BioMAJ is a dedicated workflow engine designed for efficient management of biological databanks.
Biological knowledge in a genomic or post-genomic context is primarily based on transitive bioinformatics analysis. This involves an iterative and periodic comparison of newly produced data against a corpus of known biological information. In large-scale projects, this approach requires accurate bioinformatics software, pipelines, interfaces, and numerous heterogeneous biological banks that are distributed around the world. However, the integration process that consists of mirroring and indexing these data is typically a significant bottleneck in most bioinformatics projects.
BioMAJ addresses this bottleneck. It is a robust, flexible, fully automated environment for data mirroring and indexing, so that projects can keep their local banks up to date without manual intervention.
Key synchronization features include support for multiple remote protocols (FTP, HTTP, rsync, local copy), powerful exception handling, integrity checks on data transfers, incremental release versioning, multi-threading, and normalization of the data directory tree.
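To illustrate what a transfer integrity check can look like, here is a minimal sketch that compares the SHA-256 digest of a mirrored file against an expected value. It is an assumption made for illustration, not BioMAJ's actual mechanism: the function names `sha256_of` and `transfer_is_intact`, the file name `bank.fasta`, and the choice of SHA-256 are all hypothetical.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def transfer_is_intact(path, expected_hex):
    """True when the local file's digest matches the digest published by the source."""
    return sha256_of(path) == expected_hex.lower()


# Demo with a throwaway file standing in for a mirrored bank (hypothetical name).
with tempfile.TemporaryDirectory() as workdir:
    mirrored = Path(workdir) / "bank.fasta"
    payload = b">seq1\nACGT\n"
    mirrored.write_bytes(payload)
    expected = hashlib.sha256(payload).hexdigest()
    intact = transfer_is_intact(mirrored, expected)       # digest matches
    corrupted = transfer_is_intact(mirrored, "0" * 64)    # digest does not match
```

A mirroring tool would typically run a check of this kind after each download and retry or raise an error when the digests differ.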
In addition, BioMAJ offers advanced workflow description as a directed acyclic graph (DAG) using a simple, normalized syntax, and post-process indexing for various bioinformatics tools such as blast, srs, fastacmd, and readseq. You can also integrate your own scripts to automate bank post-processing.
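The idea of describing post-processing as a DAG can be sketched as follows: each step lists the steps it depends on, and a topological sort yields a valid run order. This is a generic illustration using Python's standard `graphlib` module, not BioMAJ's own workflow syntax; the step names and the `execution_order` helper are hypothetical.

```python
from graphlib import TopologicalSorter


def execution_order(dependencies):
    """Return one valid run order for post-process steps.

    `dependencies` maps each step name to the set of steps that must finish first.
    """
    return list(TopologicalSorter(dependencies).static_order())


# Hypothetical step names, for illustration only.
workflow = {
    "uncompress": set(),
    "format_blast_index": {"uncompress"},
    "convert_readseq": {"uncompress"},
    "publish_report": {"format_blast_index", "convert_readseq"},
}

order = execution_order(workflow)
```

Here the two indexing steps depend only on `uncompress`, so a scheduler could run them in parallel. If the dependencies contained a cycle, `TopologicalSorter` would raise `graphlib.CycleError`, which is why the workflow has to be acyclic.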
The software also includes a reporting facility for automatic web report generation, history graph generation for repository analysis, an alert facility for supervising update cycles, and online querying of data warehouse contents.
Overall, BioMAJ gives bioinformatics teams a reliable, automated solution for managing biological databanks.
Version 0.9.2: N/A