Treffer: Bioinformatic pipelines in Python with Leaf.

Title:
Bioinformatic pipelines in Python with Leaf.
Authors:
Napolitano F; Department of Computer Science (DI),University of Salerno, Fisciano (SA) 84084, Italy. fnapolitano@unisa.it, Mariani-Costantini R, Tagliaferri R
Source:
BMC bioinformatics [BMC Bioinformatics] 2013 Jun 21; Vol. 14, pp. 201. Date of Electronic Publication: 2013 Jun 21.
Publication Type:
Journal Article; Research Support, Non-U.S. Gov't
Language:
English
Journal Info:
Publisher: BioMed Central Country of Publication: England NLM ID: 100965194 Publication Model: Electronic Cited Medium: Internet ISSN: 1471-2105 (Electronic) Linking ISSN: 14712105 NLM ISO Abbreviation: BMC Bioinformatics Subsets: MEDLINE
Imprint Name(s):
Original Publication: [London] : BioMed Central, 2000-
References:
Bioinformatics. 2012 Jun 1;28(11):1525-6. (PMID: 22500002)
J Mol Graph Model. 1999 Feb;17(1):57-61. (PMID: 10660911)
Bioinformatics. 2009 Jun 1;25(11):1422-3. (PMID: 19304878)
BMC Bioinformatics. 2004 Apr 19;5:40. (PMID: 15096276)
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W729-32. (PMID: 16845108)
BMC Bioinformatics. 2008 Feb 05;9:82. (PMID: 18251993)
Genome Biol. 2010;11(8):R86. (PMID: 20738864)
Brief Bioinform. 2008 Jan;9(1):57-68. (PMID: 18056132)
Genome Res. 2003 Aug;13(8):1904-15. (PMID: 12869579)
Genome Res. 2007 Nov;17(11):1665-74. (PMID: 17921354)
Bioinformatics. 2010 Nov 1;26(21):2778-9. (PMID: 20847218)
Nature. 2012 Feb 22;482(7386):485-8. (PMID: 22358837)
Bioinformatics. 2011 Apr 1;27(7):903-11. (PMID: 21278189)
Nat Rev Genet. 2009 Aug;10(8):551-64. (PMID: 19597530)
Genome Med. 2010 Sep 07;2(9):65. (PMID: 20822536)
Entry Date(s):
Date Created: 20130622 Date Completed: 20140206 Latest Revision: 20211021
Update Code:
20250114
PubMed Central ID:
PMC3747863
DOI:
10.1186/1471-2105-14-201
PMID:
23786315
Database:
MEDLINE

Weitere Informationen

Background: An incremental, loosely planned development approach is often used in bioinformatic studies when dealing with custom data analysis in a rapidly changing environment. Unfortunately, the lack of a rigorous software structuring can undermine the maintainability, communicability and replicability of the process. To ameliorate this problem we propose the Leaf system, the aim of which is to seamlessly introduce the pipeline formality on top of a dynamical development process with minimum overhead for the programmer, thus providing a simple layer of software structuring.
Results: Leaf includes a formal language for the definition of pipelines with code that can be transparently inserted into the user's Python code. Its syntax is designed to visually highlight dependencies in the pipeline structure it defines. While encouraging the developer to think in terms of bioinformatic pipelines, Leaf supports a number of automated features including data and session persistence, consistency checks between steps of the analysis, processing optimization and publication of the analytic protocol in the form of a hypertext.
Conclusions: Leaf offers a powerful balance between plan-driven and change-driven development environments in the design, management and communication of bioinformatic pipelines. Its unique features make it a valuable alternative to other related tools.