Verification and validation of bioinformatics software without a gold standard: A case study of BWA and Bowtie

Verification and validation of bioinformatics software without a gold standard: A case study of BWA and Bowtie

Verification and validation of bioinformatics software without a gold standard: A case study of BWA and Bowtie (#51)

Eleni Giannoulatou ¹ , Shin-Ho Park ¹ , David Humphreys ¹ , Joshua WK Ho ¹

Victor Chang Cardiac Research Institute, Darlinghurst, NSW, Australia

Background: Bioinformatics software quality assurance is essential in genomic medicine. Systematic verification and validation of bioinformatics is difficult because it is often not possible to obtain a realistic “gold standard” for systematic evaluation. Here we apply a technique that originates from the software testing literature, namely Metamorphic Testing (MT), to systematically test three widely used short read sequence alignment programs.

Results: MT alleviates the problems associated with the lack of gold standard by checking that the results from multiple executions of a program satisfy a set of expected or desirable properties that can be derived from the software specification or user expectations. We tested BWA, Bowtie and Bowtie2 using simulated data and one HapMap dataset. It is interesting to observe that multiple execution of the same aligner using slightly modified input FASTQ sequence file, such as randomly re-ordering of the reads, may affect alignment results. Furthermore, we found that the list of variant calls can be affected unless strict quality control is applied during variant calling.

Conclusion: Thorough testing of bioinformatics software is important in delivering clinical genomic medicine. This paper demonstrates a different framework to test a program that involve checking its properties, thus greatly expanding the number and repertoire of test cases we can apply in practice.

Authors contributing to this presentation.

Joshua Ho completed a BSc (Hon 1, Medal) in Biochemistry and Computer Science in 2006 and a PhD in Bioinformatics in 2010, both at the University of Sydney. He then completed an interdisciplinary postdoctoral fellowship at the Harvard Medical School (HMS). In mid 2012, he became an Instructor in Medicine at HMS. In July 2013, he returned to Australia to set up the Bioinformatics and Systems Medicine Laboratory at the Victor Chang Cardiac Research Institute. He is also a conjoint senior lecturer at UNSW.

Verification and validation of bioinformatics software without a gold standard: A case study of BWA and Bowtie (#51)

Giannoulatou , E

Park, S

Humphreys, D

Ho, J.W

Verification and validation of bioinformatics software without a gold standard: A case study of BWA and Bowtie (#51)

Add notes

Giannoulatou , E

Park, S

Humphreys, D

Ho, J.W

Login