Here is a list of programs used in the genome assembly and variant calling process. Very basic information is provided about each program. Check the paper and program documentation links for more extensive information. If you feel any information is inaccurate or out of date, or if you want to recommend a program to add to the lists, please contact me.
Programs listed with a GREEN BACKGROUND are ones used in this workshop.
Genome assembly programs
Program | Author | Year | Contig assembly method | Scaffolding method | Use cases | Link | Paper |
---|---|---|---|---|---|---|---|
ALLPATHS-LG | Gnerre | 2011 | de Bruijn graph and sequence graphs | Read pairs | Large genomes with both short and long reads | Website | Paper |
hifiasm | Cheng | 2020 | Error corrected overlap-layout to preserve haplotypes | NA | PacBio Hifi reads | Website | Paper |
Spades | Bankevich | 2012 | de Bruijn graph | Read pairs and small gap repeat resolution; can also use long reads or previously inferred contigs | Good for small genomes or targeted sequencing (e.g. exomes) | Website | Paper |
Discovar de novo | Weisenfeld | 2014 | de Bruijn graph and lines | NA | Single 2x250bp library | Website | Paper |
Supernova | Weisenfeld | 2017 | de Bruijn graph and lines | Read pairs and linked read barcodes | Large genomes with linked reads for phased genome assemblies | Website | Paper |
Canu | Koren | 2017 | Overlap-layout-consensus | NA | Long reads | Website | Paper |
HiCanu | Nurk | 2020 | Overlap-layout-consensus | NA | PacBio Hifi reads | Website | Paper |
Flye | Kolmogorov | 2019 | Repeat graph | NA | Long reads | Website | Paper |
platanus | Kajitani | 2014 | de Bruijn graph | Read pairs | Short reads for genomes with high heterozygosity | Website | Paper |
opera-lg | Gao | 2016 | NA | Paired reads and long reads | Scaffolding of repeat-rich genomes | Website | Paper |
agouti | Zhang | 2016 | NA | RNA-seq reads | Scaffolding of large genomes | Website | Paper |
Abyss | Jackman | 2017 | de Bruijn graph; all possible k-mers | Mate pairs, linked reads, or long reads | Short read libraries for genomes up to 100Mb; Transcriptomes with Trans-ABySS | Website | Paper |
Velvet | Zerbino and Birney | 2008 | de Bruijn graph | Read pairs | Short read assembly | Website | Paper |
SOAPdenovo2 | Luo | 2012 | de Bruijn graph | Read pairs | Short read assembly | Website | Paper |
MaSuRCA | Zimin | 2013 | Overlap-layout-consensus on unique super-reads | Mate pairs, linked reads, or long reads | Mixed short read libraries of large genomes | Website | Paper |
CABOG | NA | NA | NA | NA | NA | Website | Paper |
Falcon | Chin | 2016 | String graph | NA | PacBio long reads for diploid genome assembly | Website | Paper |
Miniasm | Li | 2016 | Overlap-layout | NA | Long reads | Website | Paper |
HINGE | Kamath | 2017 | Overlap-layout-consensus with hinging | NA | Long reads | Website | Paper |
Abruijn | Lin | 2016 | A-Bruijn graph | NA | Long reads | Website | Paper |
MEGAHIT | Li | 2015 | de Bruijn graph | Short read assembly of metagenomes | Website | Paper | |
Peregrine | Chin and Khalak | 2020 | Overlap-layout-consensus with shimmer indexing | NA | Fast long reads assembly | Website | Paper |
Read mapping programs
Program | Author | Year | Use cases | Link | Paper |
---|---|---|---|---|---|
BWA | Li and Durbin | 2010 | Short read alignment | Website | Paper |
TopHat2 | Kim | 2013 | Mapping RNA-seq reads | Website | Paper |
Minimap2 | Li | 2018 | Long read mapping and whole genome alignment | Website | Paper |
bbmap | Bushnell | 2014 | Short and long read mapper with many extras | Website | Paper |
Bowtie2 | Langmead and Salzberg | 2012 | Short read alignment | Website | Paper |
SOAP2 | Li | 2009 | Short read alignment | Website | Paper |
Single-nucleotide variant calling programs
Other helpful programs
Program | Author | Year | Use cases | Link | Paper |
---|---|---|---|---|---|
bedtools | Quinnlan and Hall | 2010 | Perform operations on sets of genomic coordinates. | Website | Paper |
bcftools | NA | NA | Perform opterions on VCF and BCF formatted files. | Website | Paper |
samtools | Li | 2009 | Perform operations on SAM/BAM/CRAM formatted files. | Website | Paper |
Picard tools | Broad Institute | 2019 | Performs many operations on SAM/BAM/CRAM and VCF files. | Website | Paper |
mosdepth | Pedersen and Quinlan | 2018 | Calculates read depth from mapped reads. | Website | Paper |
pseudo-it | Sarver | 2017 | Iterative read mapping for pseudo-reference assembly. | Website | Paper |
Referee | Thomas and Hahn | 2018 | Assign per-base quality scores to genome assemblies. | Website | Paper |