MAQGene Host-it-yourself webpage that inputs fastq reads and outputs annotated variants
Purpose and audience For C. elegans biologists with next-gen sequence data, facilitates genome-wide discovery of biologically meaningful mutations.
Scope Currently C. elegans only. Extension to other species under development.
Availability & Installation Requires Linux with Apache and MySQL servers. Available through svn:
# cut-and-paste the following in a shell:
svn co https://maqweb.svn.sourceforge.net/svnroot/maqweb/trunk maqgene
Or see SourceForge project Page
Automated Workflow
maq)
Selected example output (Kindly provided by Christina Chen)
Key
ID Mut CHR START PHASTCONS REF SAM CS LM MQ NQ WR VR DP PILEUP VTP IS CLASS DESCRIPTION PARENT_FEATURES
183824 my_mutant II 9964672 0.00 X X -1000 -999.99 -1000 -1000 3 8 11 @,,,g--- indel 0 nongenic -1782 downstream R53.2
10976 my_mutant I 2777373 0.00 C G 24 0.75 30 4 0 27 27 @GgGG--- point 0 five_prime_UTR none Y71F9B.13c
204709 my_mutant I 4345310 0.94 X X -1000 -999.99 -1000 -1000 2 -1 1 @,c,c indel 4 frameshift none ZK973.6
548 my_mutant I 233319 -missing- T G 2 2.19 34 2 1 12 13 @GGGG--- point 0 missense ATA->CTA[Ile->Leu] Y48G1BM.6
18001 my_mutant I 7407340 -missing- C T 3 9.44 1 2 1 8 9 @,TTT--- point 0 ncRNA none C15A11.7b
102 my_mutant I 24070 -missing- G A 14 1.25 5 14 0 4 4 @AaAa point 0 nongenic 2709 into Y74C9A.4b
73010 my_mutant III 9101166 0.01 C T 5 0.62 27 4 0 25 25 @TGtg--- point 0 non_start ATG->ATA[Met->Ile] ZK507.1
35147 my_mutant II 1771674 -missing- T A 11 1.69 0 6 0 3 3 @Aaa point 0 premature_stop TGT->TGA[Cys->stop] F36H5.1.1,F36H5.1.2
35873 my_mutant II 2022881 0.00 A G 5 3.38 22 2 1 5 6 @.gGg--- point 0 readthrough TAA->CAA[stop->Gln] F59H6.7
779 my_mutant I 321706 0.00 C T 4 0.81 31 0 0 75 75 @tTTT--- point 0 silent GAG->GAA[Glu->Glu] Y48G1A.5
423 my_mutant I 200949 -missing- C T 16 1.00 28 14 0 5 5 @TTTt--- point 0 SNP none haw294
75163 my_mutant III 10797302 0.04 C T 23 0.75 31 22 0 12 12 @tttt--- point 0 splice_acceptor none K11D9.3.1,K11D9.3.2,K11D9.3.3
167603 my_mutant X 4914714 0.00 C A 3 0.75 29 0 1 8 9 @,aaa--- point 0 splice_donor none K05B2.3
7609 my_mutant I 1862796 -missing- T C 7 1.00 0 6 0 2 2 @cc point 0 three_prime_UTR none M01D7.2
brief name name in actual output description
ID variant_id arbitrary id number assigned to this variant by MAQGene
Mut mutant_strain name of this sample given by the user
CHR dna chromosome identifier (may also be contig or any piece of dna in the reference)
START start 1-based start position on chromosome
PHASTCONS per_locus_5way_conservation optional per-locus associated information added by user
REF reference_base nucleotide in the reference genome at this position
SAM sample_base majority base (or one of two) in the sample genome at this position
CS consensus_score Phred-scaled score, probability that called sample base is erroneous
LM loci_multiplicity Average number of additional loci that reads aligned here also mapped to
MQ mapping_quality Average Phred-scaled score, probability that reads were mapped here in error
NQ neighbor_quality NQS (Neighborhood quality standard) score
WR number_wildtype_reads number of reads aligned at this locus that match the reference base
VR number_variant_reads number of reads aligned at this locus that mismatch the reference base
DP sequencing_depth total number of reads covering this locus (number_wildtype_reads + number_variant_reads)
PILEUP sample_reads symbolic slice of the alignment at this locus
VTP variant_type either indel or point mutation
IS indel_size 0 for point mutation. > 0 for insertion (more dna in sample than reference)
CLASS class any of several classes of genic and non-genic annotations
DESCRIPTION description extra biological or proximity information
PARENT_FEATURES parent_features name of associated annotation feature relating to this entry