Gene Mvan_0014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0014 
Symbol 
ID4644541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp18488 
End bp20005 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content69% 
IMG OID639803524 
Productbetaine-aldehyde dehydrogenase 
Protein accessionYP_950871 
Protein GI120401042 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.446466 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA TCCCGCACTA CCGGATGTAC ATCGACGGAG CGTGGCGTGA CGCCGCGGAC 
ACGATCGAAG TCCGCAGTCC CGCCACCGGC GAACTGGTGG CGACGGTCGC CTACGGCGGC
GTGACCACCG TGGACGACGC GGTCGCCGCG GCCAAGGCCG CCGACGAGGC CGGGGTGTGG
CGCAACACGT CTGCACAGCA ACGCGCGGAC GTACTCGATG CGATCGCCGA CAACCTCGCT
GCGCGCACCG ACGAACTGAC CGCACTTCAG GTCGCCGAGA ACGGTGCCAC CGTCCGCGCG
GCGGGCGCCT TCCTCATCGG CTACGCCATC GCGCACCTGC GGTACTTCGC CGGCCTTGCG
CGCACCTACG CGTTCCAGTC CAGCGGACCG CTGATGGAGG CCCCGACGCT GGCCGCAGGG
ATGATCGTCA GGGATCCGGT CGGCGTGTGC GCCGGCATGA TCCCGTGGAA CTTCCCGCTG
CTGCTCGCGG TCTGGAAGCT GGGTCCCGCA CTCGCCGCCG GCAACACCGT CGTGCTGAAG
CCCGACGACC AGACGCCGCT GACGCTGCTC GAGCTGGCCC GCGCCGCCGA CGAAGTCGGG
CTCCCCGCAG GGGTTCTCAA CGTGGTCACG GGCGCCGGCC CGACCGTCGG GGCACGTCTG
GCCGAACACC CGGACGTCCG GAAGGTGGCG TTCACCGGAT CGACCGAAGT GGGCAAGAGC
GTCATGCGCG CGGCGGCCGA CACGGTGAAG AAGGTCACGC TCGAGCTGGG CGGCAAGGGC
GCCAGCATCG TCCTCGACGA TGCCGACCTC GACCTGGCGG TCGACGGATC GCTGTTCGCG
TTCCTGTTGA TGAGCGGTCA GGCCTGCGAG TCCGGAACCC GGCTGCTCGT CCACGAGTCC
ATCCATGACG AGTTCGTCCG CCGCATGGTG GCCCGTGCCG AGACGCTGGT GATGGGCGAC
CCGATGAGCC TCGCCTCCGA TCTGGGACCG CTGGTCTCCG CCAAGCAGAA GGCGCGGGTG
GAGAAGTACA TCGCGCTCGG CCAGGAGGAA GGGTGCAAGC TGGCCTACCA GGGCACCGTC
CCGACGGATC CTGCTCTGGC ACAAGGTCAT TGGGTGCCTC CGACGATCCT CACCGGCGCC
ACCAACGACA TGCGGATCGC CCGGGAGGAG ATCTTCGGTC CCGTGCTCGT CGTGCTCACC
TACGGTGACG ACGACGAGGC GGTCGCCATC GCGAACGACA GCGAGTACGG GCTGTCGGCC
GGGGTGTGGA GTGCCGACAG GGAACGGGCG CTGGGGATCG CCCGCCGCCT GCAATCCGGC
ACGGTGTGGG TCAACGACTG GCACATGATC AACGCGATGT ACCCGTTCGG CGGGGTCAAG
CAGAGCGGTC TCGGCCGTGA ACTCGGTCCC GACGCACTCG ACGAGTACAC CGAACCGAAG
TTCATCCACG TCGACATGAC CGACGACCGC CGCAAACACG TGTATCCGGT CGTCATCTCT
GCGGCAGCGC AGGGCTGA
 
Protein sequence
MSDIPHYRMY IDGAWRDAAD TIEVRSPATG ELVATVAYGG VTTVDDAVAA AKAADEAGVW 
RNTSAQQRAD VLDAIADNLA ARTDELTALQ VAENGATVRA AGAFLIGYAI AHLRYFAGLA
RTYAFQSSGP LMEAPTLAAG MIVRDPVGVC AGMIPWNFPL LLAVWKLGPA LAAGNTVVLK
PDDQTPLTLL ELARAADEVG LPAGVLNVVT GAGPTVGARL AEHPDVRKVA FTGSTEVGKS
VMRAAADTVK KVTLELGGKG ASIVLDDADL DLAVDGSLFA FLLMSGQACE SGTRLLVHES
IHDEFVRRMV ARAETLVMGD PMSLASDLGP LVSAKQKARV EKYIALGQEE GCKLAYQGTV
PTDPALAQGH WVPPTILTGA TNDMRIAREE IFGPVLVVLT YGDDDEAVAI ANDSEYGLSA
GVWSADRERA LGIARRLQSG TVWVNDWHMI NAMYPFGGVK QSGLGRELGP DALDEYTEPK
FIHVDMTDDR RKHVYPVVIS AAAQG