Gene Mvan_1031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1031 
Symbol 
ID4644252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1082378 
End bp1083976 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content71% 
IMG OID639804532 
Productaldehyde dehydrogenase 
Protein accessionYP_951875 
Protein GI120402046 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.576892 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCCA CGATCTCGGG CGCCACCAGC CTCACCGGAC AGATGCTGAT CGCCGGGGCA 
CCTGTGCGGG GCACCGGCAA GGAGGTCCGC GCATTCGATC CCGCGGCCGG GCAACCGCTG
GAACCGGTGT ACCAGCACGG CGACAACACC CACGTCGACG CCGCATGCGC TGCGGCCGCC
GACGCGTTCG CGCAATACCG CGCCACCACA TCGGAGCAGC GCGCCGCGTT TCTCGACACG
ATCGCCACCA ACATCGAGGC CGTCGGCGAG GCGCTGATCG CCCGCGCCGT CGCCGAATCC
GGACTGCCGC AGGCCAGGAT CACCGGCGAG CTCGGCCGCA CCACCGGACA GCTGCGCCTG
TTCGCCTCGG TGCTGCGCGA GGGCAGCTGG AACGGTGCCC GCATCGACAC CGCGCTGCCC
GACCGGACCC CGCTGCCGCG CCCCGACCTC CGTCAGCGCC ATATCCCGCT CGGTCCCGTC
GCGGTGTTCA GCGCGTCGAA CTTCCCGCTG GCGTTCTCCG TCGCCGGCGG TGACACCGCC
TCGGCGCTGG CTGCCGGCTG CCCGGTCGTC GTCAAAGGAC ACGACGCGCA TCCCGGCACC
TCCGAGCTCG TCGCCCGCGC CGTCACCGAC GCCGTCACCA CCTCCGGACT GCCCGCGGGA
ACGTTCTCGC TGCTGTTCGG CTCCGGCCCC GGTCTCGGCA TCGCACTGGT CACCGATCCG
CGCATCAAGG CCGTCGGTTT CACCGGATCA CGTTCCGGCG GAATGGCCCT CGTCTCCGCC
GCGGCGGCAC GTCCCGAACC CATCCCGGTG TATGCCGAGA TGAGCTCCAT CAACCCGGTG
TTCGTGCTCG ACGGTGCGCT GAAAACCCGC GGCGCCGAGC TGGGCCGCGC GTTCGTCGCG
TCGCTGACGA TGGGTTCCGG CCAGTTCTGC ACCAACCCCG GACTGGTGAT CGCCGTCGAC
GGACCCGGGC TGGACACATT CGCCGCCGCC GCTCGTGACG CACTGGCCGG CTCGCCGGCC
ACCCCGATGC TGACCCCGAC CATCGCGCGC AGCTACGCCT CCGGTGTGGA GGCGCTGTCC
GGTGCCGCGC AGCTTGTCGG CCGCGGCGCG CCCGGTACCA GTGAAACTGC TTGCCACGCC
GCGCTGTTCA GCACCGATGC GCAGACCTTT CTGGCGTCGG AGGCATTACA GGCCGAGGTG
TTCGGCTCGT CGAGCCTGAT CGTGCGTTGC GCCGACTTCG AGCAGATGCG CGCCGTCGCC
GAGGGCATCG AAGGACAGCT CACCGCGACC GTGCACGCCG ACGACTCCGA CCTCGACGAC
GCGGGCCGGC TGCTGCCACT GCTGGAACTC AAGGCAGGTC GGATCCTGTT CGGCGGCTGG
CCGACCGGCG TCGAGGTCTG CCACGCGATG GTGCACGGCG GACCGTTCCC GGCCACGTCG
GACTCGCGCA GCACCTCGGT CGGTTCGCAG GCCATCGAAC GCTATCTGCG GCCCGTCTGC
TATCAGGACG TGCCGGCCCC GTTGCTGCCC AGCGCGATCG CCGAAGGAAA CCCCGAAAAG
CTGTGGCGGC GCGTCGACGG CCGACTCACC CAAGACTGA
 
Protein sequence
MTATISGATS LTGQMLIAGA PVRGTGKEVR AFDPAAGQPL EPVYQHGDNT HVDAACAAAA 
DAFAQYRATT SEQRAAFLDT IATNIEAVGE ALIARAVAES GLPQARITGE LGRTTGQLRL
FASVLREGSW NGARIDTALP DRTPLPRPDL RQRHIPLGPV AVFSASNFPL AFSVAGGDTA
SALAAGCPVV VKGHDAHPGT SELVARAVTD AVTTSGLPAG TFSLLFGSGP GLGIALVTDP
RIKAVGFTGS RSGGMALVSA AAARPEPIPV YAEMSSINPV FVLDGALKTR GAELGRAFVA
SLTMGSGQFC TNPGLVIAVD GPGLDTFAAA ARDALAGSPA TPMLTPTIAR SYASGVEALS
GAAQLVGRGA PGTSETACHA ALFSTDAQTF LASEALQAEV FGSSSLIVRC ADFEQMRAVA
EGIEGQLTAT VHADDSDLDD AGRLLPLLEL KAGRILFGGW PTGVEVCHAM VHGGPFPATS
DSRSTSVGSQ AIERYLRPVC YQDVPAPLLP SAIAEGNPEK LWRRVDGRLT QD