Gene Mvan_1841 details

Gene Information

Locus tag: Mvan_1841
Symbol:
ID: 4644221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 1962093
End bp: 1963118
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 72%
IMG OID: 639805329
Product: alcohol dehydrogenase
Protein accession: YP_952668
Protein GI: 120402839
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
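
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive (which matches the stated 1026 bp) and that the final codon is a stop codon that is not counted in the protein. The function names are hypothetical and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = feature_length(1962093, 1963118)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1026 341
```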


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCGG TCGTCATCGA CAGTGAACGT CGGATCCGCG TCGAGACGCG GCCCGATCCG 
CAGCTGCCCG GACCCGACGG CGCCATCGTC GAGGTGACGG CCACCGCGAT CTGCGGGTCG
GATCTGCACT TCCTGGAAGG CCACTACCCG ATCGACCAAC CGGTGTCGAT CGGTCACGAA
GCCGTCGGGG TCGTCGTCGA AACCGGCACT ACGGTCGCCG GATTCAAAGT CGGCGACCGG
GTGCTGGTGT CGTCGGTGGC CGGCTGCGGA CGTTGCGCGG GGTGCCGGAC GCAGGACCCC
GTGAAGTGTG TGCAGGGCCC GCAGATCTTC GGCTCGGGTC TGCTCGGCGG GGCGCAGGCG
GACCTGATGG CGGTGCCGGC GGCGGATTTC CAGTTGCTCG CGATCCCCGA CGGTATCGAC
ACCGAGCAGG CGCTGCTGCT GACCGACAAC CTCGCCACCG GGTGGGCCGC CGCCAAACGC
GCCGACATCC CGGTCGGCGG GACCGTGGCG GTGATCGGAG CGGGAGCTGT CGGCCAATGC
GCCCTGCGCA GCGCGTACGC ACTCGGCGCG GCAACGGTTT TCGCCGTCGA CCCGGTCGCC
GCCCGACGGG ACCGTGCCGC GGCGGCAGGC GCGCGGGCGG TCCCGGCCCC CGCCGCGGCA
GCGATCCTGG AGGCCACCGG CGGGCTCGGC GTCGATTCGG TGATCGACGC GGTCGGGACC
GACACGTCGC TCGACGACGC GTTGGCGTGC GTGCGCACCG GCGGAACGGT GTCGATCGTC
GGCGTGCACG ACCTGCAGCC CTACCCCCTG CCCGCGCTCG TCTGCCTGCT GCGCAGCCTG
ACGATCCGGC TGACCACCGC CCCGGTCCAG CAGACATGGC CCGAACTGAT CCCGCTGCTG
CAGGCGGGGC GACTCAGCGT CGACGGGATC TTCACCGGCG CACTGCCGCT CGACGATGCC
GAGCGGGCCT ACGCCGCGGC GTTCTCCCGA TCGGCCGAGC ACCTCAAGGT CCAGCTCATC
CCGTGA
 
Protein sequence
MRAVVIDSER RIRVETRPDP QLPGPDGAIV EVTATAICGS DLHFLEGHYP IDQPVSIGHE 
AVGVVVETGT TVAGFKVGDR VLVSSVAGCG RCAGCRTQDP VKCVQGPQIF GSGLLGGAQA
DLMAVPAADF QLLAIPDGID TEQALLLTDN LATGWAAAKR ADIPVGGTVA VIGAGAVGQC
ALRSAYALGA ATVFAVDPVA ARRDRAAAAG ARAVPAPAAA AILEATGGLG VDSVIDAVGT
DTSLDDALAC VRTGGTVSIV GVHDLQPYPL PALVCLLRSL TIRLTTAPVQ QTWPELIPLL
QAGRLSVDGI FTGALPLDDA ERAYAAAFSR SAEHLKVQLI P
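
The relationship between the gene sequence and the protein sequence can be checked with a short script. The following is a minimal sketch, not the database's own method. It builds the codon-to-amino-acid mapping used by translation table 11 (the same mapping as the standard code, with differences only in permitted start codons), strips the spacing used in the listing, and translates until the first stop codon. It does not handle alternative start codons such as GTG or TTG, which is sufficient here because the gene begins with ATG. All function and variable names are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 6 bases of the gene sequence listed above.
print(translate("ATGCGCGCGG TCGTCATCGA CAGTGAACGT"))  # MRAVVIDSER
print(translate("CCGTGA"))  # P
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison against the listed protein sequence and the reported GC content.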