Gene Mvan_4749 details

Gene Information

Locus tag: Mvan_4749
Symbol:
ID: 4647756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 5086415
End bp: 5087596
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 67%
IMG OID: 639808218
Product: alcohol dehydrogenase
Protein accession: YP_955528
Protein GI: 120405699
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
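
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the record does not state either convention, but the listed numbers are consistent with both. The variable names are illustrative only.

```python
# Values copied from the record; the variable names are illustrative.
start_bp = 5086415
end_bp = 5087596
gene_length_bp = 1182
protein_length_aa = 393

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the span covers whole codons, the last one being a stop codon
# that contributes no residue to the protein.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # 1182 394 393
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.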


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.403113
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGGCAG TCACCTGGCA TGGACGGCGC GACGTACGAG TCGACTCCGT CCCCGATCCG 
AAAATCGAAG AGCCCACCGA CGCGATCATC GAAGTGACGT CGACGAACAT CTGCGGCTCC
GACCTCCACC TATACGAGGT GCTTGGCGCG TTCATGAACG AGGGCGACAT CCTCGGACAC
GAGCCGATGG GGGTGGTCCG CGAGGTCGGC AGCGCGGTGT CGAATCTGGC AGTGGGAGAC
CGGGTGGTGA TCCCGTTCCA GATCTCGTGC GGACATTGCT TCATGTGCGA CCGCAAGCTG
TACACCCAGT GCGAGACCAC CCAGGTCCGC GACCAGGGGA TGGGGGCGGC GCTGTTCGGC
TATTCGGAGC TCTACGGCTC GGTGCCCGGT GGGCAGGCCG AGTACCTACG GGTGCCGCAG
GCACAGTTCA CCCACATCAA AGTCCCTGAC GGCCCGCCGG ATTCGCGGTT CGTGTATCTG
TCCGATGTGT TACCCACCGC GTGGCAGGCG GTGGCCTACG CCGACGTGCC CGACGGCGGC
ACGGTCACGG TCCTCGGCCT CGGCCCGATC GGGGACATGG CCGCCCGGAT CGCCCAGCAC
CTCGGCTACC AGGTGTTCGC GGTGGATCTG GTGCCCGAGC GGCTGGACCG GGCCATCGCG
CGCGGCATCC ACACGATCGA CGCGTCGATC GTCGACGGGT CCGTCGGCGA CGAGGTGCGC
CGGCTCACCG GCGGGCGTGG CAGCGACTCG GTGATCGACG CCGTCGGCAT GGAGGCGCAC
GGGTCGCCGG TGGCCAAGTT CGCCCAGCAG GCCACGGCAC TGCTGCCCGA CGCCGTCGCC
AAGCCGATGA TGCAGAAAGC CGGGGTGGAC CGCCTGGACG CGTTGTACAC CGCGATCGAC
TGCGTGCGGC GCGGAGGCAC CCTGTCGCTG ATCGGCGTGT ACGGCGGCAT GGCCGACCCG
ATGCCGATGC TGACGCTGTT CGACAAGCAG ATCCAGGTGC GGATGGGGCA GGCCAACGTC
AAGAAGTGGG TCGACGACAT CATGCCGCTG CTGACCGACT CGGACCCGCT GGGCGTCGAC
ACCTTCGCCA CCCACGTTCT GCCGATGGAG GAGGCCCCGC ACGCCTACAA GATCTTCCAG
CAGAAGCAGG ACGGTGCGGT GAAGGTTATT TTGCAGCCTT GA
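
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks must be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage that can be compared with the GC content field of the record. The function names are hypothetical, and only the first line of the sequence is used as sample input, so the printed value is not the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here as an illustration only.
first_line = "ATGAGGGCAG TCACCTGGCA TGGACGGCGC GACGTACGAG TCGACTCCGT CCCCGATCCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_percent` gives the figure to compare against the rounded value listed in the record.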
 
Protein sequence
MRAVTWHGRR DVRVDSVPDP KIEEPTDAII EVTSTNICGS DLHLYEVLGA FMNEGDILGH 
EPMGVVREVG SAVSNLAVGD RVVIPFQISC GHCFMCDRKL YTQCETTQVR DQGMGAALFG
YSELYGSVPG GQAEYLRVPQ AQFTHIKVPD GPPDSRFVYL SDVLPTAWQA VAYADVPDGG
TVTVLGLGPI GDMAARIAQH LGYQVFAVDL VPERLDRAIA RGIHTIDASI VDGSVGDEVR
RLTGGRGSDS VIDAVGMEAH GSPVAKFAQQ ATALLPDAVA KPMMQKAGVD RLDALYTAID
CVRRGGTLSL IGVYGGMADP MPMLTLFDKQ IQVRMGQANV KKWVDDIMPL LTDSDPLGVD
TFATHVLPME EAPHAYKIFQ QKQDGAVKVI LQP
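
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below is a minimal illustration, not the pipeline that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; table 11 differs in its permitted start codons, which this sketch does not handle and which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-residue assignments in TCAG order. Table 11 uses the same
# assignments as the standard code; alternative start codons are ignored.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the ten-base grouping
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 20 bases of the gene: six complete codons.
print(translate("ATGAGGGCAG TCACCTGGCA"))  # MRAVTW
# Last 12 bases of the gene: three codons followed by the TGA stop.
print(translate("TTGCAGCCTT GA"))  # LQP
```

Both fragments agree with the start (MRAVTW) and end (LQP) of the protein sequence listed above.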