Gene Mvan_5450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5450 
Symbol 
ID4644563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5831852 
End bp5832922 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content67% 
IMG OID639808926 
Productalcohol dehydrogenase 
Protein accessionYP_956226 
Protein GI120406397 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.524745 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATGC GGGCCGCCCG CATGTACGGG TACAAGCAGC CACTGCGCCT CGAAGAGGTC 
GATGTGCCGT CTCCCGGTCC CGAGGAGGTG CTCGTCCGGG TGGGCGGTGC GGGCATGTGC
CGCACTGACT TCCAGCTGAT CGACGGCTAT TTCGACAACG GTCTTTCGAT GGACTTTCCC
ATCACGCCCG GACACGAGGT CGCCGGCTGG GTCGACGGAG TCGGATCGGC GGTTCCGAAG
TCGGCGGGGC TGGCGGAGGG CGATCAGGTG GTGGTGTTCG GCAGCTGGGG CGATGGGGCG
TGCAGGCAAT GTCACGAAGG CAACGAGCAG CTGTGCGCGC ACGGCGTCTG GGCGGGCTTC
GGCCGCCACG GCGGCTATCA GGAGTACCTG CCCGTCAACT ACCGGTACCT GATCAAGATC
CCCGGCGGGG GTGAACTGTC GCCGGACAAC CTGGCGCCGC TGACAGACGC CGGGCTGACG
CCCTACCGGG GGTTGAAGAA GTTGCGCAAC GCGGGGCACC TCGGGCCCGG CAGGACCGTC
GCGGTGTCGG GAATCGGCGG CCTGGGGAGT TACGCCACGC AGTACGCGAA ACTCCTTGGC
GGCGGCGCGG AGGTGGTCGC GTTCGCGCGC AGCGACGAGA AGCTCAAGAT CGCCAAGGAC
AACGGCGCCG ACCATGTCGT CAACGTGCGG GACAAGGACA CCGAAGACGT CCGCGCCGAA
CTCGAATCGG CCACTGGGCG AACAGAACTC GATGCGGTCA TCGAGTGCGC CGGATCGGAG
GACTCGATCC GGCTGGCGTT CTCGCTGCTG GCAGCCGAGG GCGCGGTGGC GTCCGTCGGC
CTCATCGGCA ACCGCGTCGA CATCCCGCTC TTCCCGTTGG TGGCGCGGGA GTACACCTTT
TACGGGTCGT TCTGGGGCAA CTACAACGAC CTCACCGAGG TGCTGGCGCT GGCGCGGACG
GGACAGCTCA AGCACTCGGT CACCCGGGTG CGCTTCGACG ACGTCAACGA GACCCTCGAG
GCGATCGCCC GTGGCGACGT GCTCGGGCGC GCAGTGATCG TCTACGACTG A
 
Protein sequence
MKMRAARMYG YKQPLRLEEV DVPSPGPEEV LVRVGGAGMC RTDFQLIDGY FDNGLSMDFP 
ITPGHEVAGW VDGVGSAVPK SAGLAEGDQV VVFGSWGDGA CRQCHEGNEQ LCAHGVWAGF
GRHGGYQEYL PVNYRYLIKI PGGGELSPDN LAPLTDAGLT PYRGLKKLRN AGHLGPGRTV
AVSGIGGLGS YATQYAKLLG GGAEVVAFAR SDEKLKIAKD NGADHVVNVR DKDTEDVRAE
LESATGRTEL DAVIECAGSE DSIRLAFSLL AAEGAVASVG LIGNRVDIPL FPLVAREYTF
YGSFWGNYND LTEVLALART GQLKHSVTRV RFDDVNETLE AIARGDVLGR AVIVYD