Gene Mkms_4650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4650 
Symbol 
ID4612598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4875432 
End bp4877387 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content68% 
IMG OID639794341 
Productglycosyl transferase family protein 
Protein accessionYP_940631 
Protein GI119870679 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.492865 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGAAC CGGTAGCGCG GCAGCACCGG GACTCCCAGA CGCCATCGGC ACCGCCGCCG 
CAGATACCGC AGAAGCCCAA AGCTGTTGTC AGGCACTATC GCTCGATCGA CAGCGCTCCG
CCGGCGTATG CGGTCAAACG TCCGCCCAGC GCTGTCAACG GTGCGCTGCT CATCTTTGTC
GCGTTGAGCA GTCTGACGCT GCTGCTCGGA ACCGTACAGG CCAGGGCGTG GGGAGACCAC
GCCCGCGACC TGGTCTTCGC CACGGCCGAC GGGCAGGCGG CCGCGATACC GGTGCGGGCG
TTCCTGCTGG TGATGGTCAC CTCGGTGGCG TGGTCGCTGG ACACGAACTT CTGGCGCCGG
TTGGCTGTGC ACCTCGAGCT GACCGGCGTG CTGATCCTGG TCTGCGCGGT GGTGGATTTC
TCGGCTTACC TCGGCTACCA CGTCGGACTC TTCTATCCGC AGATCGTCGG CCAGCAGTTG
GCGTCAAGCC TGGCGGCGAT GGTGCTGTTG CCGTTCACCG TGATGCGGCA CGCCCGGCTG
CCGAGGCCGG CGCGCCTGCG GCCGGCGGGG CGGATGCGTT GGCATGCCTG GGTGCGGCTG
GCGGTTCCGC TGGCGGTGGC GTTCGTGGCA GCCGCCTGGA TCGAGGACCG CATGCCGGTC
CCGGTGGCCT GGATGCGGGA GTGGGCGCTG ATGGGTGGCG TGGGTCCGGG GATCTTCCTG
GTCCAGCAGC TGTTCGGCAT CCTCGCCGCG GGGATCGGGC TGGTGATGAT CCGCCGGTCG
CGCCGCGCAC GTTTCGCGCC GCCCCTCGCG GTGATCATCC CGGCGCACAA CGAGGCCCAC
GACATCACCG CCACGATCGA GGCCGTCGAC CGGGCCGCGG CCCGGTACGC CGAGACGGTC
CACATCTATG TCATCGACAA TGCCTCCACC GACGACACCG CGGACGTCGC ACAGACCGCC
ATCGCCGCCT GCGCACACTC CACCGGGGAG GTGCACGAAT GCGCGGTCCC CGGGAAGGCG
GTGGCGCTCA ACTACGGCCT GTCGGTGATC CGGGAGGAGT TCGTCGTGCG CATCGATGCC
GACACCGTGA TCGGCGAGAA CTGCCTCGAC GTCACGCTGC GTCATTTCAC CGATGCGAAG
GTCGCCGCCG TCGGCGGGAT GCCGCGGCCG GAACGTATCC GAACCTTCTT CGACCGGGTG
CGATTGGTCG AGGTGCTCGT CAAACACGGC TTCTTCCAGG TCGCGATGAT GGGCTACGAC
GGGATCATCG GCGAGCCCGG CATGTTCGTG GTCTACCGGC GCCGCGTCGT CGAAGAGGTC
GGCGGCATCG TGCAGGGCAT GAACGGTGAG GACACCGACA TCTGCATGAG GATGAGCAGT
CAGGGCTACC TGAGCCTGGT CGACCCCACC GCGGTCTACT TCAGCGAGAC CCCGCAGAGC
TGGGCGCATC TGCGCGAACA ACGCACCCGC TGGTTTCGCA GCATCTACCA CATCGCCGCC
CACAACCGGC ACGCGATCCT GAGCCGGAGT TCGATGGCCG GGGCGGTGAT GCTGCCGTTT
CAGCTCGCCA ACTCGGCGCG CCGAGCGATG ATGCTGCCCC TGCTGTTGTT CGGCCTCTTG
ATCTTCGGAC TGTTCCGCGA GTCGTTCCCC GGTCTGCACC CCGAGCGGCT CCTCGCGGTG
TTCCTCGGGC TGCCGCTGCT GGTGGCACTC GGCGTATGCC TCGTGCGTCA GCCCCGAGCG
GTCCTCTACC TCCCCGAGTA CCTCCTATTC CGGATAGTGC GCAGCTATTT CACCCTCGCC
GCGGTGCTGA GCCTGGTGTT TCCGCCGCTG CATCCCCGGC AGGCGCTGCG GGAGCGAAGG
CGAACGCGTA GGCGACCCCG TCACCGACGC AACCGTGCCA CGCCCGCCGA CCGCAGTTCC
AGCGCCGCAA GCCCGGATAT CGCGGCGACG TCCTGA
 
Protein sequence
MNEPVARQHR DSQTPSAPPP QIPQKPKAVV RHYRSIDSAP PAYAVKRPPS AVNGALLIFV 
ALSSLTLLLG TVQARAWGDH ARDLVFATAD GQAAAIPVRA FLLVMVTSVA WSLDTNFWRR
LAVHLELTGV LILVCAVVDF SAYLGYHVGL FYPQIVGQQL ASSLAAMVLL PFTVMRHARL
PRPARLRPAG RMRWHAWVRL AVPLAVAFVA AAWIEDRMPV PVAWMREWAL MGGVGPGIFL
VQQLFGILAA GIGLVMIRRS RRARFAPPLA VIIPAHNEAH DITATIEAVD RAAARYAETV
HIYVIDNAST DDTADVAQTA IAACAHSTGE VHECAVPGKA VALNYGLSVI REEFVVRIDA
DTVIGENCLD VTLRHFTDAK VAAVGGMPRP ERIRTFFDRV RLVEVLVKHG FFQVAMMGYD
GIIGEPGMFV VYRRRVVEEV GGIVQGMNGE DTDICMRMSS QGYLSLVDPT AVYFSETPQS
WAHLREQRTR WFRSIYHIAA HNRHAILSRS SMAGAVMLPF QLANSARRAM MLPLLLFGLL
IFGLFRESFP GLHPERLLAV FLGLPLLVAL GVCLVRQPRA VLYLPEYLLF RIVRSYFTLA
AVLSLVFPPL HPRQALRERR RTRRRPRHRR NRATPADRSS SAASPDIAAT S