Gene Mmcs_4562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_4562 
Symbol 
ID4113391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp4836292 
End bp4838247 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content68% 
IMG OID638033712 
Productglycosyl transferase family protein 
Protein accessionYP_641722 
Protein GI108801525 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.829181 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACGAAC CGGTAGCGCG GCAGCACCGG GACTCCCAGA CGCCATCGGC ACCGCCGCCG 
CAGATACCGC AGAAGCCCAA AGCTGTTGTC AGGCACTATC GCTCGATCGA CAGCGCTCCG
CCGGCGTATG CGGTCAAACG TCCGCCCAGC GCTGTCAACG GTGCGCTGCT CATCTTTGTC
GCGTTGAGCA GTCTGACGCT GCTGCTCGGA ACCGTACAGG CCAGGGCGTG GGGAGACCAC
GCCCGCGACC TGGTCTTCGC CACGGCCGAC GGGCAGGCGG CCGCGATACC GGTGCGGGCG
TTCCTGCTGG TGATGGTCAC CTCGGTGGCG TGGTCGCTGG ACACGAACTT CTGGCGCCGG
TTGGCTGTGC ACCTCGAGCT GACCGGCGTG CTGATCCTGG TCTGCGCGGT GGTGGATTTC
TCGGCTTACC TCGGCTACCA CGTCGGACTC TTCTATCCGC AGATCGTCGG CCAGCAGTTG
GCGTCAAGCC TGGCGGCGAT GGTGCTGTTG CCGTTCACCG TGATGCGGCA CGCCCGGCTG
CCGAGGCCGG CGCGCCTGCG GCCGGCGGGG CGGATGCGTT GGCATGCCTG GGTGCGGCTG
GCGGTTCCGC TGGCGGTGGC GTTCGTGGCA GCCGCCTGGA TCGAGGACCG CATGCCGGTC
CCGGTGGCCT GGATGCGGGA GTGGGCGCTG ATGGGTGGCG TGGGTCCGGG GATCTTCCTG
GTCCAGCAGC TGTTCGGCAT CCTCGCCGCG GGGATCGGGC TGGTGATGAT CCGCCGGTCG
CGCCGCGCAC GTTTCGCGCC GCCCCTCGCG GTGATCATCC CGGCGCACAA CGAGGCCCAC
GACATCACCG CCACGATCGA GGCCGTCGAC CGGGCCGCGG CCCGGTACGC CGAGACGGTC
CACATCTATG TCATCGACAA TGCCTCCACC GACGACACCG CGGACGTCGC ACAGACCGCC
ATCGCCGCCT GCGCACACTC CACCGGGGAG GTGCACGAAT GCGCGGTCCC CGGGAAGGCG
GTGGCGCTCA ACTACGGCCT GTCGGTGATC CGGGAGGAGT TCGTCGTGCG CATCGATGCC
GACACCGTGA TCGGCGAGAA CTGCCTCGAC GTCACGCTGC GTCATTTCAC CGATGCGAAG
GTCGCCGCCG TCGGCGGGAT GCCGCGGCCG GAACGTATCC GAACCTTCTT CGACCGGGTG
CGATTGGTCG AGGTGCTCGT CAAACACGGC TTCTTCCAGG TCGCGATGAT GGGCTACGAC
GGGATCATCG GCGAGCCCGG CATGTTCGTG GTCTACCGGC GCCGCGTCGT CGAAGAGGTC
GGCGGCATCG TGCAGGGCAT GAACGGTGAG GACACCGACA TCTGCATGAG GATGAGCAGT
CAGGGCTACC TGAGCCTGGT CGACCCCACC GCGGTCTACT TCAGCGAGAC CCCGCAGAGC
TGGGCGCATC TGCGCGAACA ACGCACCCGC TGGTTTCGCA GCATCTACCA CATCGCCGCC
CACAACCGGC ACGCGATCCT GAGCCGGAGT TCGATGGCCG GGGCGGTGAT GCTGCCGTTT
CAGCTCGCCA ACTCGGCGCG CCGAGCGATG ATGCTGCCCC TGCTGTTGTT CGGCCTCTTG
ATCTTCGGAC TGTTCCGCGA GTCGTTCCCC GGTCTGCACC CCGAGCGGCT CCTCGCGGTG
TTCCTCGGGC TGCCGCTGCT GGTGGCACTC GGCGTATGCC TCGTGCGTCA GCCCCGAGCG
GTCCTCTACC TCCCCGAGTA CCTCCTATTC CGGATAGTGC GCAGCTATTT CACCCTCGCC
GCGGTGCTGA GCCTGGTGTT TCCGCCGCTG CATCCCCGGC AGGCGCTGCG GGAGCGAAGG
CGAACGCGTA GGCGACCCCG TCACCGACGC AACCGTGCCA CGCCCGCCGA CCGCAGTTCC
AGCGCCGCAA GCCCGGATAT CGCGGCGACG TCCTGA
 
Protein sequence
MNEPVARQHR DSQTPSAPPP QIPQKPKAVV RHYRSIDSAP PAYAVKRPPS AVNGALLIFV 
ALSSLTLLLG TVQARAWGDH ARDLVFATAD GQAAAIPVRA FLLVMVTSVA WSLDTNFWRR
LAVHLELTGV LILVCAVVDF SAYLGYHVGL FYPQIVGQQL ASSLAAMVLL PFTVMRHARL
PRPARLRPAG RMRWHAWVRL AVPLAVAFVA AAWIEDRMPV PVAWMREWAL MGGVGPGIFL
VQQLFGILAA GIGLVMIRRS RRARFAPPLA VIIPAHNEAH DITATIEAVD RAAARYAETV
HIYVIDNAST DDTADVAQTA IAACAHSTGE VHECAVPGKA VALNYGLSVI REEFVVRIDA
DTVIGENCLD VTLRHFTDAK VAAVGGMPRP ERIRTFFDRV RLVEVLVKHG FFQVAMMGYD
GIIGEPGMFV VYRRRVVEEV GGIVQGMNGE DTDICMRMSS QGYLSLVDPT AVYFSETPQS
WAHLREQRTR WFRSIYHIAA HNRHAILSRS SMAGAVMLPF QLANSARRAM MLPLLLFGLL
IFGLFRESFP GLHPERLLAV FLGLPLLVAL GVCLVRQPRA VLYLPEYLLF RIVRSYFTLA
AVLSLVFPPL HPRQALRERR RTRRRPRHRR NRATPADRSS SAASPDIAAT S