Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_4650 |
Symbol | |
ID | 4612598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | - |
Start bp | 4875432 |
End bp | 4877387 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639794341 |
Product | glycosyl transferase family protein |
Protein accession | YP_940631 |
Protein GI | 119870679 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.492865 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGAAC CGGTAGCGCG GCAGCACCGG GACTCCCAGA CGCCATCGGC ACCGCCGCCG CAGATACCGC AGAAGCCCAA AGCTGTTGTC AGGCACTATC GCTCGATCGA CAGCGCTCCG CCGGCGTATG CGGTCAAACG TCCGCCCAGC GCTGTCAACG GTGCGCTGCT CATCTTTGTC GCGTTGAGCA GTCTGACGCT GCTGCTCGGA ACCGTACAGG CCAGGGCGTG GGGAGACCAC GCCCGCGACC TGGTCTTCGC CACGGCCGAC GGGCAGGCGG CCGCGATACC GGTGCGGGCG TTCCTGCTGG TGATGGTCAC CTCGGTGGCG TGGTCGCTGG ACACGAACTT CTGGCGCCGG TTGGCTGTGC ACCTCGAGCT GACCGGCGTG CTGATCCTGG TCTGCGCGGT GGTGGATTTC TCGGCTTACC TCGGCTACCA CGTCGGACTC TTCTATCCGC AGATCGTCGG CCAGCAGTTG GCGTCAAGCC TGGCGGCGAT GGTGCTGTTG CCGTTCACCG TGATGCGGCA CGCCCGGCTG CCGAGGCCGG CGCGCCTGCG GCCGGCGGGG CGGATGCGTT GGCATGCCTG GGTGCGGCTG GCGGTTCCGC TGGCGGTGGC GTTCGTGGCA GCCGCCTGGA TCGAGGACCG CATGCCGGTC CCGGTGGCCT GGATGCGGGA GTGGGCGCTG ATGGGTGGCG TGGGTCCGGG GATCTTCCTG GTCCAGCAGC TGTTCGGCAT CCTCGCCGCG GGGATCGGGC TGGTGATGAT CCGCCGGTCG CGCCGCGCAC GTTTCGCGCC GCCCCTCGCG GTGATCATCC CGGCGCACAA CGAGGCCCAC GACATCACCG CCACGATCGA GGCCGTCGAC CGGGCCGCGG CCCGGTACGC CGAGACGGTC CACATCTATG TCATCGACAA TGCCTCCACC GACGACACCG CGGACGTCGC ACAGACCGCC ATCGCCGCCT GCGCACACTC CACCGGGGAG GTGCACGAAT GCGCGGTCCC CGGGAAGGCG GTGGCGCTCA ACTACGGCCT GTCGGTGATC CGGGAGGAGT TCGTCGTGCG CATCGATGCC GACACCGTGA TCGGCGAGAA CTGCCTCGAC GTCACGCTGC GTCATTTCAC CGATGCGAAG GTCGCCGCCG TCGGCGGGAT GCCGCGGCCG GAACGTATCC GAACCTTCTT CGACCGGGTG CGATTGGTCG AGGTGCTCGT CAAACACGGC TTCTTCCAGG TCGCGATGAT GGGCTACGAC GGGATCATCG GCGAGCCCGG CATGTTCGTG GTCTACCGGC GCCGCGTCGT CGAAGAGGTC GGCGGCATCG TGCAGGGCAT GAACGGTGAG GACACCGACA TCTGCATGAG GATGAGCAGT CAGGGCTACC TGAGCCTGGT CGACCCCACC GCGGTCTACT TCAGCGAGAC CCCGCAGAGC TGGGCGCATC TGCGCGAACA ACGCACCCGC TGGTTTCGCA GCATCTACCA CATCGCCGCC CACAACCGGC ACGCGATCCT GAGCCGGAGT TCGATGGCCG GGGCGGTGAT GCTGCCGTTT CAGCTCGCCA ACTCGGCGCG CCGAGCGATG ATGCTGCCCC TGCTGTTGTT CGGCCTCTTG ATCTTCGGAC TGTTCCGCGA GTCGTTCCCC GGTCTGCACC CCGAGCGGCT CCTCGCGGTG TTCCTCGGGC TGCCGCTGCT GGTGGCACTC GGCGTATGCC TCGTGCGTCA GCCCCGAGCG GTCCTCTACC TCCCCGAGTA CCTCCTATTC CGGATAGTGC GCAGCTATTT CACCCTCGCC GCGGTGCTGA GCCTGGTGTT TCCGCCGCTG CATCCCCGGC AGGCGCTGCG GGAGCGAAGG CGAACGCGTA GGCGACCCCG TCACCGACGC AACCGTGCCA CGCCCGCCGA CCGCAGTTCC AGCGCCGCAA GCCCGGATAT CGCGGCGACG TCCTGA
|
Protein sequence | MNEPVARQHR DSQTPSAPPP QIPQKPKAVV RHYRSIDSAP PAYAVKRPPS AVNGALLIFV ALSSLTLLLG TVQARAWGDH ARDLVFATAD GQAAAIPVRA FLLVMVTSVA WSLDTNFWRR LAVHLELTGV LILVCAVVDF SAYLGYHVGL FYPQIVGQQL ASSLAAMVLL PFTVMRHARL PRPARLRPAG RMRWHAWVRL AVPLAVAFVA AAWIEDRMPV PVAWMREWAL MGGVGPGIFL VQQLFGILAA GIGLVMIRRS RRARFAPPLA VIIPAHNEAH DITATIEAVD RAAARYAETV HIYVIDNAST DDTADVAQTA IAACAHSTGE VHECAVPGKA VALNYGLSVI REEFVVRIDA DTVIGENCLD VTLRHFTDAK VAAVGGMPRP ERIRTFFDRV RLVEVLVKHG FFQVAMMGYD GIIGEPGMFV VYRRRVVEEV GGIVQGMNGE DTDICMRMSS QGYLSLVDPT AVYFSETPQS WAHLREQRTR WFRSIYHIAA HNRHAILSRS SMAGAVMLPF QLANSARRAM MLPLLLFGLL IFGLFRESFP GLHPERLLAV FLGLPLLVAL GVCLVRQPRA VLYLPEYLLF RIVRSYFTLA AVLSLVFPPL HPRQALRERR RTRRRPRHRR NRATPADRSS SAASPDIAAT S
|
| |