Gene Information |
Locus tag | Mkms_5107 |
Symbol | |
ID | 4612790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | - |
Start bp | 5357358 |
End bp | 5359283 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639794804 |
Product | glycosyltransferases-like protein |
Protein accession | YP_941086 |
Protein GI | 119871134 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
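The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag Mkms_5107.
length = gene_length(5357358, 5359283)
print(length, protein_length(length))  # prints "1926 641"
```

Under these assumptions the computed values agree with the Gene Length (1926 bp) and Protein Length (641 aa) fields in the record.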
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.351774 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.369198 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAGA TCCCGACCGG TGCGCCGGCC GCCGGTGATT CACGCGCAGT GAGCCTGCTC GCGCGTGTGA TCCTGCCGCG GCCCGGCGAA CCCCTCGACG TCCGCAAGCT CTATATAGAG GAATCCGAGA CCAACGCCAG GCGCGCCCAC GCCCGCACCC GCACCACCCT CGAAATCGGT GCCGAGTCTG AGGTGTCGTT CGCCACGTAC TTCAACGCGT TCCCGGCGAG CTACTGGCGG CGCTGGTCGA CGCTGACGTC GGTGGTGCTG CAGGTCGAGT TGACCGGCAG CGGCCGGGTG GACCTCTACC GCACCAAGGC CACCGGCGCC CGGATCTTCG TGGAGGGCAA GGCGTTCGCG AGCGAGGCCG ACCGGCCCAC CGTCGTCGAG GTGGAGATCG GCCTGCAGCC GTTCGAGGAC GGCGGCTGGA TCTGGTTCGA CATCACCACC GACAGTGCCG TCACCCTGCA CAGCGCCGGC TGGTATGCGC CGGTGCCCGC GCCGGGCGTG GCCAACATCG CGGTCGGCAT CCCGACCTTC AACCGCCCCG ACGACTGCGT GAACTCGCTG CGGGCGCTCA CCTCGGATCC GTTGGTCGAC GAGGTGATCA GCGCGGTCAT CGTGCCGGAC CAGGGGAACC GCAAGGTGCG CGACCACCCC GAGTTCGCCG AGGCCACCGC GCCGCTGGGC AACCGGCTGT CCATTCACGA CCAGCCCAAC CTCGGCGGGT CCGGCGGTTA CAGCCGGGTG ATGTACGAGG CGCTGAAGAA CACCGACTGC GAACAGATCC TGTTCATGGA CGACGACATC CGCGTCGAAC CGGATTCGGT CCTGCGGGCG CTGGCGTTGA ACCGCTTCGC CAAATCGCCG ATGCTGGTCG GCGGGCAGAT GCTCAACCTG CAGGAGCCGT CACACCTGCA CATCATGGGT GAGGTCGTCG ACCGGGACAA CTTCATGTGG ACGTCGGCGC CCTACACCGA ATACGACCAC GACTTCGCGA AGTACCCGCT GCACGCCAAC AACGAACGCA GCCAGCTGCT GCACCGGCGT ATCGACGTCG ACTACAACGG CTGGTGGATG TGCATGATCC CGCGGCAGGT CGCCGAGGAA CTGGGCCAAC CGCTGCCGCT GTTCATCAAA TGGGACGACG CCGAATACGG GCTGCGCGCC GCCGAACAGG GCTACCCGAC CGCGACCATG CCCGGTACCG CGATCTGGCA CATGGCGTGG AGCGACAAGG ACGACGCGAT CGACTGGCAG GCGTACTTCC ACCTCCGCAA CCGGCTGGTG GTGGCGGCGC TGCACTGGGA CGGCGACATC GGCGGGCTGG TCCGCAGCCA CTTCAAGGCC ACGCTGAAAC ACCTTGCCTG CCTTGAGTAC TCGACCGTCG AGATCCAGAA CAAGGCGATG GACGACTTCC TCGCCGGCCC GGAACACATC TTCTCGATCC TCGAGAGCGC GCTGCCCGAG GTGCACCGCA TCCGCAAGCA GTATCCGGAC GCCGTGGTGA TGCCCGCGGC GAGTGAGCTG CCGCCGCCGT CGGAGAAGCT GCAGAAGATC GACCCGCCGG TGGCCAAACC GGTGATCGCC TACCACCTGA TGCGCGGCAT CGTGCACAAC ATCAAGAAGC CCGATCCGCA GCACCACGAA CGCCCACAGC TCAACGTGCC GACGCAGAAC TCGCGATGGT TCCTGCTGTC CAAGTACGAC GGTGTCACGG TGACCACCGC CGACGGCCGC GGTGTGGTCT ACCGCAAGCG GGACCGCGCG AAGATGATCG CCCTGCTCGG TCAGTCGGTG 
CGCAGGCAGC GGCGGCTGGC TCGGCGGTTC GACCACATGC GGCGCGTCTA CCGCGAGGCG CTGCCGGTGC TGGCCAGCAA GCAGAAGTGG GAGACGGTGC TGCTTCCCCC GCAAGAAGTG CGATGA
|
Protein sequence | MSEIPTGAPA AGDSRAVSLL ARVILPRPGE PLDVRKLYIE ESETNARRAH ARTRTTLEIG AESEVSFATY FNAFPASYWR RWSTLTSVVL QVELTGSGRV DLYRTKATGA RIFVEGKAFA SEADRPTVVE VEIGLQPFED GGWIWFDITT DSAVTLHSAG WYAPVPAPGV ANIAVGIPTF NRPDDCVNSL RALTSDPLVD EVISAVIVPD QGNRKVRDHP EFAEATAPLG NRLSIHDQPN LGGSGGYSRV MYEALKNTDC EQILFMDDDI RVEPDSVLRA LALNRFAKSP MLVGGQMLNL QEPSHLHIMG EVVDRDNFMW TSAPYTEYDH DFAKYPLHAN NERSQLLHRR IDVDYNGWWM CMIPRQVAEE LGQPLPLFIK WDDAEYGLRA AEQGYPTATM PGTAIWHMAW SDKDDAIDWQ AYFHLRNRLV VAALHWDGDI GGLVRSHFKA TLKHLACLEY STVEIQNKAM DDFLAGPEHI FSILESALPE VHRIRKQYPD AVVMPAASEL PPPSEKLQKI DPPVAKPVIA YHLMRGIVHN IKKPDPQHHE RPQLNVPTQN SRWFLLSKYD GVTVTTADGR GVVYRKRDRA KMIALLGQSV RRQRRLARRF DHMRRVYREA LPVLASKQKW ETVLLPPQEV R