Gene Mkms_5107 details


Gene Information

Locus tag: Mkms_5107
Symbol:
ID: 4612790
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 5357358
End bp: 5359283
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 67%
IMG OID: 639794804
Product: glycosyltransferases-like protein
Protein accession: YP_941086
Protein GI: 119871134
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
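
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 5357358
end_bp = 5359283
reported_gene_length = 1926      # bp
reported_protein_length = 641    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length == reported_gene_length)
print(gene_length % 3 == 0)
print(protein_length == reported_protein_length)
```

Under these assumptions all three checks agree with the reported values.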


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.351774
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.369198
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAGA TCCCGACCGG TGCGCCGGCC GCCGGTGATT CACGCGCAGT GAGCCTGCTC 
GCGCGTGTGA TCCTGCCGCG GCCCGGCGAA CCCCTCGACG TCCGCAAGCT CTATATAGAG
GAATCCGAGA CCAACGCCAG GCGCGCCCAC GCCCGCACCC GCACCACCCT CGAAATCGGT
GCCGAGTCTG AGGTGTCGTT CGCCACGTAC TTCAACGCGT TCCCGGCGAG CTACTGGCGG
CGCTGGTCGA CGCTGACGTC GGTGGTGCTG CAGGTCGAGT TGACCGGCAG CGGCCGGGTG
GACCTCTACC GCACCAAGGC CACCGGCGCC CGGATCTTCG TGGAGGGCAA GGCGTTCGCG
AGCGAGGCCG ACCGGCCCAC CGTCGTCGAG GTGGAGATCG GCCTGCAGCC GTTCGAGGAC
GGCGGCTGGA TCTGGTTCGA CATCACCACC GACAGTGCCG TCACCCTGCA CAGCGCCGGC
TGGTATGCGC CGGTGCCCGC GCCGGGCGTG GCCAACATCG CGGTCGGCAT CCCGACCTTC
AACCGCCCCG ACGACTGCGT GAACTCGCTG CGGGCGCTCA CCTCGGATCC GTTGGTCGAC
GAGGTGATCA GCGCGGTCAT CGTGCCGGAC CAGGGGAACC GCAAGGTGCG CGACCACCCC
GAGTTCGCCG AGGCCACCGC GCCGCTGGGC AACCGGCTGT CCATTCACGA CCAGCCCAAC
CTCGGCGGGT CCGGCGGTTA CAGCCGGGTG ATGTACGAGG CGCTGAAGAA CACCGACTGC
GAACAGATCC TGTTCATGGA CGACGACATC CGCGTCGAAC CGGATTCGGT CCTGCGGGCG
CTGGCGTTGA ACCGCTTCGC CAAATCGCCG ATGCTGGTCG GCGGGCAGAT GCTCAACCTG
CAGGAGCCGT CACACCTGCA CATCATGGGT GAGGTCGTCG ACCGGGACAA CTTCATGTGG
ACGTCGGCGC CCTACACCGA ATACGACCAC GACTTCGCGA AGTACCCGCT GCACGCCAAC
AACGAACGCA GCCAGCTGCT GCACCGGCGT ATCGACGTCG ACTACAACGG CTGGTGGATG
TGCATGATCC CGCGGCAGGT CGCCGAGGAA CTGGGCCAAC CGCTGCCGCT GTTCATCAAA
TGGGACGACG CCGAATACGG GCTGCGCGCC GCCGAACAGG GCTACCCGAC CGCGACCATG
CCCGGTACCG CGATCTGGCA CATGGCGTGG AGCGACAAGG ACGACGCGAT CGACTGGCAG
GCGTACTTCC ACCTCCGCAA CCGGCTGGTG GTGGCGGCGC TGCACTGGGA CGGCGACATC
GGCGGGCTGG TCCGCAGCCA CTTCAAGGCC ACGCTGAAAC ACCTTGCCTG CCTTGAGTAC
TCGACCGTCG AGATCCAGAA CAAGGCGATG GACGACTTCC TCGCCGGCCC GGAACACATC
TTCTCGATCC TCGAGAGCGC GCTGCCCGAG GTGCACCGCA TCCGCAAGCA GTATCCGGAC
GCCGTGGTGA TGCCCGCGGC GAGTGAGCTG CCGCCGCCGT CGGAGAAGCT GCAGAAGATC
GACCCGCCGG TGGCCAAACC GGTGATCGCC TACCACCTGA TGCGCGGCAT CGTGCACAAC
ATCAAGAAGC CCGATCCGCA GCACCACGAA CGCCCACAGC TCAACGTGCC GACGCAGAAC
TCGCGATGGT TCCTGCTGTC CAAGTACGAC GGTGTCACGG TGACCACCGC CGACGGCCGC
GGTGTGGTCT ACCGCAAGCG GGACCGCGCG AAGATGATCG CCCTGCTCGG TCAGTCGGTG
CGCAGGCAGC GGCGGCTGGC TCGGCGGTTC GACCACATGC GGCGCGTCTA CCGCGAGGCG
CTGCCGGTGC TGGCCAGCAA GCAGAAGTGG GAGACGGTGC TGCTTCCCCC GCAAGAAGTG
CGATGA
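
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined before any analysis. The following sketch shows one way to do that and to compute a GC fraction comparable to the GC content field; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here as an excerpt.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGTGAGA TCCCGACCGG TGCGCCGGCC GCCGGTGATT CACGCGCAGT GAGCCTGCTC"

excerpt = clean_sequence(first_line)
print(len(excerpt))
print(round(gc_fraction(excerpt), 2))
```

Passing the whole sequence block to `clean_sequence` instead of one line would give a string whose length and GC fraction can be compared with the Gene Length and GC content fields.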
 
Protein sequence
MSEIPTGAPA AGDSRAVSLL ARVILPRPGE PLDVRKLYIE ESETNARRAH ARTRTTLEIG 
AESEVSFATY FNAFPASYWR RWSTLTSVVL QVELTGSGRV DLYRTKATGA RIFVEGKAFA
SEADRPTVVE VEIGLQPFED GGWIWFDITT DSAVTLHSAG WYAPVPAPGV ANIAVGIPTF
NRPDDCVNSL RALTSDPLVD EVISAVIVPD QGNRKVRDHP EFAEATAPLG NRLSIHDQPN
LGGSGGYSRV MYEALKNTDC EQILFMDDDI RVEPDSVLRA LALNRFAKSP MLVGGQMLNL
QEPSHLHIMG EVVDRDNFMW TSAPYTEYDH DFAKYPLHAN NERSQLLHRR IDVDYNGWWM
CMIPRQVAEE LGQPLPLFIK WDDAEYGLRA AEQGYPTATM PGTAIWHMAW SDKDDAIDWQ
AYFHLRNRLV VAALHWDGDI GGLVRSHFKA TLKHLACLEY STVEIQNKAM DDFLAGPEHI
FSILESALPE VHRIRKQYPD AVVMPAASEL PPPSEKLQKI DPPVAKPVIA YHLMRGIVHN
IKKPDPQHHE RPQLNVPTQN SRWFLLSKYD GVTVTTADGR GVVYRKRDRA KMIALLGQSV
RRQRRLARRF DHMRRVYREA LPVLASKQKW ETVLLPPQEV R