Gene EcSMS35_2080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2080 
SymbolmdoH 
ID6142742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2093891 
End bp2096404 
Gene Length2514 bp 
Protein Length837 aa 
Translation table11 
GC content55% 
IMG OID641616956 
Productglucosyltransferase MdoH 
Protein accessionYP_001744132 
Protein GI170684144 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2943] Membrane glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.144186 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.981326 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATCG CCGCAAGCGA GAAAGCGGCA TTGCCGAAGA CTGATATCCG CGCCGTTCAT 
CAGGCGCTGG ATGCCGAACA CCGCACCTGG GCGCGGGAGG ATGACTCCCC GCAAGGCTCG
GTAAAGGCGC GTCTGGAACA AGCCTGGCCA GATTCACTTG CTGATGGACA GTTAATTAAA
GACGACGAAG GGCGCGATCA GCTTAAGGCG ATGCCAGAAG CAAAACGCTC CTCGATGTTT
CCCGACCCGT GGCGTACCAA CCCGGTAGGC CGTTTCTGGG ATCGCCTGCG TGGACGCGAT
GTAACGCCGC GCTATCTGGC TCGTTTGACC AAAGAAGAGC AGGAAAGCGA GCAAAAGTGG
CGTACCGTCG GTACTATCCG CCGTTACATT TTGTTAATCC TGACGCTCGC GCAAACTGTT
GTCGCGACCT GGTATATGAA GACCATTCTT CCTTATCAGG GGTGGGCGCT GATTAATCCT
ATGGATATGG TTGGTCAGGA TGTGTGGGTT TCCTTTATGC AGCTTCTGCC TTATATGCTG
CAAACCGGTA TCCTGATCCT CTTTGCGGTA CTGTTCTGTT GGGTGTCCGC CGGATTCTGG
ACGGCGTTGA TGGGGTTCCT GCAACTGCTT ATTGGTCGCG ATAAATACAG TATATCAGCG
TCAACAGTTG GCGATGAACC ATTAAACCCG GAGCATCGCA CGGCGTTGAT CATGCCTATC
TGTAACGAAG ACGTGAACCG TGTTTTTGCT GGCCTGCGTG CAACGTGGGA ATCAGTAAAA
GCCACCGGGA ATGCCAAACA CTTTGATGTC TACATTCTGA GTGACAGTTA TAACCCGGAT
ATCTGCGTCG CAGAGCAAAA AGCCTGGATG GAGCTTATCG CTGAAGTTGG TGGCGAAGGT
CAGATTTTCT ATCGCCGCCG CCGTCGCCGC GTGAAGCGTA AAAGCGGTAA TATCGATGAC
TTCTGCCGTC GCTGGGGCAG CCAGTACAGC TACATGGTGG TGCTGGATGC TGACTCGGTA
ATGACCGGTG ATTGTTTATG CGGCCTGGTG CGTTTGATGG AAGCCAACCC GAACGCCGGG
ATCATTCAGT CGTCGCCGAA AGCGTCCGGT ATGGATACGC TGTATGCACG CTGTCAGCAG
TTCGCGACTC GCGTGTATGG GCCACTGTTT ACAGCCGGTT TGCACTTCTG GCAACTTGGC
GAGTCGCACT ACTGGGGACA TAACGCGATT ATCCGCGTGA AACCGTTTAT CGAGCACTGC
GCACTGGCTC CGCTGCCGGG CGAAGGTTCC TTTGCCGGTT CAATCCTGTC ACATGACTTC
GTGGAAGCGG CGTTGATGCG CCGTGCAGGT TGGGGGGTGT GGATTGCTTA CGATCTCCCG
GGTTCTTATG AAGAATTGCC GCCTAACTTG CTTGATGAGC TAAAACGTGA CCGCCGCTGG
TGCCACGGTA ACCTGATGAA CTTCCGTCTG TTCCTGGTTA AGGGTATGCA CCCGGTTCAC
CGTGCGGTGT TCCTGACGGG CGTGATGTCT TATCTCTCCG CTCCGTTGTG GTTTATGTTC
CTCGCGCTCT CTACTGCATT GCAGGTAGTG CATGCGTTGA CCGAACCGCA ATACTTCCTG
CAACCACGGC AGTTATTCCC GGTGTGGCCG CAGTGGCGTC CTGAGCTGGC GATTGCACTT
TTTGCTTCGA CCATGGTGCT GTTGTTCCTG CCGAAGTTAT TGAGCATTTT GCTTATCTGG
TGCAAAGGAA CGAAAGAATA CGGCGGCTTC TGGCGCGTTA CATTATCGTT GCTGCTGGAA
GTGCTGTTTT CCGTGCTGCT GGCTCCGGTA CGCATGCTGT TCCATACGGT CTTCGTCGTC
AGCGCGTTCC TTGGCTGGGA AGTGGTGTGG AATTCACCGC AGCGTGATGA TGACTCCACT
TCCTGGGGTG AAGCGTTCAA ACGCCACGGC TCACAGCTGC TGTTAGGGTT AGTGTGGGCT
GTTGGGATGG CGTGGTTGGA TCTGCGTTTC CTGTTCTGGC TGGCACCGAT TGTCTTCTCG
TTGATCCTGT CACCGTTTGT TTCGGTGATT TCCAGCCGTG CCACCGTTGG TCTGCGCACC
AAACGCTGGA AACTGTTCCT GATCCCGGAA GAGTATTCAC CGCCGCAGGT GCTGGTTGAT
ACCGATCGGT TCCTTGAGAT GAATCGCCAA CGCTCCCTTG ATGATGGTTT TATGCACGCA
GTGTTTAACC CGTCATTCAA CGCTCTGGCA ACCGCAATGG CGACCGCGCG TCACCGCGCC
AGTAAGGTGC TGGAAATCGC CCGTGACCGC CACGTTGAAC AGGCGCTGAA CGAGACGCCA
GAGAAGCTGA ATCGCGATCG TCGCCTGGTG CTGCTAAGCG ATCCGGTGAC GATGGCCCGT
CTGCATTTCC GCGTCTGGAA TTCCCCTGAG AGATATTCTT CATGGGTGAG TTATTACGAA
GGGATAAAGC TCAATCCACT GGCATTGCGT AAACCGGATG CGGCTTCGCA ATAA
 
Protein sequence
MPIAASEKAA LPKTDIRAVH QALDAEHRTW AREDDSPQGS VKARLEQAWP DSLADGQLIK 
DDEGRDQLKA MPEAKRSSMF PDPWRTNPVG RFWDRLRGRD VTPRYLARLT KEEQESEQKW
RTVGTIRRYI LLILTLAQTV VATWYMKTIL PYQGWALINP MDMVGQDVWV SFMQLLPYML
QTGILILFAV LFCWVSAGFW TALMGFLQLL IGRDKYSISA STVGDEPLNP EHRTALIMPI
CNEDVNRVFA GLRATWESVK ATGNAKHFDV YILSDSYNPD ICVAEQKAWM ELIAEVGGEG
QIFYRRRRRR VKRKSGNIDD FCRRWGSQYS YMVVLDADSV MTGDCLCGLV RLMEANPNAG
IIQSSPKASG MDTLYARCQQ FATRVYGPLF TAGLHFWQLG ESHYWGHNAI IRVKPFIEHC
ALAPLPGEGS FAGSILSHDF VEAALMRRAG WGVWIAYDLP GSYEELPPNL LDELKRDRRW
CHGNLMNFRL FLVKGMHPVH RAVFLTGVMS YLSAPLWFMF LALSTALQVV HALTEPQYFL
QPRQLFPVWP QWRPELAIAL FASTMVLLFL PKLLSILLIW CKGTKEYGGF WRVTLSLLLE
VLFSVLLAPV RMLFHTVFVV SAFLGWEVVW NSPQRDDDST SWGEAFKRHG SQLLLGLVWA
VGMAWLDLRF LFWLAPIVFS LILSPFVSVI SSRATVGLRT KRWKLFLIPE EYSPPQVLVD
TDRFLEMNRQ RSLDDGFMHA VFNPSFNALA TAMATARHRA SKVLEIARDR HVEQALNETP
EKLNRDRRLV LLSDPVTMAR LHFRVWNSPE RYSSWVSYYE GIKLNPLALR KPDAASQ