Gene EcE24377A_1170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1170 
SymbolmdoH 
ID5589701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1188327 
End bp1190870 
Gene Length2544 bp 
Protein Length847 aa 
Translation table11 
GC content55% 
IMG OID640924870 
Productglucosyltransferase MdoH 
Protein accessionYP_001462282 
Protein GI157159340 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2943] Membrane glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.19912 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAGA CAACTGAGTA CATTGACGCA ATGCCCATCG CCGCAAGCGA GAAAGCGGCA 
TTGCCGAAGA CTGATATCCG CGCCGTTCAT CAGGCGCTGG ATGCCGAACA CCGCACCTGG
GAGCGGGAGG ATGATTCCCC GCAAGGCTCG GTAAAGGCGC GTCTGGAACA AGCCTGGCCA
GACTCACTTG CTGATGGACA GTTAATTAAA GACGACGAAG GGCGCGATCA GCTGAAGGCG
ATGCCAGAAG CAAAACGCTC CTCGATGTTT CCCGACCCGT GGCGTACCAA CCCGGTAGGC
CGTTTCTGGG ATCGCCTGCG TGGACGCGAT GTCACGCCGC GCTATCTGGC TCGTTTGACC
AAAGAAGAGC AGGAGAGCGA GCAAAAGTGG CGTACCGTCG GTACCATCCG CCGTTACATT
CTGTTGATCC TGACGCTCGC GCAAACTGTC GTCGCGACCT GGTATATGAA GACCATTCTT
CCTTATCAGG GTTGGGCGCT GATTAATCCG ATGGATATGG TTGGTCAGGA TTTGTGGGTT
TCCTTTATGC AGCTTCTGCC TTATATGCTG CAAACCGGTA TCCTGATCCT CTTTGCGGTA
CTGTTCTGTT GGGTGTCCGC CGGATTCTGG ACGGCGTTGA TGGGCTTCCT GCAACTGCTT
ATTGGTCGCG ATAAATACAG TATATCTGCG TCGACTGTTG GCGATGAGCC ATTAAACCCG
GAGCATCGCA CGGCGTTGAT CATGCCTATC TGTAATGAAG ACGTGAACCG TGTTTTTGCT
GGCCTGCGTG CAACGTGGGA ATCAGTAAAA GCCACCGGGA ATGCCAAACA CTTTGATGTC
TACATTCTGA GTGACAGTTA TAACCCGGAT ATCTGCGTCG CAGAGCAAAA AGCCTGGATG
GAGCTTATCG CTGAAGTCGG TGGCGAAGGT CAGATTTTCT ATCGCCGCCG CCGTCGCCGC
GTGAAGCGTA AAAGCGGTAA TATCGATGAC TTCTGCCGTC GCTGGGGCAG CCAGTACAGC
TACATGGTGG TGCTGGATGC TGACTCGGTA ATGACCGGTG ATTGTTTGTG CGGCCTGGTG
CGCCTGATGG AAGCCAACCC GAACGCCGGG ATCATTCAGT CGTCGCCGAA AGCGTCCGGT
ATGGATACGC TGTATGCGCG CTGTCAGCAG TTCGCGACCC GCGTGTATGG GCCACTGTTT
ACAGCCGGTT TGCACTTCTG GCAACTTGGC GAGTCGCACT ACTGGGGACA TAACGCGATT
ATCCGCGTGA AACCGTTTAT CGAGCACTGC GCACTGGCTC CGCTGCCGGG CGAAGGTTCC
TTTGCCGGTT CAATCCTGTC ACATGACTTC GTGGAAGCGG CGTTGATGCG CCGTGCAGGT
TGGGGGGTCT GGATTGCTTA CGATCTCCCG GGTTCTTATG AAGAATTGCC GCCTAACTTG
CTTGATGAGC TAAAACGTGA CCGCCGCTGG TGCCACGGTA ACCTGATGAA CTTCCGTCTG
TTCCTGGTGA AGGGTATGCA CCCGGTTCAC CGTGCGGTGT TCCTGACGGG CGTGATGTCT
TATCTCTCCG CTCCGCTGTG GTTTATGTTC CTCGCGCTCT CTACTGCATT GCAGGTAGTC
CATGCGTTGA CCGAACCGCA ATACTTCCTG CAACCACGGC AGTTGTTCCC GGTATGGCCG
CAGTGGCGTC CTGAGCTGGC GATTGCACTT TTTGCTTCGA CCATGGTGCT GTTGTTCCTG
CCGAAGTTAT TGAGCATTTT GCTTATCTGG TGCAAAGGAA CGAAAGAATA CGGCGGCTTC
TGGCGCGTTA CATTATCGTT GCTGCTGGAA GTGCTGTTTT CCGTGCTGCT GGCTCCGGTA
CGCATGCTGT TCCATACGGT CTTCGTCGTC AGCGCGTTCC TTGGCTGGGA AGTTGTGTGG
AATTCACCGC AGCGTGATGA TGACTCCACT TCCTGGGGTG AAGCGTTCAA ACGCCACGGC
TCACAGCTGC TGTTAGGGTT AGTGTGGGCT GTTGGTATGG CGTGGTTGGA TCTGCGTTTC
CTGTTCTGGC TGGCACCGAT TGTCTTCTCG TTGATCCTGT CACCGTTTGT TTCGGTGATT
TCCAGCCGTG CCACCGTTGG TCTGCGAACC AAACGCTGGA AACTGTTCCT GATCCCGGAA
GAGTATTCAC CGCCGCAGGT GCTGGTTGAT ACCGATCGCT TCCTTGAGAT GAATCGTCAA
CGCTCCCTTG ATGATGGTTT TATGCACGCG GTGTTTAACC CGTCATTTAA CGCTCTGGCA
ACCGCAATGG CGACCGCGCG TCACCGCGCC AGTAAGGTGC TGGAAATCGC CCGTGACCGC
CACGTTGAAC AGGCGCTTAA CGAGACGCCA GAGAAGCTGA ATCGCGATCG TCGCCTGGTG
CTGCTAAGCG ATCCGGTGAC GATGGCCCGT CTGCATTTCC GCGTCTGGAA TTCCCCGGAG
AGATATTCTT CATGGGTGAG TTATTACGAA GGGATAAAGC TCAATCCACT GGCATTGCGT
AAACCGGATG CGGCTTCGCA ATAA
 
Protein sequence
MNKTTEYIDA MPIAASEKAA LPKTDIRAVH QALDAEHRTW EREDDSPQGS VKARLEQAWP 
DSLADGQLIK DDEGRDQLKA MPEAKRSSMF PDPWRTNPVG RFWDRLRGRD VTPRYLARLT
KEEQESEQKW RTVGTIRRYI LLILTLAQTV VATWYMKTIL PYQGWALINP MDMVGQDLWV
SFMQLLPYML QTGILILFAV LFCWVSAGFW TALMGFLQLL IGRDKYSISA STVGDEPLNP
EHRTALIMPI CNEDVNRVFA GLRATWESVK ATGNAKHFDV YILSDSYNPD ICVAEQKAWM
ELIAEVGGEG QIFYRRRRRR VKRKSGNIDD FCRRWGSQYS YMVVLDADSV MTGDCLCGLV
RLMEANPNAG IIQSSPKASG MDTLYARCQQ FATRVYGPLF TAGLHFWQLG ESHYWGHNAI
IRVKPFIEHC ALAPLPGEGS FAGSILSHDF VEAALMRRAG WGVWIAYDLP GSYEELPPNL
LDELKRDRRW CHGNLMNFRL FLVKGMHPVH RAVFLTGVMS YLSAPLWFMF LALSTALQVV
HALTEPQYFL QPRQLFPVWP QWRPELAIAL FASTMVLLFL PKLLSILLIW CKGTKEYGGF
WRVTLSLLLE VLFSVLLAPV RMLFHTVFVV SAFLGWEVVW NSPQRDDDST SWGEAFKRHG
SQLLLGLVWA VGMAWLDLRF LFWLAPIVFS LILSPFVSVI SSRATVGLRT KRWKLFLIPE
EYSPPQVLVD TDRFLEMNRQ RSLDDGFMHA VFNPSFNALA TAMATARHRA SKVLEIARDR
HVEQALNETP EKLNRDRRLV LLSDPVTMAR LHFRVWNSPE RYSSWVSYYE GIKLNPLALR
KPDAASQ