Gene ECH74115_1429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1429 
SymbolmdoH 
ID6969476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1413620 
End bp1416133 
Gene Length2514 bp 
Protein Length837 aa 
Translation table11 
GC content55% 
IMG OID643385403 
Productglucosyltransferase MdoH 
Protein accessionYP_002269897 
Protein GI209399078 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2943] Membrane glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00571263 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.261445 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATCG CCGCAAGCGA GAAAGCGGCA TTGCCGAAGA CTGATATCCG CGCCGTTCAT 
CAGGCGCTGG ATGCCGAACA CCGCACCTGG GCGCGGGAGG ATGACTCCCC GCAAGGCTCG
GTAAAGGCGC GTCTGGAACA AGCCTGGCCA GATTCACTTG CTGATGGACA GTTAATTAAA
GATGACGAAG GGCGCGATCA GCTTAAGGCG ATGCCAGAAG CAAAACGCTC CTCGATGTTT
CCCGACCCGT GGCGTACCAA CCCGGTAGGC CGTTTCTGGG ATCGCCTGCG TGGACGCGAT
GTAACACCGC GCTATCTGGC TCGTTTGACC AAAGAAGAGC AGGAGAGCGA GCAAAAGTGG
CGTACCGTCG GTACCATCCG CCGTTACATT CTGTTGATCC TGACGCTCGC GCAAACTGTC
GTCGCGACCT GGTATATGAA GACCATTCTT CCTTATCAGG GTTGGGCGCT GATTAATCCT
ATGGATATGG TTGGTCAGGA TTTGTGGGTT TCCTTTATGC AGCTTCTGCC TTATATGCTG
CAAACCGGTA TCCTGATCCT CTTTGCGGTA CTGTTCTGTT GGGTGTCCGC CGGATTCTGG
ACGGCGTTAA TGGGCTTCCT GCAACTACTT ATTGGTCGCG ATAAATACAG TATATCTGCG
TCAACAGTTG GCGATGAACC ATTAAACCCG GAGCATCGCA CGGCGTTGAT CATGCCTATC
TGTAACGAAG ACGTGAACCG TGTTTTTGCT GGCTTGCGTG CAACGTGGGA ATCAGTAAAA
GCCACCGGGA ATGCCAAACA CTTTGATGTC TACATTCTGA GTGACAGTTA CAACCCGGAT
ATCTGCGTCG CAGAGCAAAA AGCCTGGATG GAGCTTATCG CTGAAGTCGG TGGCGAAGGT
CAGATTTTCT ATCGCCGCCG CCGCCGTCGC GTGAAGCGTA AAAGCGGTAA TATCGATGAC
TTCTGCCGTC GCTGGGGCAG CCAGTACAGC TATATGGTGG TGCTGGATGC TGACTCGGTA
ATGACCGGTG ATTGTTTGTG CGGCCTGGTG CGCCTGATGG AAGCCAACCC GAACGCCGGG
ATCATTCAGT CGTCGCCGAA AGCGTCCGGT ATGGATACGC TGTATGCACG CTGTCAGCAG
TTCGCGACCC GCGTGTATGG GCCACTGTTT ACAGCCGGTT TGCACTTCTG GCAACTTGGC
GAGTCGCACT ACTGGGGGCA TAACGCGATT ATCCGCGTGA AACCGTTTAT CGAGCACTGT
GCACTGGCCC CGCTGCCGGG CGAAGGTTCC TTTGCCGGTT CAATCCTGTC ACATGACTTC
GTGGAAGCGG CGTTGATGCG CCGTGCAGGT TGGGGAGTCT GGATTGCTTA CGATCTCCCG
GGTTCTTATG AAGAATTACC GCCTAACTTG CTTGATGAGC TAAAACGTGA CCGCCGCTGG
TGCCACGGTA ACCTGATGAA CTTCCGTCTG TTCCTGGTGA AGGGTATGCA CCCGGTTCAC
CGTGCGGTAT TCCTGACGGG CGTGATGTCT TATCTCTCCG CTCCGCTGTG GTTTATGTTC
CTCGCGCTCT CTACTGCATT GCAGGTAGTG CATGCGTTGA CCGAACCGCA ATACTTCCTG
CAACCACGGC AGTTGTTCCC GGTGTGGCCG CAGTGGCGTC CTGAGCTGGC GATTGCACTT
TTTGCTTCGA CCATGGTGCT GTTGTTCCTG CCAAAGCTAT TGAGCATTTT GCTTATTTGG
TGTAAAGGAA CGAAAGAATA TGGCGGCTTC TGGCGCGTTA CATTATCGTT GCTGCTGGAA
GTGCTGTTTT CCGTGCTGCT GGCTCCGGTA CGCATGCTGT TCCATACGGT CTTCGTCGTC
AGCGCGTTCC TTGGCTGGGA AGTGGTGTGG AATTCACCGC AGCGTGATGA TGACTCCACT
TCCTGGGGTG AAGCGTTCAA ACGCCACGGC TCACAGCTGC TGTTAGGGTT AGTGTGGGCT
GTCGGGATGG CGTGGTTGGA TCTGCGTTTC CTGTTCTGGC TGGCACCGAT TGTCTTCTCG
TTGATCCTGT CACCGTTTGT TTCGGTGATT TCCAGCCGTG CCACCGTTGG TCTGCGCACC
AAACGCTGGA AACTGTTCCT GATCCCGGAA GAGTATTCAC CGCCGCAGGT ACTGGTTGAT
ACCGATCGGT TCCTTGAGAT GAACCGTCAA CGCTCCCTTG ATGATGGTTT TATGCACGCA
GTGTTTAACC CGTCATTTAA CGCTCTGGCA ACCGCAATGG CGACCGCGCG TCACCGCGCC
AGTAAGGTGC TGGAAATCGC CCGTGACCGC CATGTTGAAC AGGCGCTGAA CGAGACGCCA
GAGAAGCTGA ATCGCGATCG TCGCCTTGTG CTGCTAAGCG ATCCGGTGAC GATGGCCCGT
CTGCATTTCC GCGTCTGGAA TTCCCCTGAG AGATATTCTT CATGGGTGAG TTATTACGAA
GGGATAAAGC TCAATCCACT GGCATTGCGT AAACCGGATG CGGCTTCGCA ATAA
 
Protein sequence
MPIAASEKAA LPKTDIRAVH QALDAEHRTW AREDDSPQGS VKARLEQAWP DSLADGQLIK 
DDEGRDQLKA MPEAKRSSMF PDPWRTNPVG RFWDRLRGRD VTPRYLARLT KEEQESEQKW
RTVGTIRRYI LLILTLAQTV VATWYMKTIL PYQGWALINP MDMVGQDLWV SFMQLLPYML
QTGILILFAV LFCWVSAGFW TALMGFLQLL IGRDKYSISA STVGDEPLNP EHRTALIMPI
CNEDVNRVFA GLRATWESVK ATGNAKHFDV YILSDSYNPD ICVAEQKAWM ELIAEVGGEG
QIFYRRRRRR VKRKSGNIDD FCRRWGSQYS YMVVLDADSV MTGDCLCGLV RLMEANPNAG
IIQSSPKASG MDTLYARCQQ FATRVYGPLF TAGLHFWQLG ESHYWGHNAI IRVKPFIEHC
ALAPLPGEGS FAGSILSHDF VEAALMRRAG WGVWIAYDLP GSYEELPPNL LDELKRDRRW
CHGNLMNFRL FLVKGMHPVH RAVFLTGVMS YLSAPLWFMF LALSTALQVV HALTEPQYFL
QPRQLFPVWP QWRPELAIAL FASTMVLLFL PKLLSILLIW CKGTKEYGGF WRVTLSLLLE
VLFSVLLAPV RMLFHTVFVV SAFLGWEVVW NSPQRDDDST SWGEAFKRHG SQLLLGLVWA
VGMAWLDLRF LFWLAPIVFS LILSPFVSVI SSRATVGLRT KRWKLFLIPE EYSPPQVLVD
TDRFLEMNRQ RSLDDGFMHA VFNPSFNALA TAMATARHRA SKVLEIARDR HVEQALNETP
EKLNRDRRLV LLSDPVTMAR LHFRVWNSPE RYSSWVSYYE GIKLNPLALR KPDAASQ