Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1429 |
Symbol | mdoH |
ID | 6969476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1413620 |
End bp | 1416133 |
Gene Length | 2514 bp |
Protein Length | 837 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643385403 |
Product | glucosyltransferase MdoH |
Protein accession | YP_002269897 |
Protein GI | 209399078 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2943] Membrane glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00571263 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.261445 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCATCG CCGCAAGCGA GAAAGCGGCA TTGCCGAAGA CTGATATCCG CGCCGTTCAT CAGGCGCTGG ATGCCGAACA CCGCACCTGG GCGCGGGAGG ATGACTCCCC GCAAGGCTCG GTAAAGGCGC GTCTGGAACA AGCCTGGCCA GATTCACTTG CTGATGGACA GTTAATTAAA GATGACGAAG GGCGCGATCA GCTTAAGGCG ATGCCAGAAG CAAAACGCTC CTCGATGTTT CCCGACCCGT GGCGTACCAA CCCGGTAGGC CGTTTCTGGG ATCGCCTGCG TGGACGCGAT GTAACACCGC GCTATCTGGC TCGTTTGACC AAAGAAGAGC AGGAGAGCGA GCAAAAGTGG CGTACCGTCG GTACCATCCG CCGTTACATT CTGTTGATCC TGACGCTCGC GCAAACTGTC GTCGCGACCT GGTATATGAA GACCATTCTT CCTTATCAGG GTTGGGCGCT GATTAATCCT ATGGATATGG TTGGTCAGGA TTTGTGGGTT TCCTTTATGC AGCTTCTGCC TTATATGCTG CAAACCGGTA TCCTGATCCT CTTTGCGGTA CTGTTCTGTT GGGTGTCCGC CGGATTCTGG ACGGCGTTAA TGGGCTTCCT GCAACTACTT ATTGGTCGCG ATAAATACAG TATATCTGCG TCAACAGTTG GCGATGAACC ATTAAACCCG GAGCATCGCA CGGCGTTGAT CATGCCTATC TGTAACGAAG ACGTGAACCG TGTTTTTGCT GGCTTGCGTG CAACGTGGGA ATCAGTAAAA GCCACCGGGA ATGCCAAACA CTTTGATGTC TACATTCTGA GTGACAGTTA CAACCCGGAT ATCTGCGTCG CAGAGCAAAA AGCCTGGATG GAGCTTATCG CTGAAGTCGG TGGCGAAGGT CAGATTTTCT ATCGCCGCCG CCGCCGTCGC GTGAAGCGTA AAAGCGGTAA TATCGATGAC TTCTGCCGTC GCTGGGGCAG CCAGTACAGC TATATGGTGG TGCTGGATGC TGACTCGGTA ATGACCGGTG ATTGTTTGTG CGGCCTGGTG CGCCTGATGG AAGCCAACCC GAACGCCGGG ATCATTCAGT CGTCGCCGAA AGCGTCCGGT ATGGATACGC TGTATGCACG CTGTCAGCAG TTCGCGACCC GCGTGTATGG GCCACTGTTT ACAGCCGGTT TGCACTTCTG GCAACTTGGC GAGTCGCACT ACTGGGGGCA TAACGCGATT ATCCGCGTGA AACCGTTTAT CGAGCACTGT GCACTGGCCC CGCTGCCGGG CGAAGGTTCC TTTGCCGGTT CAATCCTGTC ACATGACTTC GTGGAAGCGG CGTTGATGCG CCGTGCAGGT TGGGGAGTCT GGATTGCTTA CGATCTCCCG GGTTCTTATG AAGAATTACC GCCTAACTTG CTTGATGAGC TAAAACGTGA CCGCCGCTGG TGCCACGGTA ACCTGATGAA CTTCCGTCTG TTCCTGGTGA AGGGTATGCA CCCGGTTCAC CGTGCGGTAT TCCTGACGGG CGTGATGTCT TATCTCTCCG CTCCGCTGTG GTTTATGTTC CTCGCGCTCT CTACTGCATT GCAGGTAGTG CATGCGTTGA CCGAACCGCA ATACTTCCTG CAACCACGGC AGTTGTTCCC GGTGTGGCCG CAGTGGCGTC CTGAGCTGGC GATTGCACTT TTTGCTTCGA CCATGGTGCT GTTGTTCCTG CCAAAGCTAT TGAGCATTTT GCTTATTTGG TGTAAAGGAA CGAAAGAATA TGGCGGCTTC TGGCGCGTTA CATTATCGTT GCTGCTGGAA GTGCTGTTTT CCGTGCTGCT GGCTCCGGTA CGCATGCTGT TCCATACGGT CTTCGTCGTC AGCGCGTTCC TTGGCTGGGA AGTGGTGTGG AATTCACCGC AGCGTGATGA TGACTCCACT TCCTGGGGTG AAGCGTTCAA ACGCCACGGC TCACAGCTGC TGTTAGGGTT AGTGTGGGCT GTCGGGATGG CGTGGTTGGA TCTGCGTTTC CTGTTCTGGC TGGCACCGAT TGTCTTCTCG TTGATCCTGT CACCGTTTGT TTCGGTGATT TCCAGCCGTG CCACCGTTGG TCTGCGCACC AAACGCTGGA AACTGTTCCT GATCCCGGAA GAGTATTCAC CGCCGCAGGT ACTGGTTGAT ACCGATCGGT TCCTTGAGAT GAACCGTCAA CGCTCCCTTG ATGATGGTTT TATGCACGCA GTGTTTAACC CGTCATTTAA CGCTCTGGCA ACCGCAATGG CGACCGCGCG TCACCGCGCC AGTAAGGTGC TGGAAATCGC CCGTGACCGC CATGTTGAAC AGGCGCTGAA CGAGACGCCA GAGAAGCTGA ATCGCGATCG TCGCCTTGTG CTGCTAAGCG ATCCGGTGAC GATGGCCCGT CTGCATTTCC GCGTCTGGAA TTCCCCTGAG AGATATTCTT CATGGGTGAG TTATTACGAA GGGATAAAGC TCAATCCACT GGCATTGCGT AAACCGGATG CGGCTTCGCA ATAA
|
Protein sequence | MPIAASEKAA LPKTDIRAVH QALDAEHRTW AREDDSPQGS VKARLEQAWP DSLADGQLIK DDEGRDQLKA MPEAKRSSMF PDPWRTNPVG RFWDRLRGRD VTPRYLARLT KEEQESEQKW RTVGTIRRYI LLILTLAQTV VATWYMKTIL PYQGWALINP MDMVGQDLWV SFMQLLPYML QTGILILFAV LFCWVSAGFW TALMGFLQLL IGRDKYSISA STVGDEPLNP EHRTALIMPI CNEDVNRVFA GLRATWESVK ATGNAKHFDV YILSDSYNPD ICVAEQKAWM ELIAEVGGEG QIFYRRRRRR VKRKSGNIDD FCRRWGSQYS YMVVLDADSV MTGDCLCGLV RLMEANPNAG IIQSSPKASG MDTLYARCQQ FATRVYGPLF TAGLHFWQLG ESHYWGHNAI IRVKPFIEHC ALAPLPGEGS FAGSILSHDF VEAALMRRAG WGVWIAYDLP GSYEELPPNL LDELKRDRRW CHGNLMNFRL FLVKGMHPVH RAVFLTGVMS YLSAPLWFMF LALSTALQVV HALTEPQYFL QPRQLFPVWP QWRPELAIAL FASTMVLLFL PKLLSILLIW CKGTKEYGGF WRVTLSLLLE VLFSVLLAPV RMLFHTVFVV SAFLGWEVVW NSPQRDDDST SWGEAFKRHG SQLLLGLVWA VGMAWLDLRF LFWLAPIVFS LILSPFVSVI SSRATVGLRT KRWKLFLIPE EYSPPQVLVD TDRFLEMNRQ RSLDDGFMHA VFNPSFNALA TAMATARHRA SKVLEIARDR HVEQALNETP EKLNRDRRLV LLSDPVTMAR LHFRVWNSPE RYSSWVSYYE GIKLNPLALR KPDAASQ
|
| |