Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2080 |
Symbol | mdoH |
ID | 6142742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2093891 |
End bp | 2096404 |
Gene Length | 2514 bp |
Protein Length | 837 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641616956 |
Product | glucosyltransferase MdoH |
Protein accession | YP_001744132 |
Protein GI | 170684144 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2943] Membrane glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.144186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.981326 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCATCG CCGCAAGCGA GAAAGCGGCA TTGCCGAAGA CTGATATCCG CGCCGTTCAT CAGGCGCTGG ATGCCGAACA CCGCACCTGG GCGCGGGAGG ATGACTCCCC GCAAGGCTCG GTAAAGGCGC GTCTGGAACA AGCCTGGCCA GATTCACTTG CTGATGGACA GTTAATTAAA GACGACGAAG GGCGCGATCA GCTTAAGGCG ATGCCAGAAG CAAAACGCTC CTCGATGTTT CCCGACCCGT GGCGTACCAA CCCGGTAGGC CGTTTCTGGG ATCGCCTGCG TGGACGCGAT GTAACGCCGC GCTATCTGGC TCGTTTGACC AAAGAAGAGC AGGAAAGCGA GCAAAAGTGG CGTACCGTCG GTACTATCCG CCGTTACATT TTGTTAATCC TGACGCTCGC GCAAACTGTT GTCGCGACCT GGTATATGAA GACCATTCTT CCTTATCAGG GGTGGGCGCT GATTAATCCT ATGGATATGG TTGGTCAGGA TGTGTGGGTT TCCTTTATGC AGCTTCTGCC TTATATGCTG CAAACCGGTA TCCTGATCCT CTTTGCGGTA CTGTTCTGTT GGGTGTCCGC CGGATTCTGG ACGGCGTTGA TGGGGTTCCT GCAACTGCTT ATTGGTCGCG ATAAATACAG TATATCAGCG TCAACAGTTG GCGATGAACC ATTAAACCCG GAGCATCGCA CGGCGTTGAT CATGCCTATC TGTAACGAAG ACGTGAACCG TGTTTTTGCT GGCCTGCGTG CAACGTGGGA ATCAGTAAAA GCCACCGGGA ATGCCAAACA CTTTGATGTC TACATTCTGA GTGACAGTTA TAACCCGGAT ATCTGCGTCG CAGAGCAAAA AGCCTGGATG GAGCTTATCG CTGAAGTTGG TGGCGAAGGT CAGATTTTCT ATCGCCGCCG CCGTCGCCGC GTGAAGCGTA AAAGCGGTAA TATCGATGAC TTCTGCCGTC GCTGGGGCAG CCAGTACAGC TACATGGTGG TGCTGGATGC TGACTCGGTA ATGACCGGTG ATTGTTTATG CGGCCTGGTG CGTTTGATGG AAGCCAACCC GAACGCCGGG ATCATTCAGT CGTCGCCGAA AGCGTCCGGT ATGGATACGC TGTATGCACG CTGTCAGCAG TTCGCGACTC GCGTGTATGG GCCACTGTTT ACAGCCGGTT TGCACTTCTG GCAACTTGGC GAGTCGCACT ACTGGGGACA TAACGCGATT ATCCGCGTGA AACCGTTTAT CGAGCACTGC GCACTGGCTC CGCTGCCGGG CGAAGGTTCC TTTGCCGGTT CAATCCTGTC ACATGACTTC GTGGAAGCGG CGTTGATGCG CCGTGCAGGT TGGGGGGTGT GGATTGCTTA CGATCTCCCG GGTTCTTATG AAGAATTGCC GCCTAACTTG CTTGATGAGC TAAAACGTGA CCGCCGCTGG TGCCACGGTA ACCTGATGAA CTTCCGTCTG TTCCTGGTTA AGGGTATGCA CCCGGTTCAC CGTGCGGTGT TCCTGACGGG CGTGATGTCT TATCTCTCCG CTCCGTTGTG GTTTATGTTC CTCGCGCTCT CTACTGCATT GCAGGTAGTG CATGCGTTGA CCGAACCGCA ATACTTCCTG CAACCACGGC AGTTATTCCC GGTGTGGCCG CAGTGGCGTC CTGAGCTGGC GATTGCACTT TTTGCTTCGA CCATGGTGCT GTTGTTCCTG CCGAAGTTAT TGAGCATTTT GCTTATCTGG TGCAAAGGAA CGAAAGAATA CGGCGGCTTC TGGCGCGTTA CATTATCGTT GCTGCTGGAA GTGCTGTTTT CCGTGCTGCT GGCTCCGGTA CGCATGCTGT TCCATACGGT CTTCGTCGTC AGCGCGTTCC TTGGCTGGGA AGTGGTGTGG AATTCACCGC AGCGTGATGA TGACTCCACT TCCTGGGGTG AAGCGTTCAA ACGCCACGGC TCACAGCTGC TGTTAGGGTT AGTGTGGGCT GTTGGGATGG CGTGGTTGGA TCTGCGTTTC CTGTTCTGGC TGGCACCGAT TGTCTTCTCG TTGATCCTGT CACCGTTTGT TTCGGTGATT TCCAGCCGTG CCACCGTTGG TCTGCGCACC AAACGCTGGA AACTGTTCCT GATCCCGGAA GAGTATTCAC CGCCGCAGGT GCTGGTTGAT ACCGATCGGT TCCTTGAGAT GAATCGCCAA CGCTCCCTTG ATGATGGTTT TATGCACGCA GTGTTTAACC CGTCATTCAA CGCTCTGGCA ACCGCAATGG CGACCGCGCG TCACCGCGCC AGTAAGGTGC TGGAAATCGC CCGTGACCGC CACGTTGAAC AGGCGCTGAA CGAGACGCCA GAGAAGCTGA ATCGCGATCG TCGCCTGGTG CTGCTAAGCG ATCCGGTGAC GATGGCCCGT CTGCATTTCC GCGTCTGGAA TTCCCCTGAG AGATATTCTT CATGGGTGAG TTATTACGAA GGGATAAAGC TCAATCCACT GGCATTGCGT AAACCGGATG CGGCTTCGCA ATAA
|
Protein sequence | MPIAASEKAA LPKTDIRAVH QALDAEHRTW AREDDSPQGS VKARLEQAWP DSLADGQLIK DDEGRDQLKA MPEAKRSSMF PDPWRTNPVG RFWDRLRGRD VTPRYLARLT KEEQESEQKW RTVGTIRRYI LLILTLAQTV VATWYMKTIL PYQGWALINP MDMVGQDVWV SFMQLLPYML QTGILILFAV LFCWVSAGFW TALMGFLQLL IGRDKYSISA STVGDEPLNP EHRTALIMPI CNEDVNRVFA GLRATWESVK ATGNAKHFDV YILSDSYNPD ICVAEQKAWM ELIAEVGGEG QIFYRRRRRR VKRKSGNIDD FCRRWGSQYS YMVVLDADSV MTGDCLCGLV RLMEANPNAG IIQSSPKASG MDTLYARCQQ FATRVYGPLF TAGLHFWQLG ESHYWGHNAI IRVKPFIEHC ALAPLPGEGS FAGSILSHDF VEAALMRRAG WGVWIAYDLP GSYEELPPNL LDELKRDRRW CHGNLMNFRL FLVKGMHPVH RAVFLTGVMS YLSAPLWFMF LALSTALQVV HALTEPQYFL QPRQLFPVWP QWRPELAIAL FASTMVLLFL PKLLSILLIW CKGTKEYGGF WRVTLSLLLE VLFSVLLAPV RMLFHTVFVV SAFLGWEVVW NSPQRDDDST SWGEAFKRHG SQLLLGLVWA VGMAWLDLRF LFWLAPIVFS LILSPFVSVI SSRATVGLRT KRWKLFLIPE EYSPPQVLVD TDRFLEMNRQ RSLDDGFMHA VFNPSFNALA TAMATARHRA SKVLEIARDR HVEQALNETP EKLNRDRRLV LLSDPVTMAR LHFRVWNSPE RYSSWVSYYE GIKLNPLALR KPDAASQ
|
| |