Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2281 |
Symbol | mdoH |
ID | 6272054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2072468 |
End bp | 2075011 |
Gene Length | 2544 bp |
Protein Length | 847 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641726295 |
Product | glucosyltransferase MdoH |
Protein accession | YP_001880779 |
Protein GI | 187730418 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2943] Membrane glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0709206 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAGA CAACTGAGTA CATTGACGCA ATGCCCATCG CCGCAAGCGA GAAAGCGGCA TTGCCGAAGA GTGATATCCG CGCCGTACAT CAGGCGCTGG ATGCCGAACA CCGCACCTGG GCGCGGGAGG ATGATTCCCC GCAAGGCTCG GTAAAGGCCC GTCTGGAACA AGCCTGGCCA GATTCACTTG CTGATGGACA GTTAATTAAA GACGACGAAG GGCGCGATCA GCTGAAGGCG ATGCCAGAAG CAAAACGCTC CTCGATGTTT CCCGACCCGT GGCGTACCAA CCCGGTAGGC CGTTTCTGGG ATCGCCTGCG TGGACGCGAT GTCACGCCGC GCTATCTGGC TCGTTTGACC AAAGAAGAGC AGGAGAGCGA GCAAAAGTGG CGTACCGTCG GTACCATCCG CCGTTACATT CTGTTGATCC TGACGCTCGC GCAAACTGTC GTCGCGACCT GGTATATGAA GACCATTCTT CCTTATCAGG GTTGGGCGCT GATTAATCCT ATGGATATGG TTGGTCAGGA TTTGTGGGTT TCCTTTATGC AGCTTCTGCC TTATATGCTG CAAACCGGTA TCCTGATCCT CTTTGCGGTA CTGTTCTGTT GGGTGTCCGC CGGATTCTGG ACGGCGTTAA TGGGCTTCCT GCAACTGCTT ATTGGTCGCG ATAAATACAG TATATCTGCG TCAACAGTTG GCGATGAACC ATTAAACCCG GAGCATCGCA CGGCGTTGAT CATGCCTATC TGTAACGAAG ACGTGAACCG TGTTTTTGCT GGCCTGCGTG CAACGTGGGA ATCAGTAAAA GCCACCGGGA ATGCCAAACA CTTTGATGTC TACATTCTTA GTGACAGTTA TAACCCGGAT ATCTGCGTCG CAGAGCAAAA AGCCTGGATG GAGCTTATCG CTGAAGTCGG TGGCGAAGGT CAGATTTTCT ATCGCCGCCG CCGTCGCCGC GTGAAGCGTA AAAGCGGTAA TATCGATGAC TTCTGCCGTC GCTGGGGCAG CCAGTACAGC TACATGGTGG TGCTGGATGC TGACTCGGTA ATGACCGGTG ATTGTTTGTG CGGGCTGGTG CGCCTGATGG AAGCCAACCC GAACGCCGGG ATCATTCAGT CGTCGCCGAA AGCGTCCGGT ATGGATACGC TGTATGCGCG CTGTCAGCAG TTCGCGACCC GCGTGTATGG GCCACTATTT ACAGCCGGTT TGCACTTCTG GCAACTTGGC GAGTCGCACT ACTGGGGACA TAACGCGATT ATCCGCGTGA AACCGTTTAT CGAGCACTGC GCACTGGCTC CGCTGCCGGG CGAAGGTTCC TTTGCCGGTT CAATCCTGTC ACATGACTTT GTAGAAGCGG CGTTGATGCG CCGTGCAGGT TGGGGGGTCT GGATTGCTTA CGATCTCCCG GGTTCTTATG AAGAATTACC GCCTAACTTG CTTGATGAGC TAAAACGTGA CCGCCGCTGG TGCCACGGTA ACCTGATGAA CTTCCGTCTG TTCCTGGTGA AGGGTATGCA CCCGGTTCAC CGTGCGGTGT TCCTGACGGG CGTGATGTCT TATCTCTCCG CTCCGCTGTG GTTTATGTTC CTCGCGCTCT CTACTGCATT GCAGGTAGTA CATGCGTTGA CCGAACCGCA ATACTTCCTG CAACCACGGC AGTTGTTCCC GGTGTGGCCG CAGTGGCGTC CTGAGCTGGC GATTGCACTT TTTGCTTCGA CTATGGTGCT GTTGTTCCTG CCGAAGTTAT TGAGCATTTT GCTTATCTGG TGCAAAGGAA CGAAAGAATA CGGCGGCTTC TGGCGCGTTA CATTATCGTT GCTACTGGAA GTGCTTTTTT CCGTGCTGCT GGCTCCGGTA CGCATGCTGT TCCATACGGT CTTCGTTGTC AGCGCGTTCC TTGGCTGGGA AGTGGTGTGG AATTCACCGC AGCGTGATGA TGACTCCACT TCCTGGGGTG AAGCGTTCAA ACGCCACGGC TCACAGCTGC TGTTAGGGTT AGTGTGGGCT GTTGGGATGG CGTGGCTGGA TCTGCGTTTC CTGTTCTGGC TGGCACCGAT TGTCTTCTCG TTGATCCTGT CACCGTTTGT TTCGGTGATT TCCAGCCGTG CCACCGTTGG TCTGCGAACC AAACGCTGGA AACTGTTCCT GATCCCGGAA GAGTATTCAC CGCCGCAGGT GCTGGTTGAT ACCGATCGCT TCCTTGAGAT GAATCGTCAA CGCTCCCTTG ATGATGGTTT TATGCACGCG GTGTTTAACC CGTCATTTAA CGCTCTGGCA ACCGCAATGG CGACCGCGCG TCACCGCGCC AGTAAGGTGC TGGAAATCGC CCGTGACCGC CACGTTGAAC AGGCGCTTAA CGAGACGCCA GAGAAGCTGA ATCGCGATCG TCGCCTGGTG CTGCTAAGCG ATCCGGTGAC GATGGCCCGT CTGCATTTCC GTGTCTGGAA TTCCCCGGAG AGATATTCTT CATGGGTGAG TTATTACGAA GGGATAAAGC TCAATCCACT GGCATTGCGT AAACCGGATG CGGCTTCGCA ATAA
|
Protein sequence | MNKTTEYIDA MPIAASEKAA LPKSDIRAVH QALDAEHRTW AREDDSPQGS VKARLEQAWP DSLADGQLIK DDEGRDQLKA MPEAKRSSMF PDPWRTNPVG RFWDRLRGRD VTPRYLARLT KEEQESEQKW RTVGTIRRYI LLILTLAQTV VATWYMKTIL PYQGWALINP MDMVGQDLWV SFMQLLPYML QTGILILFAV LFCWVSAGFW TALMGFLQLL IGRDKYSISA STVGDEPLNP EHRTALIMPI CNEDVNRVFA GLRATWESVK ATGNAKHFDV YILSDSYNPD ICVAEQKAWM ELIAEVGGEG QIFYRRRRRR VKRKSGNIDD FCRRWGSQYS YMVVLDADSV MTGDCLCGLV RLMEANPNAG IIQSSPKASG MDTLYARCQQ FATRVYGPLF TAGLHFWQLG ESHYWGHNAI IRVKPFIEHC ALAPLPGEGS FAGSILSHDF VEAALMRRAG WGVWIAYDLP GSYEELPPNL LDELKRDRRW CHGNLMNFRL FLVKGMHPVH RAVFLTGVMS YLSAPLWFMF LALSTALQVV HALTEPQYFL QPRQLFPVWP QWRPELAIAL FASTMVLLFL PKLLSILLIW CKGTKEYGGF WRVTLSLLLE VLFSVLLAPV RMLFHTVFVV SAFLGWEVVW NSPQRDDDST SWGEAFKRHG SQLLLGLVWA VGMAWLDLRF LFWLAPIVFS LILSPFVSVI SSRATVGLRT KRWKLFLIPE EYSPPQVLVD TDRFLEMNRQ RSLDDGFMHA VFNPSFNALA TAMATARHRA SKVLEIARDR HVEQALNETP EKLNRDRRLV LLSDPVTMAR LHFRVWNSPE RYSSWVSYYE GIKLNPLALR KPDAASQ
|
| |