Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECD_01046 |
Symbol | mdoH |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | + |
Start bp | 1113430 |
End bp | 1115973 |
Gene Length | 2544 bp |
Protein Length | 847 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | glucosyltransferase MdoH |
Protein accession | ACT42941 |
Protein GI | 253977271 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00471398 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAGA CAACTGAGTA CATTGACGCA ATGCCCATCG CCGCAAGCGA GAAAGCGGCA TTGCCGAAGA CTGATATCCG CGCCGTTCAT CAGGCGCTGG ATGCCGAACA CCGCACCTGG GCGCGGGAGG ATGATTCCCC GCAAGGCTCG GTAAAGGCGC GTCTGGAACA AGCCTGGCCA GATTCACTTG CTGATGGACA GTTAATTAAA GACGACGAAG GGCGCGATCA GCTGAAGGCG ATGCCAGAAG CAAAACGCTC CTCGATGTTT CCCGACCCGT GGCGTACCAA CCCGGTAGGC CGTTTCTGGG ATCGCCTGCG TGGACGCGAT GTCACGCCGC GCTATCTGGC TCGTTTGACC AAAGAAGAGC AGGAGAGCGA GCAAAAGTGG CGTACCGTCG GTACCATCCG CCGTTACATT CTGTTGATCC TGACGCTCGC GCAAACTGTC GTCGCGACCT GGTATATGAA GACCATTCTT CCTTATCAGG GTTGGGCGCT GATTAATCCT ATGGATATGG TTGGTCAGGA TTTGTGGGTT TCCTTTATGC AGCTTCTGCC TTATATGCTG CAAACCGGTA TCCTGATCCT CTTTGCGGTA CTGTTCTGTT GGGTGTCCGC CGGATTCTGG ACGGCGTTAA TGGGCTTCCT GCAACTGCTT ATTGGTCGCG ATAAATACAG TATATCTGCG TCAACAGTTG GCGATGAACC ATTAAACCCG GAGCATCGCA CGGCGTTGAT CATGCCTATC TGTAACGAAG ACGTGAACCG TGTTTTTGCT GGCCTGCGTG CAACGTGGGA ATCAGTAAAA GCCACCGGGA ATGCCAAACA CTTTGATGTC TACATTCTTA GTGACAGTTA TAACCCGGAT ATCTGCGTCG CAGAGCAAAA AGCCTGGATG GAGCTTATCG CTGAAGTCGG TGGCGAAGGT CAGATTTTCT ATCGCCGCCG CCGTCGCCGC GTGAAGCGTA AAAGCGGTAA TATCGATGAC TTCTGCCGTC GCTGGGGCAG CCAGTACAGC TACATGGTGG TGCTGGATGC TGACTCGGTA ATGACCGGTG ATTGTTTGTG CGGGCTGGTG CGCCTGATGG AAGCCAACCC GAACGCCGGG ATCATTCAGT CGTCGCCGAA AGCGTCCGGT ATGGATACGC TGTATGCGCG CTGTCAGCAG TTCGCGACCC GCGTGTATGG GCCACTGTTT ACAGCCGGTT TGCACTTCTG GCAACTTGGC GAGTCGCACT ACTGGGGACA TAACGCGATT ATCCGCGTGA AACCGTTTAT CGAGCACTGC GCACTGGCTC CGCTGCCGGG CGAAGGTTCC TTTGCCGGTT CAATCCTGTC ACATGACTTC GTGGAAGCGG CGTTGATGCG CCGTGCAGGT TGGGGGGTCT GGATTGCTTA CGATCTCCCG GGTTCTTATG AAGAATTGCC GCCTAACTTG CTTGATGAGC TAAAACGTGA CCGCCGATGG TGCCACGGTA ACCTGATGAA CTTCCGTCTG TTCCTGGTGA AGGGTATGCA CCCGGTTCAC CGTGCGGTGT TCCTGACGGG CGTGATGTCT TATCTCTCCG CTCCGCTGTG GTTTATGTTC CTCGCGCTCT CTACTGCATT GCAGGTAGTG CATGCGTTGA CCGAACCGCA ATACTTCCTG CAACCACGGC AGTTGTTCCC AGTGTGGCCG CAGTGGCGTC CTGAGCTGGC GATTGCACTT TTTGCTTCGA CCATGGTGCT GTTGTTCCTG CCGAAGTTAT TGAGCATTTT GCTTATCTGG TGCAAAGGAA CGAAAGAATA CGGCGGCTTC TGGCGCGTTA CATTATCGTT GCTGCTGGAA GTGCTTTTTT CCGTGCTGCT GGCTCCGGTA CGCATGCTGT TCCATACGGT CTTCGTTGTC AGCGCGTTCC TTGGCTGGGA AGTGGTGTGG AATTCACCGC AGCGTGATGA TGACTCCACT TCCTGGGGTG AAGCGTTCAA ACGCCACGGC TCACAGCTGC TGTTAGGGTT AGTGTGGGCT GTTGGGATGG CGTGGCTGGA TCTGCGTTTC CTGTTCTGGC TGGCACCGAT TGTCTTCTCG TTGATCCTGT CACCGTTTGT TTCGGTGATT TCCAGCCGTG CCACCGTTGG TCTGCGCACC AAACGCTGGA AACTGTTCCT GATCCCGGAA GAGTATTCGC CGCCGCAGGT GCTGGTTGAT ACCGATCGGT TCCTTGAGAT GAATCGTCAA CGCTCCCTTG ATGATGGCTT TATGCACGCA GTGTTTAACC CGTCATTTAA CGCTCTGGCA ACCGCAATGG CGACCGCGCG TCACCGCGCC AGTAAGGTGC TGGAAATCGC CCGTGACCGC CACGTTGAAC AGGCGCTGAA CGAGACGCCA GAGAAGCTGA ATCGCGATCG TCGCCTGGTG CTGCTAAGCG ATCCGGTGAC GATGGCCCGT CTGCATTTCC GTGTCTGGAA TTCCCCGGAG AGATATTCTT CATGGGTGAG TTATTACGAA GGGATAAAGC TCAATCCACT GGCATTGCGT AAACCGGATG CGGCTTCGCA ATAA
|
Protein sequence | MNKTTEYIDA MPIAASEKAA LPKTDIRAVH QALDAEHRTW AREDDSPQGS VKARLEQAWP DSLADGQLIK DDEGRDQLKA MPEAKRSSMF PDPWRTNPVG RFWDRLRGRD VTPRYLARLT KEEQESEQKW RTVGTIRRYI LLILTLAQTV VATWYMKTIL PYQGWALINP MDMVGQDLWV SFMQLLPYML QTGILILFAV LFCWVSAGFW TALMGFLQLL IGRDKYSISA STVGDEPLNP EHRTALIMPI CNEDVNRVFA GLRATWESVK ATGNAKHFDV YILSDSYNPD ICVAEQKAWM ELIAEVGGEG QIFYRRRRRR VKRKSGNIDD FCRRWGSQYS YMVVLDADSV MTGDCLCGLV RLMEANPNAG IIQSSPKASG MDTLYARCQQ FATRVYGPLF TAGLHFWQLG ESHYWGHNAI IRVKPFIEHC ALAPLPGEGS FAGSILSHDF VEAALMRRAG WGVWIAYDLP GSYEELPPNL LDELKRDRRW CHGNLMNFRL FLVKGMHPVH RAVFLTGVMS YLSAPLWFMF LALSTALQVV HALTEPQYFL QPRQLFPVWP QWRPELAIAL FASTMVLLFL PKLLSILLIW CKGTKEYGGF WRVTLSLLLE VLFSVLLAPV RMLFHTVFVV SAFLGWEVVW NSPQRDDDST SWGEAFKRHG SQLLLGLVWA VGMAWLDLRF LFWLAPIVFS LILSPFVSVI SSRATVGLRT KRWKLFLIPE EYSPPQVLVD TDRFLEMNRQ RSLDDGFMHA VFNPSFNALA TAMATARHRA SKVLEIARDR HVEQALNETP EKLNRDRRLV LLSDPVTMAR LHFRVWNSPE RYSSWVSYYE GIKLNPLALR KPDAASQ
|
| |