Gene ECD_01046 details


Gene Information

Locus tag: ECD_01046
Symbol: mdoH
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 1113430
End bp: 1115973
Gene Length: 2544 bp
Protein Length: 847 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: glucosyltransferase MdoH
Protein accession: ACT42941
Protein GI: 253977271
COG category:
COG ID:
TIGRFAM ID:
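
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this) and that the CDS ends in a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
start_bp = 1113430
end_bp = 1115973
gene_length_bp = 2544
protein_length_aa = 847


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 2544
print(expected_protein_length(gene_length_bp))  # 847
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the inclusive-coordinate assumption.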


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00471398
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAGA CAACTGAGTA CATTGACGCA ATGCCCATCG CCGCAAGCGA GAAAGCGGCA 
TTGCCGAAGA CTGATATCCG CGCCGTTCAT CAGGCGCTGG ATGCCGAACA CCGCACCTGG
GCGCGGGAGG ATGATTCCCC GCAAGGCTCG GTAAAGGCGC GTCTGGAACA AGCCTGGCCA
GATTCACTTG CTGATGGACA GTTAATTAAA GACGACGAAG GGCGCGATCA GCTGAAGGCG
ATGCCAGAAG CAAAACGCTC CTCGATGTTT CCCGACCCGT GGCGTACCAA CCCGGTAGGC
CGTTTCTGGG ATCGCCTGCG TGGACGCGAT GTCACGCCGC GCTATCTGGC TCGTTTGACC
AAAGAAGAGC AGGAGAGCGA GCAAAAGTGG CGTACCGTCG GTACCATCCG CCGTTACATT
CTGTTGATCC TGACGCTCGC GCAAACTGTC GTCGCGACCT GGTATATGAA GACCATTCTT
CCTTATCAGG GTTGGGCGCT GATTAATCCT ATGGATATGG TTGGTCAGGA TTTGTGGGTT
TCCTTTATGC AGCTTCTGCC TTATATGCTG CAAACCGGTA TCCTGATCCT CTTTGCGGTA
CTGTTCTGTT GGGTGTCCGC CGGATTCTGG ACGGCGTTAA TGGGCTTCCT GCAACTGCTT
ATTGGTCGCG ATAAATACAG TATATCTGCG TCAACAGTTG GCGATGAACC ATTAAACCCG
GAGCATCGCA CGGCGTTGAT CATGCCTATC TGTAACGAAG ACGTGAACCG TGTTTTTGCT
GGCCTGCGTG CAACGTGGGA ATCAGTAAAA GCCACCGGGA ATGCCAAACA CTTTGATGTC
TACATTCTTA GTGACAGTTA TAACCCGGAT ATCTGCGTCG CAGAGCAAAA AGCCTGGATG
GAGCTTATCG CTGAAGTCGG TGGCGAAGGT CAGATTTTCT ATCGCCGCCG CCGTCGCCGC
GTGAAGCGTA AAAGCGGTAA TATCGATGAC TTCTGCCGTC GCTGGGGCAG CCAGTACAGC
TACATGGTGG TGCTGGATGC TGACTCGGTA ATGACCGGTG ATTGTTTGTG CGGGCTGGTG
CGCCTGATGG AAGCCAACCC GAACGCCGGG ATCATTCAGT CGTCGCCGAA AGCGTCCGGT
ATGGATACGC TGTATGCGCG CTGTCAGCAG TTCGCGACCC GCGTGTATGG GCCACTGTTT
ACAGCCGGTT TGCACTTCTG GCAACTTGGC GAGTCGCACT ACTGGGGACA TAACGCGATT
ATCCGCGTGA AACCGTTTAT CGAGCACTGC GCACTGGCTC CGCTGCCGGG CGAAGGTTCC
TTTGCCGGTT CAATCCTGTC ACATGACTTC GTGGAAGCGG CGTTGATGCG CCGTGCAGGT
TGGGGGGTCT GGATTGCTTA CGATCTCCCG GGTTCTTATG AAGAATTGCC GCCTAACTTG
CTTGATGAGC TAAAACGTGA CCGCCGATGG TGCCACGGTA ACCTGATGAA CTTCCGTCTG
TTCCTGGTGA AGGGTATGCA CCCGGTTCAC CGTGCGGTGT TCCTGACGGG CGTGATGTCT
TATCTCTCCG CTCCGCTGTG GTTTATGTTC CTCGCGCTCT CTACTGCATT GCAGGTAGTG
CATGCGTTGA CCGAACCGCA ATACTTCCTG CAACCACGGC AGTTGTTCCC AGTGTGGCCG
CAGTGGCGTC CTGAGCTGGC GATTGCACTT TTTGCTTCGA CCATGGTGCT GTTGTTCCTG
CCGAAGTTAT TGAGCATTTT GCTTATCTGG TGCAAAGGAA CGAAAGAATA CGGCGGCTTC
TGGCGCGTTA CATTATCGTT GCTGCTGGAA GTGCTTTTTT CCGTGCTGCT GGCTCCGGTA
CGCATGCTGT TCCATACGGT CTTCGTTGTC AGCGCGTTCC TTGGCTGGGA AGTGGTGTGG
AATTCACCGC AGCGTGATGA TGACTCCACT TCCTGGGGTG AAGCGTTCAA ACGCCACGGC
TCACAGCTGC TGTTAGGGTT AGTGTGGGCT GTTGGGATGG CGTGGCTGGA TCTGCGTTTC
CTGTTCTGGC TGGCACCGAT TGTCTTCTCG TTGATCCTGT CACCGTTTGT TTCGGTGATT
TCCAGCCGTG CCACCGTTGG TCTGCGCACC AAACGCTGGA AACTGTTCCT GATCCCGGAA
GAGTATTCGC CGCCGCAGGT GCTGGTTGAT ACCGATCGGT TCCTTGAGAT GAATCGTCAA
CGCTCCCTTG ATGATGGCTT TATGCACGCA GTGTTTAACC CGTCATTTAA CGCTCTGGCA
ACCGCAATGG CGACCGCGCG TCACCGCGCC AGTAAGGTGC TGGAAATCGC CCGTGACCGC
CACGTTGAAC AGGCGCTGAA CGAGACGCCA GAGAAGCTGA ATCGCGATCG TCGCCTGGTG
CTGCTAAGCG ATCCGGTGAC GATGGCCCGT CTGCATTTCC GTGTCTGGAA TTCCCCGGAG
AGATATTCTT CATGGGTGAG TTATTACGAA GGGATAAAGC TCAATCCACT GGCATTGCGT
AAACCGGATG CGGCTTCGCA ATAA
 
Protein sequence
MNKTTEYIDA MPIAASEKAA LPKTDIRAVH QALDAEHRTW AREDDSPQGS VKARLEQAWP 
DSLADGQLIK DDEGRDQLKA MPEAKRSSMF PDPWRTNPVG RFWDRLRGRD VTPRYLARLT
KEEQESEQKW RTVGTIRRYI LLILTLAQTV VATWYMKTIL PYQGWALINP MDMVGQDLWV
SFMQLLPYML QTGILILFAV LFCWVSAGFW TALMGFLQLL IGRDKYSISA STVGDEPLNP
EHRTALIMPI CNEDVNRVFA GLRATWESVK ATGNAKHFDV YILSDSYNPD ICVAEQKAWM
ELIAEVGGEG QIFYRRRRRR VKRKSGNIDD FCRRWGSQYS YMVVLDADSV MTGDCLCGLV
RLMEANPNAG IIQSSPKASG MDTLYARCQQ FATRVYGPLF TAGLHFWQLG ESHYWGHNAI
IRVKPFIEHC ALAPLPGEGS FAGSILSHDF VEAALMRRAG WGVWIAYDLP GSYEELPPNL
LDELKRDRRW CHGNLMNFRL FLVKGMHPVH RAVFLTGVMS YLSAPLWFMF LALSTALQVV
HALTEPQYFL QPRQLFPVWP QWRPELAIAL FASTMVLLFL PKLLSILLIW CKGTKEYGGF
WRVTLSLLLE VLFSVLLAPV RMLFHTVFVV SAFLGWEVVW NSPQRDDDST SWGEAFKRHG
SQLLLGLVWA VGMAWLDLRF LFWLAPIVFS LILSPFVSVI SSRATVGLRT KRWKLFLIPE
EYSPPQVLVD TDRFLEMNRQ RSLDDGFMHA VFNPSFNALA TAMATARHRA SKVLEIARDR
HVEQALNETP EKLNRDRRLV LLSDPVTMAR LHFRVWNSPE RYSSWVSYYE GIKLNPLALR
KPDAASQ
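
The gene and protein sequences above are printed in space-separated blocks of ten residues. As an illustrative sketch, the snippet below removes that grouping and translates two excerpts of the gene sequence (its first and last lines) to compare them with the matching ends of the protein sequence. It assumes the amino acid assignments of translation table 11 are the same as the standard code, with the table differing only in its permitted start codons. That makes no difference here because the gene starts with ATG. The helper names `ungroup` and `translate` are hypothetical.

```python
# NCBI ordering of the genetic code: bases in T, C, A, G order.
# Amino acid assignments for table 11 match the standard code;
# table 11 differs only in which codons may act as starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def ungroup(blocks: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(blocks.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAATAAGA CAACTGAGTA CATTGACGCA ATGCCCATCG CCGCAAGCGA GAAAGCGGCA"
last_line = "AAACCGGATG CGGCTTCGCA ATAA"

print(translate(ungroup(first_line)))  # MNKTTEYIDAMPIAASEKAA
print(translate(ungroup(last_line)))   # KPDAASQ
```

The last line can be translated on its own because the 42 full lines before it hold 2520 bases, a multiple of 3, so it begins in frame. Its final codon, TAA, is the stop codon and is not counted in the 847 aa protein length.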