Gene EcolC_2550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2550 
Symbol 
ID6067328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2796506 
End bp2799049 
Gene Length2544 bp 
Protein Length847 aa 
Translation table11 
GC content55% 
IMG OID641601956 
Productglucosyltransferase MdoH 
Protein accessionYP_001725508 
Protein GI170020554 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2943] Membrane glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000666555 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.464032 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGA CAACTGAGTA CATTGACGCA ATGCCCATCG CCGCAAGCGA GAAAGCGGCA 
TTGCCGAAGA CTGATATCCG CGCCGTTCAT CAGGCGCTGG ATGCCGAACA CCGCACCTGG
GCGCGGGAGG ATGATTCCCC GCAAGGCTCG GTAAAGGCGC GTCTGGAACA AGCCTGGCCA
GATTCACTTG CTGATGGACA GTTAATTAAA GACGACGAAG GGCGCGATCA GCTGAAGGCG
ATGCCAGAAG CAAAACGCTC CTCGATGTTT CCCGACCCGT GGCGTACCAA CCCGGTAGGC
CGTTTCTGGG ATCGCCTGCG TGGACGCGAT GTCACGCCGC GCTATCTGGC TCGTTTGACC
AAAGAAGAGC AGGAGAGCGA GCAAAAGTGG CGTACCGTCG GTACCATCCG CCGTTACATT
CTGTTGATCC TGACGCTCGC GCAAACTGTC GTCGCGACCT GGTATATGAA GACCATTCTT
CCTTATCAGG GTTGGGCGCT GATTAATCCT ATGGATATGG TTGGTCAGGA TTTGTGGGTT
TCCTTTATGC AGCTTCTGCC TTATATGCTG CAAACCGGTA TCCTGATCCT CTTTGCGGTA
CTGTTCTGTT GGGTGTCCGC CGGATTCTGG ACGGCGTTAA TGGGCTTCCT GCAACTGCTT
ATTGGTCGCG ATAAATACAG TATATCTGCG TCAACAGTTG GCGATGAACC ATTAAACCCG
GAGCATCGCA CGGCGTTGAT CATGCCTATC TGTAACGAAG ACGTGAACCG TGTTTTTGCT
GGCCTGCGTG CAACGTGGGA ATCAGTAAAA GCCACCGGGA ATGCCAAACA CTTTGATGTC
TACATTCTTA GTGACAGTTA TAACCCGGAT ATCTGCGTCG CAGAGCAAAA AGCCTGGATG
GAGCTTATCG CTGAAGTCGG TGGCGAAGGT CAGATTTTCT ATCGCCGCCG CCGTCGCCGC
GTGAAGCGTA AAAGCGGTAA TATCGATGAC TTCTGCCGTC GCTGGGGCAG CCAGTACAGC
TACATGGTGG TGCTGGATGC TGACTCGGTA ATGACCGGTG ATTGTTTGTG CGGGCTGGTG
CGCCTGATGG AAGCCAACCC GAACGCCGGG ATCATTCAGT CGTCGCCGAA AGCGTCCGGT
ATGGATACGC TGTATGCGCG CTGTCAGCAG TTCGCGACCC GCGTGTATGG GCCACTGTTT
ACAGCCGGTT TGCACTTCTG GCAACTTGGC GAGTCGCACT ACTGGGGACA TAACGCGATT
ATCCGCGTGA AACCGTTTAT CGAGCACTGC GCACTGGCTC CGCTGCCGGG CGAAGGTTCC
TTTGCCGGTT CAATCCTGTC ACATGACTTC GTGGAAGCGG CGTTGATGCG CCGTGCAGGT
TGGGGGGTCT GGATTGCTTA CGATCTCCCG GGTTCTTATG AAGAATTGCC GCCTAACTTG
CTTGATGAGC TAAAACGTGA CCGCCGCTGG TGCCACGGTA ACCTGATGAA CTTCCGTCTG
TTCCTGGTGA AGGGTATGCA CCCGGTTCAC CGTGCGGTGT TCCTGACGGG CGTGATGTCT
TATCTCTCCG CTCCGCTGTG GTTTATGTTC CTCGCGCTCT CTACTGCATT GCAGGTAGTA
CATGCGTTGA CCGAACCGCA ATACTTCCTG CAACCACGGC AGTTGTTCCC GGTGTGGCCG
CAGTGGCGTC CTGAGCTGGC GATTGCACTT TTTGCTTCGA CTATGGTGCT GTTGTTCCTG
CCGAAGTTAT TGAGCATTTT GCTTATCTGG TGCAAAGGAA CGAAAGAATA CGGCGGATTC
TGGCGCGTTA CATTATCGTT GCTGCTGGAA GTGCTGTTTT CCGTGCTGCT GGCTCCGGTA
CGCATGCTGT TCCATACGGT CTTCGTCGTC AGCGCGTTCC TTGGCTGGGA AGTTGTGTGG
AATTCACCGC AGCGTGATGA TGACTCCACT TCCTGGGGTG AAGCGTTCAA ACGCCACGGC
TCACAGCTGC TGTTAGGGTT AGTGTGGGCT GTTGGTATGG CGTGGTTGGA TCTGCGTTTC
CTGTTCTGGC TGGCACCGAT TGTCTTCTCG TTGATCCTGT CACCGTTTGT TTCGGTGATT
TCCAGCCGTG CCACCGTTGG TCTGCGAACC AAACGCTGGA AACTGTTCCT GATCCCGGAA
GAGTATTCAC CGCCGCAGGT GCTGGTTGAT ACCGATCGCT TCCTTGAGAT GAATCGTCAA
CGCTCCCTTG ATGATGGTTT TATGCACGCG GTGTTTAACC CGTCATTTAA CGCTCTGGCA
ACCGCAATGG CGACCGCGCG TCACCGCGCC AGTAAGGTGC TGGAAATCGC CCGTGACCGC
CACGTTGAAC AGGCGCTTAA CGAGACGCCA GAGAAGCTGA ATCGCGATCG TCGCCTGGTG
CTGCTAAGCG ATCCGGTGAC GATGGCCCGT CTGCATTTCC GCGTCTGGAA TTCCCCGGAG
AGATATTCTT CATGGGTGAG TTATTACGAA GGGATAAAGC TCAATCCACT GGCATTGCGT
AAACCGGATG CGGCTTCGCA ATAA
 
Protein sequence
MNKTTEYIDA MPIAASEKAA LPKTDIRAVH QALDAEHRTW AREDDSPQGS VKARLEQAWP 
DSLADGQLIK DDEGRDQLKA MPEAKRSSMF PDPWRTNPVG RFWDRLRGRD VTPRYLARLT
KEEQESEQKW RTVGTIRRYI LLILTLAQTV VATWYMKTIL PYQGWALINP MDMVGQDLWV
SFMQLLPYML QTGILILFAV LFCWVSAGFW TALMGFLQLL IGRDKYSISA STVGDEPLNP
EHRTALIMPI CNEDVNRVFA GLRATWESVK ATGNAKHFDV YILSDSYNPD ICVAEQKAWM
ELIAEVGGEG QIFYRRRRRR VKRKSGNIDD FCRRWGSQYS YMVVLDADSV MTGDCLCGLV
RLMEANPNAG IIQSSPKASG MDTLYARCQQ FATRVYGPLF TAGLHFWQLG ESHYWGHNAI
IRVKPFIEHC ALAPLPGEGS FAGSILSHDF VEAALMRRAG WGVWIAYDLP GSYEELPPNL
LDELKRDRRW CHGNLMNFRL FLVKGMHPVH RAVFLTGVMS YLSAPLWFMF LALSTALQVV
HALTEPQYFL QPRQLFPVWP QWRPELAIAL FASTMVLLFL PKLLSILLIW CKGTKEYGGF
WRVTLSLLLE VLFSVLLAPV RMLFHTVFVV SAFLGWEVVW NSPQRDDDST SWGEAFKRHG
SQLLLGLVWA VGMAWLDLRF LFWLAPIVFS LILSPFVSVI SSRATVGLRT KRWKLFLIPE
EYSPPQVLVD TDRFLEMNRQ RSLDDGFMHA VFNPSFNALA TAMATARHRA SKVLEIARDR
HVEQALNETP EKLNRDRRLV LLSDPVTMAR LHFRVWNSPE RYSSWVSYYE GIKLNPLALR
KPDAASQ