Gene EcDH1_2596 details


Gene Information

Locus tag: EcDH1_2596
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2766333
End bp: 2768846
Gene Length: 2514 bp
Protein Length: 837 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: glycosyl transferase family 2
Protein accession: ACX40232
Protein GI: 260449810
COG category:
COG ID:
TIGRFAM ID:
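
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention, and the function name is illustrative only.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop
    codon that is counted in the gene length but not in the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = expected_lengths(2766333, 2768846)
print(gene_len, protein_len)  # 2514 837
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.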


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0184053
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCATCG CCGCAAGCGA GAAAGCGGCA TTGCCGAAGA CTGATATCCG CGCCGTTCAT 
CAGGCGCTGG ATGCCGAACA CCGCACCTGG GCGCGGGAGG ATGATTCCCC GCAAGGCTCG
GTAAAGGCGC GTCTGGAACA AGCCTGGCCA GATTCACTTG CTGATGGACA GTTAATTAAA
GACGACGAAG GGCGCGATCA GCTGAAGGCG ATGCCAGAAG CAAAACGCTC CTCGATGTTT
CCCGACCCGT GGCGTACCAA CCCGGTAGGC CGTTTCTGGG ATCGCCTGCG TGGACGCGAT
GTCACGCCGC GCTATCTGGC TCGTTTGACC AAAGAAGAGC AGGAGAGCGA GCAAAAGTGG
CGTACCGTCG GTACCATCCG CCGTTACATT CTGTTGATCC TGACGCTCGC GCAAACTGTC
GTCGCGACCT GGTATATGAA GACCATTCTT CCTTATCAGG GTTGGGCGCT GATTAATCCT
ATGGATATGG TTGGTCAGGA TTTGTGGGTT TCCTTTATGC AGCTTCTGCC TTATATGCTG
CAAACCGGTA TCCTGATCCT CTTTGCGGTA CTGTTCTGTT GGGTGTCCGC CGGATTCTGG
ACGGCGTTAA TGGGCTTCCT GCAACTGCTT ATTGGTCGCG ATAAATACAG TATATCTGCG
TCAACAGTTG GCGATGAACC ATTAAACCCG GAGCATCGCA CGGCGTTGAT CATGCCTATC
TGTAACGAAG ACGTGAACCG TGTTTTTGCT GGCCTGCGTG CAACGTGGGA ATCAGTAAAA
GCCACCGGGA ATGCCAAACA CTTTGATGTC TACATTCTTA GTGACAGTTA TAACCCGGAT
ATCTGCGTCG CAGAGCAAAA AGCCTGGATG GAGCTTATCG CTGAAGTCGG TGGCGAAGGT
CAGATTTTCT ATCGCCGCCG CCGTCGCCGC GTGAAGCGTA AAAGCGGTAA TATCGATGAC
TTCTGCCGTC GCTGGGGCAG CCAGTACAGC TACATGGTGG TGCTGGATGC TGACTCGGTA
ATGACCGGTG ATTGTTTGTG CGGGCTGGTG CGCCTGATGG AAGCCAACCC GAACGCCGGG
ATCATTCAGT CGTCGCCGAA AGCGTCCGGT ATGGATACGC TGTATGCGCG CTGTCAGCAG
TTCGCGACCC GCGTGTATGG GCCACTGTTT ACAGCCGGTT TGCACTTCTG GCAACTTGGC
GAGTCGCACT ACTGGGGACA TAACGCGATT ATCCGCGTGA AACCGTTTAT CGAGCACTGC
GCACTGGCTC CGCTGCCGGG CGAAGGTTCC TTTGCCGGTT CAATCCTGTC ACATGACTTC
GTGGAAGCGG CGTTGATGCG CCGTGCAGGT TGGGGGGTCT GGATTGCTTA CGATCTCCCG
GGTTCTTATG AAGAATTGCC GCCTAACTTG CTTGATGAGC TAAAACGTGA CCGCCGATGG
TGCCACGGTA ACCTGATGAA CTTCCGTCTG TTCCTGGTGA AGGGTATGCA CCCGGTTCAC
CGTGCGGTGT TCCTGACGGG CGTGATGTCT TATCTCTCCG CTCCGCTGTG GTTTATGTTC
CTCGCGCTCT CTACTGCATT GCAGGTAGTG CATGCGTTGA CCGAACCGCA ATACTTCCTG
CAACCACGGC AGTTGTTCCC AGTGTGGCCG CAGTGGCGTC CTGAGCTGGC GATTGCACTT
TTTGCTTCGA CCATGGTGCT GTTGTTCCTG CCGAAGTTAT TGAGCATTTT GCTTATCTGG
TGCAAAGGAA CGAAAGAATA CGGCGGCTTC TGGCGCGTTA CATTATCGTT GCTGCTGGAA
GTGCTTTTTT CCGTGCTGCT GGCTCCGGTA CGCATGCTGT TCCATACGGT CTTCGTTGTC
AGCGCGTTCC TTGGCTGGGA AGTGGTGTGG AATTCACCGC AGCGTGATGA TGACTCCACT
TCCTGGGGTG AAGCGTTCAA ACGCCACGGC TCACAGCTGC TGTTAGGGTT AGTGTGGGCT
GTTGGGATGG CGTGGCTGGA TCTGCGTTTC CTGTTCTGGC TGGCACCGAT TGTCTTCTCG
TTGATCCTGT CACCGTTTGT TTCGGTGATT TCCAGCCGTG CCACCGTTGG TCTGCGCACC
AAACGCTGGA AACTGTTCCT GATCCCGGAA GAGTATTCGC CGCCGCAGGT GCTGGTTGAT
ACCGATCGGT TCCTTGAGAT GAATCGTCAA CGCTCCCTTG ATGATGGCTT TATGCACGCA
GTGTTTAACC CGTCATTTAA CGCTCTGGCA ACCGCAATGG CGACCGCGCG TCACCGCGCC
AGTAAGGTGC TGGAAATCGC CCGTGACCGC CACGTTGAAC AGGCGCTGAA CGAGACGCCA
GAGAAGCTGA ATCGCGATCG TCGCCTGGTG CTGCTAAGCG ATCCGGTGAC GATGGCCCGT
CTGCATTTCC GTGTCTGGAA TTCCCCGGAG AGATATTCTT CATGGGTGAG TTATTACGAA
GGGATAAAGC TCAATCCACT GGCATTGCGT AAACCGGATG CGGCTTCGCA ATAA
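
A GC percentage such as the one reported in the GC content field can be computed directly from sequence text laid out as above, in 10-base blocks separated by spaces and line breaks. This is a minimal sketch; how the database itself computes or rounds the value is not stated in the record, and the function name is hypothetical.

```python
def gc_percent(blocked_sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (block separators and line breaks) is ignored.
    """
    seq = "".join(blocked_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First 10-base block of the gene sequence: 6 of 10 bases are G or C.
print(gc_percent("ATGCCCATCG"))  # 60.0
```

Passing the full gene sequence text to `gc_percent` gives the whole-gene value for comparison with the reported field.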
 
Protein sequence
MPIAASEKAA LPKTDIRAVH QALDAEHRTW AREDDSPQGS VKARLEQAWP DSLADGQLIK 
DDEGRDQLKA MPEAKRSSMF PDPWRTNPVG RFWDRLRGRD VTPRYLARLT KEEQESEQKW
RTVGTIRRYI LLILTLAQTV VATWYMKTIL PYQGWALINP MDMVGQDLWV SFMQLLPYML
QTGILILFAV LFCWVSAGFW TALMGFLQLL IGRDKYSISA STVGDEPLNP EHRTALIMPI
CNEDVNRVFA GLRATWESVK ATGNAKHFDV YILSDSYNPD ICVAEQKAWM ELIAEVGGEG
QIFYRRRRRR VKRKSGNIDD FCRRWGSQYS YMVVLDADSV MTGDCLCGLV RLMEANPNAG
IIQSSPKASG MDTLYARCQQ FATRVYGPLF TAGLHFWQLG ESHYWGHNAI IRVKPFIEHC
ALAPLPGEGS FAGSILSHDF VEAALMRRAG WGVWIAYDLP GSYEELPPNL LDELKRDRRW
CHGNLMNFRL FLVKGMHPVH RAVFLTGVMS YLSAPLWFMF LALSTALQVV HALTEPQYFL
QPRQLFPVWP QWRPELAIAL FASTMVLLFL PKLLSILLIW CKGTKEYGGF WRVTLSLLLE
VLFSVLLAPV RMLFHTVFVV SAFLGWEVVW NSPQRDDDST SWGEAFKRHG SQLLLGLVWA
VGMAWLDLRF LFWLAPIVFS LILSPFVSVI SSRATVGLRT KRWKLFLIPE EYSPPQVLVD
TDRFLEMNRQ RSLDDGFMHA VFNPSFNALA TAMATARHRA SKVLEIARDR HVEQALNETP
EKLNRDRRLV LLSDPVTMAR LHFRVWNSPE RYSSWVSYYE GIKLNPLALR KPDAASQ
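
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the standard NCBI base ordering (TCAG). It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as alternative starts; since this gene begins with ATG, that difference is not exercised here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_sequence: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon.

    Whitespace between blocks is ignored; a trailing partial codon is dropped.
    """
    seq = "".join(blocked_sequence.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first protein block.
print(translate("ATGCCCATCG CCGCAAGCGA GAAAGCGGCA"))  # MPIAASEKAA
```

Applied to the full gene sequence, the result can be compared against the protein sequence listed above after removing its block spacing.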