Gene ECH74115_4894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4894 
Symbol 
ID6971803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4528845 
End bp4530833 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content52% 
IMG OID643388582 
Productputative phosphodiesterase 
Protein accessionYP_002273010 
Protein GI209400445 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.509756 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCGTAA GTCGCTCGTT AACAATCAAG CAGATGGCAA TGGTGGCAGC CGTTGTCCTG 
GTGTTCGTTT TTATTTTTTG CACCGTTTTG CTGTTCCATC TGGTCCAGCA GAATCGCTAT
AACACGGCTA CGCAACTGGA AAGCATTGCT CGCTCTGTCC GCGAACCTTT ATCTTCAGCC
ATTTTGAAAG GTGATATTCC CGAAGCGGAA GCTATTCTTG CCAGCATTAA ACCGGCAGGC
GTGGTCAGCC GTGCCGATGT AGTGCTGCCT AACCAGTTCC AGGCGCTGCG TAAAAGTTTT
ATTCCAGAGC GTCCGGTGCC GGTAATGGTT ACTCGCCTGT TTGAGCTACC GGTTCAAATC
TCGCTGGGCG TCTACTCGCT CGAACGTCCG GCAAATCCGC AGCCAATTGC CTATCTGGTG
CTACAGGCGG ATTCCTTCCG TATGTATAAG TTCGTGATGA GCACCCTCTC AACGTTAGTG
ACCATTTACT TACTTTTGTC GTTAATATTG ACGGTGGCGA TTAGCTGGTG CATTAACCGC
CTGATTTTGC ATCCGTTACG CAATATTGCT CGCGAACTTA ACGCCATCCC AGCCCAGGAG
CTTGTTGGTC ACCAACTGGC ATTACCGCGT CTGCATCAGG ACGATGAAAT CGGTATGTTG
GTGCGCAGTT ACAACCTCAA CCAGCAATTG CTGCAGCGCC ATTATGAAGA ACAGAACGAA
AATGCGATGC GCTTCCCGGT GTCGGATTTG CCGAACAAAG CCTTGCTGAT GGAGATGCTG
GAGCAGGTTG TCGCGCGTAA ACAAACCACC GCGCTGATGA TCATCACCTG TGAAACTCTG
CGTGATACTG CGGGCGTGCT GAAAGAGGCG CAACGAGAAA TTCTGCTGCT GACGCTGGTG
GAAAAACTCA AATCGGTACT GTCGCCACGT ATGATCCTCG CGCAGATTAG CGGTTATGAC
TTTGCTGTCA TTGCCAACGG TGTACATGAA CCGTGGCACG CAATCACTTT GGGTCAGCAA
GTGCTCACTA TCATGAGCGA GCGCCTGCCG ATTGAACGTA TTCAACTCCG TCCGCACTGT
AGCATTAGCG TGGCGATGTT CTACGGCGAT CTCACCGCCG AACAGCTTTA CAGTCGCGCT
ATTTCTGCGG CATTTACCGC TCGCCATAAA GGCAAGAATC AGATTCAGTT CTTTGATCCG
CAGCAGATGG AAGCCGCCCA GAAGCGGTTG ACGGAAGAGA GCGATATCCT TAATGCACTG
GAAAATCATC AGTTTGCTAT TTGGTTACAG CCACAGGTCG AGATGACCAG CGGTAAACTG
GTCAGTGCGG AAGTGTTACT GCGTATCCAG CAACCGGATG GCAGTTGGGA CCTGCCGGAT
GGCTTAATCG ATCGCATTGA GTGCTGTGGG CTGATGGTTA CCGTCGGTCA CTGGGTGCTG
GAAGAGTCCT GTCGATTGCT TGCAGCCTGG CAAGAGCGCG GCATTATGCT GCCCTTGTCG
GTAAACCTCT CTGTGCTGCA ACTGATGCAC CCGAATATGG TGGCGGATAT GCTGGAACTG
TTAACCCGCT ATCGCATTCA GCCGGGAACA CTGATTCTGG AAGTGACAGA AAGCCGACGT
ATTGACGACC CTCATGCTGC GGTGGCAATC CTCCGTCCGC TGCGTAATGC CGGAGTTCGG
GTGGCGCTGG ATGATTTCGG CATGGGCTAC GCAGGGCTGC GTCAGCTGCA GCATATGAAA
TCGTTGCCAA TCGACGTTCT GAAAATCGAC AAAATGTTTG TTGAAGGCTT GCCGGAAGAT
AGCAGCATGA TTGCTGCAAT TATCATGCTG GCGCAGAGCC TGAACTTACA AATGATTGCC
GAAGGCGTGG AGACTGAAGC ACAACGCGAC TGGCTGGCAA AAGCGGGCGT TGGTATTGCC
CAGGGCTTCC TTTTTGCTCG TCCACTCCCT ATTGAAATCT TCGAAGAGAG TTACCTGGAA
GAAAAGTAG
 
Protein sequence
MRVSRSLTIK QMAMVAAVVL VFVFIFCTVL LFHLVQQNRY NTATQLESIA RSVREPLSSA 
ILKGDIPEAE AILASIKPAG VVSRADVVLP NQFQALRKSF IPERPVPVMV TRLFELPVQI
SLGVYSLERP ANPQPIAYLV LQADSFRMYK FVMSTLSTLV TIYLLLSLIL TVAISWCINR
LILHPLRNIA RELNAIPAQE LVGHQLALPR LHQDDEIGML VRSYNLNQQL LQRHYEEQNE
NAMRFPVSDL PNKALLMEML EQVVARKQTT ALMIITCETL RDTAGVLKEA QREILLLTLV
EKLKSVLSPR MILAQISGYD FAVIANGVHE PWHAITLGQQ VLTIMSERLP IERIQLRPHC
SISVAMFYGD LTAEQLYSRA ISAAFTARHK GKNQIQFFDP QQMEAAQKRL TEESDILNAL
ENHQFAIWLQ PQVEMTSGKL VSAEVLLRIQ QPDGSWDLPD GLIDRIECCG LMVTVGHWVL
EESCRLLAAW QERGIMLPLS VNLSVLQLMH PNMVADMLEL LTRYRIQPGT LILEVTESRR
IDDPHAAVAI LRPLRNAGVR VALDDFGMGY AGLRQLQHMK SLPIDVLKID KMFVEGLPED
SSMIAAIIML AQSLNLQMIA EGVETEAQRD WLAKAGVGIA QGFLFARPLP IEIFEESYLE
EK