Gene ECH74115_0984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0984 
Symbol 
ID6969842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1000676 
End bp1003024 
Gene Length2349 bp 
Protein Length782 aa 
Translation table11 
GC content42% 
IMG OID643385000 
Productcyclic diguanylate phosphodiesterase (EAL) domain protein 
Protein accessionYP_002269500 
Protein GI209398618 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.56885 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAGTT TATACGAAAA GATAAAGATA AGGCTGATAA TTTTATTTTT ATTGGCAGCA 
CTGTCATTTA TTGGCCTTTT TTTCATCATT AACTATCAAC TGGTATCGGA ACGCGCGGTA
AAACGTGCCG ATAGCCGCTT TGAACTTATT CAGAAAAACG TTGGCTATTT CTTTAAAGAT
ATTGAACGTT CGGCCCTGAC ATTAAAGGAC TCACTGTATT TATTAAAAAA TACAGAGGAG
ATTCAACGCG CTGTGATCCT GAAAATGGAA ATGATGCCAT TTTTAGACTC GGTGGGACTG
GTACTTGATG ATAATAAATA TTATCTCTTT TCGCGGAGGG CGAACGATAA AATCGTTGTT
TATCATCAGG AACAAGTAAA TGGACCGCTT GTCGACGAGT CAGGGCAGGT TATTTTTGCC
GATTTTAACC CATCGAAACG ACCGTGGTCG GTGGCTTCAG ATGACTCTAA CAACAGCTGG
AATCCGGCCT ACAATTGCTT TGATCGTCCG GGTAAAAAAT GTATCTCTTT TACGCTACGC
ATCAACGGCA AAGATCACGA TTTGTTAGCG GTGGATAAAA TTCATGTCGA TTTAAACTGG
CGATATCTGA ACGAGTATCT TGATCAAATC AGCGCTAATG ATGAAGTTCT ATTTTTGAAA
CAAGGCCATG AGATCATTGC CAAGAATCAA CTCGCTCGTG AAAAACTGAT TATTTATAAT
AGCGAAGGTG ATTATAATAT TATTGATTCT GTCGATACTG AATATATCGA AAAAACATCA
GCGGTGCCAA ATAACGCATT ATTCGAAATC TATTTTTATT ATCCGGGCGG TAATTTATTG
AACGCATCAG ATAAACTTTT TTATCTGCCG TTTGCGTTCA TTATTATCGT ATTGCTGGTG
GTTTATTTAA TGACCACTCG TGTGTTCCGT CGGCAATTTT CTGAAATGAC AGAGCTGATT
AATACGCTGG CGTTTTTGCC TGACTCAACG GATCAGATCG AGGCTCTGAA AATTCGCGAA
GGCGATGCGA AAGAGATTAT CAGCATCAAA AATTCGATCG CGGAAATGAA AGATGCCGAA
ATTGAACGGT CAAATAAATT GCTCTCACTG ATCTCTTACG ATCAGGAAAG TGGTTTTATT
AAAAATATGG CGATTATTGA GTCCAACAAT AATCAGTATC TGGCTGTGGG GATCATCAAA
CTGTGTGGTC TGGAAGCCGT GGAAGCTGTG TTTGGTGTTG ATGAACGCAA TAAAATCGTC
AGGAAATTGT GTCAGCGAAT TGCTGAGAAA TATGCACAAT GCTGCGATAT CGTGACATTC
AATGCCGATC TCTATTTACT TCTGTGTCGG GAAAATGTAC AGACATTTAC CCGTAAAATA
GCGATGGTAA ACGATTTTGA CAGCAGCTTT GGCTACCGCA ATCTGCGCAT CCATAAGTCT
GCCATTTGTG AACCTTTGCA GGGGGAAAAC GCCTGGAGTT ACGCAGAAAA ACTGAAACTG
GCGATTTCCA GTATCCGTGA CCATATGTTC TCCGAGTTTA TTTTCTGTGA TGACGCGAAA
CTCAACGAAA TAGAAGAGAA TATCTGGATT GCGCGTAATA TTCGCCATGC AATGGAAATT
GGCGAACTAT TCCTCGTCTA TCAACCGATC GTTGATATTA ACACCCGCGC CATTCTGGGC
GCGGAGGCGT TGTGCCGTTG GGTGTCTGCG GAGCGGGGGA TCATTTCACC GCTGAAGTTC
ATTACCATTG CTGAAGATAT CGGGTTTATC AATGAGCTGG GTTATCAGAT TATTAAAACG
GCAATGGGTG AATTCAGACA TTTTAGTCAG CGTGCATCGC TGAAGGATGA TTTCTTACTG
CATATTAATG TTTCGCCCTG GCAGTTAAAC GAACCGCACT TTCATGAGCG TTTTACCACC
ATCATGAAAG AAAATGGCCT GAAGGCGAAC AGCCTCTGTG TTGAGATCAC TGAAACCGTG
ATCGAGCGAA TTAATGAACA TTTTTATCTC AATATTGAAC AACTGCGTAA ACAAGGGTTA
CGGATATCGA TTGATGACTT TGGCACCGGT TTGTCAAACC TGAAACGTTT TTATGAAATT
AATCCAGACA GCATAAAGGT GGACTCGCAA TTTACCGGCG ATATTTTCGG TACTGCGGGA
AAAATTGTGC GCATTATTTT CGACCTGGCA CGCTATAACC GGATCCCGGT GATTGCGGAA
GGCGTAGAGA GCGAAGACGT TGCGCGCGAA TTAATCAAAT TAGGATGTGT TCAGGCTCAG
GGGTATCTGT ATCAGAAACC CATGCCATTC TCCGCCTGGG ATAAAAGTGG AAAATTAGTA
AAAGAGTAG
 
Protein sequence
MLSLYEKIKI RLIILFLLAA LSFIGLFFII NYQLVSERAV KRADSRFELI QKNVGYFFKD 
IERSALTLKD SLYLLKNTEE IQRAVILKME MMPFLDSVGL VLDDNKYYLF SRRANDKIVV
YHQEQVNGPL VDESGQVIFA DFNPSKRPWS VASDDSNNSW NPAYNCFDRP GKKCISFTLR
INGKDHDLLA VDKIHVDLNW RYLNEYLDQI SANDEVLFLK QGHEIIAKNQ LAREKLIIYN
SEGDYNIIDS VDTEYIEKTS AVPNNALFEI YFYYPGGNLL NASDKLFYLP FAFIIIVLLV
VYLMTTRVFR RQFSEMTELI NTLAFLPDST DQIEALKIRE GDAKEIISIK NSIAEMKDAE
IERSNKLLSL ISYDQESGFI KNMAIIESNN NQYLAVGIIK LCGLEAVEAV FGVDERNKIV
RKLCQRIAEK YAQCCDIVTF NADLYLLLCR ENVQTFTRKI AMVNDFDSSF GYRNLRIHKS
AICEPLQGEN AWSYAEKLKL AISSIRDHMF SEFIFCDDAK LNEIEENIWI ARNIRHAMEI
GELFLVYQPI VDINTRAILG AEALCRWVSA ERGIISPLKF ITIAEDIGFI NELGYQIIKT
AMGEFRHFSQ RASLKDDFLL HINVSPWQLN EPHFHERFTT IMKENGLKAN SLCVEITETV
IERINEHFYL NIEQLRKQGL RISIDDFGTG LSNLKRFYEI NPDSIKVDSQ FTGDIFGTAG
KIVRIIFDLA RYNRIPVIAE GVESEDVARE LIKLGCVQAQ GYLYQKPMPF SAWDKSGKLV
KE