Gene ECH74115_2510 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2510 
Symbol 
ID6968875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2379940 
End bp2381430 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content46% 
IMG OID643386379 
Productdiguanylate cyclase (GGDEF) domain protein 
Protein accessionYP_002270861 
Protein GI209395799 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.291593 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.751069 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTGC ACCATAGAAT GCTCCGGCAT TTTATCGCCG CAAGTGTCAT TGTGCTGACA 
TCTTCCTTCC TTATTTTTGA ACTTGTCGCC AGCGACAGAG CAATGAGTGC CTATCTGCGC
TATATCGTGC AGAAAGCAGA CTCCTCCTTT CTTTATGATA AGTATCAGAA TCAGAGTATT
GCCGCGCATG TGATGCGCGC TCTCGCTGCT GAGCAGTCGG AAGTGTCGCC AGAACAGCGG
CGCGCCATCT GCGAGGCTTT TGAGTCTGCC AATAATACCC ATGGCTTAAA CCTGACTGCC
CATAAATACC CGGGCTTACG CGGCACATTA CAAACCGCAT CCACTGACTG CGACACAATT
GTGGAAGCTG CAGCACTATT ACCCGCTTTT GATCAGGCAG TGGAAGGCAA CCGCCACCAG
GATGATTACG GTTCAGGTCT TGGGATGGCC GAAGAGAAAT TTCACTATTA TCTCGATCTC
AATGACCGCT ATGTCTATTT TTATGAGCCG GTTAATGTTG AATACTTTGC GATGAATAAC
TGGTCCATCC TGCAGTCAGG AAGTATTGGC ATCGATCGCA AAGATATTGA AAAGGTATTT
ACCGGGCGTA CCGTATTGTC GAGCATTTAC CAGGATCAGC GTACTAAACA GAACGTGATG
AGTTTGCTGA CGCCGGTGTA TGTTGCAGGG CAGCTAAAAG GGATTGTGCT GCTGGATATT
AACAAAAACA ATCTGCGGAA TATCTTTTAC ACTCATGACC GCCCTCTCCT CTGGCGTTTT
CTCAATGTCA CGCTAACCGA TACAGATTCG GGGCGCGACA TTATCATCAA CCAGAGCGAA
GATAATCTGT TCCAGTATGT CAGTTACGTC CATGACTTAC CGGGCGGCAT TCGTGTCTCG
TTATCCATTG ATATTCTTTA CTTTATCACG TCTTCGTGGA AAAGCGTTCT GTTCTGGATT
TTGACGGCGT TAATTTTGCT GAATATGGTG CGGATGCACT TCCGCTTATA CCAAAATGTG
TCGCGAGAAA ATATTAGTGA TGCGATGACT GGACTGTATA ATCGCAAAAT TTTAACCCCT
GAACTGGAGC AGCGGTTGCA GAAACTGGTG CAATCCGGTT CTTCGGTGAT GTTTATTGCA
ATTGACATGG ACAAGTTAAA GCTAATAAAT GACACCCTCG GTCATCAGGA AGGGGATTTA
GCGATTACGT TGTTAGCTCA GGCGATTAAA CAATCGATTC GTAAAAGTGA TTATGCCATC
CGACTCGGTG GCGATGAATT CTGCATCATT CTTGTCGATT CGACGCCGCA AATTGCAGCA
CAACTGCCTG AACGTATCGA AAAACGTCTG CAACATATCG CGCCGCAGAA AGAGATCGGC
TTCTCTTCCG GTATTTACGC GATGAAAGAA AACGATACGT TACATGATGC GTATAAAGCT
TCCGATGAGC GTTTATATGT CAATAAGCAG AACAAAAACA GCCGTTCATG A
 
Protein sequence
MKLHHRMLRH FIAASVIVLT SSFLIFELVA SDRAMSAYLR YIVQKADSSF LYDKYQNQSI 
AAHVMRALAA EQSEVSPEQR RAICEAFESA NNTHGLNLTA HKYPGLRGTL QTASTDCDTI
VEAAALLPAF DQAVEGNRHQ DDYGSGLGMA EEKFHYYLDL NDRYVYFYEP VNVEYFAMNN
WSILQSGSIG IDRKDIEKVF TGRTVLSSIY QDQRTKQNVM SLLTPVYVAG QLKGIVLLDI
NKNNLRNIFY THDRPLLWRF LNVTLTDTDS GRDIIINQSE DNLFQYVSYV HDLPGGIRVS
LSIDILYFIT SSWKSVLFWI LTALILLNMV RMHFRLYQNV SRENISDAMT GLYNRKILTP
ELEQRLQKLV QSGSSVMFIA IDMDKLKLIN DTLGHQEGDL AITLLAQAIK QSIRKSDYAI
RLGGDEFCII LVDSTPQIAA QLPERIEKRL QHIAPQKEIG FSSGIYAMKE NDTLHDAYKA
SDERLYVNKQ NKNSRS