Gene ECH74115_2103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2103 
Symbol 
ID6967635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2001591 
End bp2002973 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content45% 
IMG OID643386002 
Productdiguanylate cyclase 
Protein accessionYP_002270491 
Protein GI209396130 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGATGT ATTTTAAAAG AATGAAAGAT GAGTGGACCG GACTTGTCGA GCAGGCAGAT 
CCGCCCATTC GTGCTAAAGC CGCGGAAATT GCCGTTGCGC ATGCTCATTA TCTGAGTATC
GAGTTTTATC GAATTGTCCG CATCGACCCG CATGCCGAAG AATTCTTGAG TAATGAACAA
GTTGAGCGGC AGTTGAAAAG TGCGATGGAA CGCTGGATTA TTAACGTGCT TTCTGCCCAG
GTTGACGATG TCGAAAGGCT AATACAAATC CAGCATACCG TCGCGGAAGT GCATGCCCGC
ATAGGAATTC CGGTAGAAAT TGTCGAGATG GGGTTTCGGG TGCTGAAAAA GATCCTCTAT
CCGGTAATCT TCTCTTCGGA TTATTCTGCC GCAGAAAAAC TTCAGGTCTA CCATTTCTCG
ATTAACAGTA TTGATATCGC TATGGAAGTG ATGACCCGCG CGTTTACCTT TAGTGACAGT
AGTGCCTCAA AAGAAGATGA AAACTATCGT ATCTTCTCGT TACTGGAAAA CGCCGAAGAA
GAAAAAGAAC GGCAAATAGC CTCAATACTT TCATGGGAAA TAGATATTAT CTATAAAATC
CTGCTGGATT CTGATTTAGG CAGTAGTTTG CCTTTAAGCC AGGCTGATTT TGGCCTGTGG
TTTAACCATA AAGGTCGACA TTATTTCAGC GGCATTGCTG AAGTGGGCCA TATCTCTCGT
CTGATTCAGG ATTTCGACGG TATTTTCAAT CAAACCATGC GTAACACCAG GAATTTGAAT
AACAGAAGTC TGCGGGTGAA ATTTTTATTA CAGATAAGAA ATACCGTATC GCAAATTATT
ACCTTGCTGC GTGAATTGTT TGAAGAAGTA TCGCGCCACG AAGTCGGTAT GGATGTACTG
ACGAAATTAC TTAACCGCCG TTTCCTGCCG ACTATCTTCA AACGCGAAAT TGCCCATGCC
AACCGGACCG GTACACCGCT GTCAGTGCTG ATTATTGACG TTGATAAATT CAAAGAGATC
AACGATACGT GGGGCCATAA CACTGGCGAT GAAATTCTGC GTAAAGTCTC TCAGGCCTTT
TATGACAACG TCCGCAGTAG TGATTATGTT TTCCGCTACG GGGGCGATGA ATTTATCATT
GTTTTGACTG AAGCTTCTGA AAACGAAACG TTACGTACCG CAGAACGTAT TCGCAGTCGG
GTGGAGAAAA CCAAACTGAA AGCCGCAAAC GGCGAAGATA TTGCCCTCTC ACTTTCCATC
GGTGCCGCCA TGTTTAATGG TCATCCTGAC TATGAGCGCC TCATTCAAAT AGCCGATGAA
GCTCTGTATA TCGCCAAAAG ACGAGGTAGA AACCGTGTTG AACTCTGGAA AGCCAGTCTT
TAG
 
Protein sequence
MEMYFKRMKD EWTGLVEQAD PPIRAKAAEI AVAHAHYLSI EFYRIVRIDP HAEEFLSNEQ 
VERQLKSAME RWIINVLSAQ VDDVERLIQI QHTVAEVHAR IGIPVEIVEM GFRVLKKILY
PVIFSSDYSA AEKLQVYHFS INSIDIAMEV MTRAFTFSDS SASKEDENYR IFSLLENAEE
EKERQIASIL SWEIDIIYKI LLDSDLGSSL PLSQADFGLW FNHKGRHYFS GIAEVGHISR
LIQDFDGIFN QTMRNTRNLN NRSLRVKFLL QIRNTVSQII TLLRELFEEV SRHEVGMDVL
TKLLNRRFLP TIFKREIAHA NRTGTPLSVL IIDVDKFKEI NDTWGHNTGD EILRKVSQAF
YDNVRSSDYV FRYGGDEFII VLTEASENET LRTAERIRSR VEKTKLKAAN GEDIALSLSI
GAAMFNGHPD YERLIQIADE ALYIAKRRGR NRVELWKASL