Gene ECH74115_5659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5659 
SymbolgroEL 
ID6971053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5300676 
End bp5302322 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content53% 
IMG OID643389292 
Productchaperonin GroEL 
Protein accessionYP_002273688 
Protein GI209398215 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.315114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCTA AAGACGTAAA ATTCGGTAAC GACGCTCGTG TGAAAATGCT GCGCGGCGTA 
AACGTACTGG CAGATGCAGT GAAAGTTACC CTCGGTCCGA AAGGCCGTAA CGTAGTTCTG
GATAAATCTT TCGGTGCACC GACCATCACC AAAGATGGTG TTTCCGTTGC TCGTGAAATC
GAACTGGAAG ACAAGTTCGA AAACATGGGT GCGCAGATGG TGAAAGAAGT TGCCTCTAAA
GCGAACGACG CTGCAGGCGA CGGTACCACC ACTGCAACCG TACTGGCTCA GGCTATCATC
ACTGAAGGTC TGAAAGCTGT TGCTGCGGGC ATGAACCCGA TGGACCTGAA ACGTGGTATC
GACAAAGCTG TTACCGCTGC AGTTGAAGAA CTGAAAGCGC TGTCCGTACC GTGCTCTGAC
TCTAAAGCGA TTGCTCAGGT TGGTACTATC TCCGCTAACT CCGACGAAAC CGTAGGTAAA
CTGATCGCTG AAGCGATGGA CAAAGTCGGT AAAGAAGGCG TTATCACCGT TGAAGACGGT
ACCGGTCTGC AGGACGAACT GGACGTGGTT GAAGGTATGC AGTTCGACCG TGGCTACCTG
TCTCCTTACT TCATCAACAA GCCGGAAACT GGCGCAGTAG AACTGGAAAG CCCGTTCATC
CTGCTGGCTG ACAAGAAAAT CTCCAACATC CGCGAAATGC TGCCGGTTCT GGAAGCCGTT
GCCAAAGCAG GCAAACCGCT GCTGATCATC GCTGAAGATG TAGAAGGCGA AGCGCTGGCA
ACTCTGGTTG TTAACACCAT GCGTGGCATC GTGAAAGTTG CTGCAGTTAA AGCTCCGGGC
TTCGGCGATC GTCGTAAAGC TATGCTGCAG GATATCGCAA CCCTGACTGG CGGTACCGTA
ATCTCTGAAG AGATCGGTAT GGAGCTGGAA AAAGCAACCC TGGAAGACCT GGGTCAGGCT
AAACGCGTTG TGATCAACAA AGACACCACC ACCATCATCG ATGGCGTGGG CGAAGAAGCT
GCAATCCAGG GCCGTGTTGC TCAGATCCGT CAGCAGATTG AAGAAGCAAC TTCTGACTAC
GACCGTGAAA AACTGCAGGA GCGCGTAGCG AAACTGGCAG GCGGCGTTGC AGTTATCAAA
GTAGGTGCTG CTACCGAAGT TGAAATGAAA GAGAAAAAAG CACGCGTTGA AGACGCCCTG
CACGCGACCC GTGCTGCGGT AGAAGAAGGC GTGGTTGCTG GTGGTGGTGT TGCGCTGATC
CGCGTAGCGT CTAAACTGGC TGACCTGCGT GGTCAGAACG AAGACCAGAA CGTGGGTATC
AAAGTTGCAC TGCGTGCAAT GGAAGCTCCG CTGCGTCAGA TCGTCCTGAA CTGCGGCGAA
GAACCGTCTG TTGTTGCTAA CACCGTTAAA GGCGGCGACG GCAACTACGG TTACAACGCA
GCAACCGAAG AATACGGCAA CATGATCGAC ATGGGTATCC TGGACCCAAC CAAAGTAACC
CGTTCTGCTC TGCAGTACGC GGCTTCTGTG GCTGGCCTGA TGATCACCAC CGAATGCATG
GTTACCGACC TGCCGAAAAA CGATGCAGCT GACTTAGGCG CTGCTGGCGG CATGGGTGGC
ATGGGTGGCA TGGGCGGCAT GATGTAA
 
Protein sequence
MAAKDVKFGN DARVKMLRGV NVLADAVKVT LGPKGRNVVL DKSFGAPTIT KDGVSVAREI 
ELEDKFENMG AQMVKEVASK ANDAAGDGTT TATVLAQAII TEGLKAVAAG MNPMDLKRGI
DKAVTAAVEE LKALSVPCSD SKAIAQVGTI SANSDETVGK LIAEAMDKVG KEGVITVEDG
TGLQDELDVV EGMQFDRGYL SPYFINKPET GAVELESPFI LLADKKISNI REMLPVLEAV
AKAGKPLLII AEDVEGEALA TLVVNTMRGI VKVAAVKAPG FGDRRKAMLQ DIATLTGGTV
ISEEIGMELE KATLEDLGQA KRVVINKDTT TIIDGVGEEA AIQGRVAQIR QQIEEATSDY
DREKLQERVA KLAGGVAVIK VGAATEVEMK EKKARVEDAL HATRAAVEEG VVAGGGVALI
RVASKLADLR GQNEDQNVGI KVALRAMEAP LRQIVLNCGE EPSVVANTVK GGDGNYGYNA
ATEEYGNMID MGILDPTKVT RSALQYAASV AGLMITTECM VTDLPKNDAA DLGAAGGMGG
MGGMGGMM