Gene Information |
Locus tag | ECH74115_5659 |
Symbol | groEL |
ID | 6971053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 5300676 |
End bp | 5302322 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643389292 |
Product | chaperonin GroEL |
Protein accession | YP_002273688 |
Protein GI | 209398215 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
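
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (both are assumptions, not stated in the record); the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 5300676
end_bp = 5302322

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the table.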
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.315114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCTA AAGACGTAAA ATTCGGTAAC GACGCTCGTG TGAAAATGCT GCGCGGCGTA AACGTACTGG CAGATGCAGT GAAAGTTACC CTCGGTCCGA AAGGCCGTAA CGTAGTTCTG GATAAATCTT TCGGTGCACC GACCATCACC AAAGATGGTG TTTCCGTTGC TCGTGAAATC GAACTGGAAG ACAAGTTCGA AAACATGGGT GCGCAGATGG TGAAAGAAGT TGCCTCTAAA GCGAACGACG CTGCAGGCGA CGGTACCACC ACTGCAACCG TACTGGCTCA GGCTATCATC ACTGAAGGTC TGAAAGCTGT TGCTGCGGGC ATGAACCCGA TGGACCTGAA ACGTGGTATC GACAAAGCTG TTACCGCTGC AGTTGAAGAA CTGAAAGCGC TGTCCGTACC GTGCTCTGAC TCTAAAGCGA TTGCTCAGGT TGGTACTATC TCCGCTAACT CCGACGAAAC CGTAGGTAAA CTGATCGCTG AAGCGATGGA CAAAGTCGGT AAAGAAGGCG TTATCACCGT TGAAGACGGT ACCGGTCTGC AGGACGAACT GGACGTGGTT GAAGGTATGC AGTTCGACCG TGGCTACCTG TCTCCTTACT TCATCAACAA GCCGGAAACT GGCGCAGTAG AACTGGAAAG CCCGTTCATC CTGCTGGCTG ACAAGAAAAT CTCCAACATC CGCGAAATGC TGCCGGTTCT GGAAGCCGTT GCCAAAGCAG GCAAACCGCT GCTGATCATC GCTGAAGATG TAGAAGGCGA AGCGCTGGCA ACTCTGGTTG TTAACACCAT GCGTGGCATC GTGAAAGTTG CTGCAGTTAA AGCTCCGGGC TTCGGCGATC GTCGTAAAGC TATGCTGCAG GATATCGCAA CCCTGACTGG CGGTACCGTA ATCTCTGAAG AGATCGGTAT GGAGCTGGAA AAAGCAACCC TGGAAGACCT GGGTCAGGCT AAACGCGTTG TGATCAACAA AGACACCACC ACCATCATCG ATGGCGTGGG CGAAGAAGCT GCAATCCAGG GCCGTGTTGC TCAGATCCGT CAGCAGATTG AAGAAGCAAC TTCTGACTAC GACCGTGAAA AACTGCAGGA GCGCGTAGCG AAACTGGCAG GCGGCGTTGC AGTTATCAAA GTAGGTGCTG CTACCGAAGT TGAAATGAAA GAGAAAAAAG CACGCGTTGA AGACGCCCTG CACGCGACCC GTGCTGCGGT AGAAGAAGGC GTGGTTGCTG GTGGTGGTGT TGCGCTGATC CGCGTAGCGT CTAAACTGGC TGACCTGCGT GGTCAGAACG AAGACCAGAA CGTGGGTATC AAAGTTGCAC TGCGTGCAAT GGAAGCTCCG CTGCGTCAGA TCGTCCTGAA CTGCGGCGAA GAACCGTCTG TTGTTGCTAA CACCGTTAAA GGCGGCGACG GCAACTACGG TTACAACGCA GCAACCGAAG AATACGGCAA CATGATCGAC ATGGGTATCC TGGACCCAAC CAAAGTAACC CGTTCTGCTC TGCAGTACGC GGCTTCTGTG GCTGGCCTGA TGATCACCAC CGAATGCATG GTTACCGACC TGCCGAAAAA CGATGCAGCT GACTTAGGCG CTGCTGGCGG CATGGGTGGC ATGGGTGGCA TGGGCGGCAT GATGTAA
|
Protein sequence | MAAKDVKFGN DARVKMLRGV NVLADAVKVT LGPKGRNVVL DKSFGAPTIT KDGVSVAREI ELEDKFENMG AQMVKEVASK ANDAAGDGTT TATVLAQAII TEGLKAVAAG MNPMDLKRGI DKAVTAAVEE LKALSVPCSD SKAIAQVGTI SANSDETVGK LIAEAMDKVG KEGVITVEDG TGLQDELDVV EGMQFDRGYL SPYFINKPET GAVELESPFI LLADKKISNI REMLPVLEAV AKAGKPLLII AEDVEGEALA TLVVNTMRGI VKVAAVKAPG FGDRRKAMLQ DIATLTGGTV ISEEIGMELE KATLEDLGQA KRVVINKDTT TIIDGVGEEA AIQGRVAQIR QQIEEATSDY DREKLQERVA KLAGGVAVIK VGAATEVEMK EKKARVEDAL HATRAAVEEG VVAGGGVALI RVASKLADLR GQNEDQNVGI KVALRAMEAP LRQIVLNCGE EPSVVANTVK GGDGNYGYNA ATEEYGNMID MGILDPTKVT RSALQYAASV AGLMITTECM VTDLPKNDAA DLGAAGGMGG MGGMGGMM
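
The correspondence between the two sequences can be spot-checked by translating the opening codons of the gene sequence. This is a minimal sketch, not a full implementation of translation table 11: `CODON_TABLE` is a hypothetical, partial mapping that holds only the codons occurring in the first 30 bases, and the spaces between 10-base groups are treated as display formatting.

```python
# Partial codon table: only the codons present in this excerpt.
# Assumption: these assignments are the standard ones shared by table 11.
CODON_TABLE = {
    "ATG": "M", "GCA": "A", "GCT": "A", "AAA": "K", "GAC": "D",
    "GTA": "V", "TTC": "F", "GGT": "G", "AAC": "N",
}

# First three 10-base groups of the gene sequence, as displayed in the record.
gene_prefix = "ATGGCAGCTA AAGACGTAAA ATTCGGTAAC".replace(" ", "")

# Split into consecutive codons starting at the first base (forward strand).
codons = [gene_prefix[i:i + 3] for i in range(0, len(gene_prefix), 3)]

# Translate each codon with the partial table.
protein_prefix = "".join(CODON_TABLE[codon] for codon in codons)

print(protein_prefix)
```

The result can be compared with the first 10-residue group of the protein sequence above. A complete check would need a full codon table and the entire gene sequence.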