Gene Information |
Locus tag | GWCH70_0231 |
Symbol | groEL |
ID | 7976099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 254734 |
End bp | 256353 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644797225 |
Product | chaperonin GroEL |
Protein accession | YP_002948428 |
Protein GI | 239825804 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
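The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the listed Gene Length) and that the gene length includes the terminal stop codon. The function names `span_length` and `expected_protein_length` are illustrative and not part of any database tool.

```python
# Values copied from the Gene Information table above.
START_BP = 254734
END_BP = 256353
GENE_LENGTH_BP = 1620
PROTEIN_LENGTH_AA = 539


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    computed_length = span_length(START_BP, END_BP)
    print("gene length matches:", computed_length == GENE_LENGTH_BP)
    print(
        "protein length matches:",
        expected_protein_length(computed_length) == PROTEIN_LENGTH_AA,
    )
```

The same check applies to any unspliced, non-pseudo CDS record; a spliced gene would need its exon ranges summed instead.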
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00131384 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAAG AAATTAAATT CAGCGAAGAA GCTCGTCGTG CGATGTTACG TGGTGTGGAC AAATTAGCTG ATGCGGTAAA AGTAACGTTA GGTCCAAAAG GCCGTAACGT CGTATTAGAG AAAAAATTCG GTTCTCCATT AATTACGAAT GACGGTGTAA CGATCGCGAA AGAAATCGAA TTAGAAGATC CATTTGAAAA CATGGGTGCG AAGCTTGTTG CTGAAGTTGC AAGCAAAACA AACGATGTTG CTGGGGACGG TACAACAACG GCAACAGTTT TAGCGCAAGC AATGATCCGC GAAGGATTGA AAAACGTTAC AGCTGGCGCT AACCCAATGG GCATCCGTAA AGGTATTGAA AAAGCGGTCG CTGTGGCAGT AGAAGAATTA AAAGCAATCT CCAAACCAAT CAAAGGAAAA GAATCGATTG CTCAAGTTGC GGCTATCTCT GCAGCTGACG AAGAAGTTGG CCAATTAATC GCAGAAGCAA TGGAACGCGT TGGTAACGAC GGTGTTATCA CATTAGAAGA ATCAAAAGGC TTCACAACAG AATTAGATGT TGTGGAAGGT ATGCAATTTG ACCGTGGTTA TGTATCTCCA TACATGATCA CAGATACAGA AAAAATGGAA GCAGTGCTTG AAAATCCATA TATCTTAATC ACAGACAAAA AAATCTCTAA CATCCAAGAC ATCTTGCCTA TCTTAGAACA AGTAGTTCAA CAAGGCAAAC CATTATTAAT CATCGCAGAA GATGTCGAAG GCGAAGCACT TGCGACATTA GTAGTGAACA AACTTCGCGG TACATTCACT GCCGTAGCAG TTAAAGCTCC TGGCTTCGGT GATCGTCGTA AAGCAATGCT TGAAGACATC GCAATCTTAA CTGGCGGTGA AGTTATCTCC GAAGAACTAG GACGTGACTT AAAATCTACA ACAATCGCAT CACTTGGCCG CGCTTCGAAA GTAGTCGTAA CAAAAGAAAA TACAACAATT GTAGATGGCG CTGGCGACTC TGAACGCATT AAAGCTCGCA TCAACCAAAT TCGTGCGCAA TTAGAAGAAA CAACTTCTGA ATTCGATCGT GAAAAATTGC AAGAACGTCT AGCAAAATTA GCTGGCGGCG TAGCGGTAAT CAAAGTTGGT GCAGCTACAG AAACAGAATT GAAAGAACGC AAATTACGCA TCGAAGACGC GCTCAACTCT ACTCGTGCCG CTGTGGAAGA AGGTATCGTA GCCGGCGGTG GTACAGCATT AATGAACGTA TACAATAAAG TTGCTGCGAT TGAAGCAGAA GGCGATGAAG CAACTGGTGT GAAAATCGTT CTTCGCGCAA TTGAAGAGCC AGTTCGCCAA ATCGCGCAAA ACGCTGGTTT GGAAGGCTCT GTCATTGTTG AACGCTTAAA AACAGAAAAA CCTGGCATCG GCTTCAACGC GGCTACTGGT GAATGGGTAG ACATGATTGA AGCTGGTATC GTAGACCCAA CGAAAGTAAC TCGTTCCGCA CTTCAAAACG CAGCTTCTGT TGCCGCTATG TTCTTAACAA CAGAAGCAGT TGTCGCTGAC AAACCAGAAG AAAACAAAGG CGGCAACCCA GGCATGCCTG ACATGGGCGG AATGATGTAA
|
Protein sequence | MAKEIKFSEE ARRAMLRGVD KLADAVKVTL GPKGRNVVLE KKFGSPLITN DGVTIAKEIE LEDPFENMGA KLVAEVASKT NDVAGDGTTT ATVLAQAMIR EGLKNVTAGA NPMGIRKGIE KAVAVAVEEL KAISKPIKGK ESIAQVAAIS AADEEVGQLI AEAMERVGND GVITLEESKG FTTELDVVEG MQFDRGYVSP YMITDTEKME AVLENPYILI TDKKISNIQD ILPILEQVVQ QGKPLLIIAE DVEGEALATL VVNKLRGTFT AVAVKAPGFG DRRKAMLEDI AILTGGEVIS EELGRDLKST TIASLGRASK VVVTKENTTI VDGAGDSERI KARINQIRAQ LEETTSEFDR EKLQERLAKL AGGVAVIKVG AATETELKER KLRIEDALNS TRAAVEEGIV AGGGTALMNV YNKVAAIEAE GDEATGVKIV LRAIEEPVRQ IAQNAGLEGS VIVERLKTEK PGIGFNAATG EWVDMIEAGI VDPTKVTRSA LQNAASVAAM FLTTEAVVAD KPEENKGGNP GMPDMGGMM
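The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup, plus a GC fraction calculation of the kind behind the GC content field; the helper names `normalize_sequence` and `gc_fraction` are hypothetical, and the sample is only the first three blocks of the gene sequence, so its GC fraction is not expected to equal the 44% reported for the whole gene.

```python
def normalize_sequence(blocked: str) -> str:
    """Join a sequence shown in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (value between 0.0 and 1.0)."""
    if not dna:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


if __name__ == "__main__":
    # First three display blocks of the gene sequence above.
    sample = "ATGGCAAAAG AAATTAAATT CAGCGAAGAA"
    joined = normalize_sequence(sample)
    print(joined)
    print(len(joined), "bp, GC fraction", round(gc_fraction(joined), 3))
```

Passing the full gene sequence through `normalize_sequence` should give a string whose length can be compared against the Gene Length field.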
|
| |