Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2892 |
Symbol | groEL |
ID | 4809099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3417439 |
End bp | 3419064 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640108311 |
Product | chaperonin GroEL |
Protein accession | YP_001039283 |
Protein GI | 125975373 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAGC AAATAAAATT TGGTGAAGAA GCAAGAAGAG CTCTGGAAAG AGGCGTTAAT CAATTAGCAG ATACAGTTAA AGTTACTCTC GGACCTAAGG GAAGAAACGT TGTACTTGAC AAGAAATTCG GTTCACCGAT GATTACAAAT GACGGTGTAA CCATTGCTAA AGAAATTGAG CTTGAAGATC CGTTTGAAAA CATGGGTGCG CAGCTTGTAA AAGAAGTTGC CACAAAAACC AACGATGTTG CCGGTGACGG TACAACTACA GCAACACTTC TTGCACAGGC TATAATCAGA GAAGGACTTA AGAACGTTGC AGCCGGTGCA AACCCGATGC TTCTTAAAAA GGGTATAGCA AAAGCTGTTG ATGCGGCAGT TGAAGGTATC AAGGAAATCA GCCAGAAGGT TAAAGGAAAA GAAGATATAG CAAGGGTTGC TTCAATTTCC GCCAATGACG AAGTTATTGG TGAATTGATA GCCGATGCTA TGGAAAAAGT TACAAATGAC GGTGTTATCA CTGTTGAAGA AGCAAAGACA ATGGGCACAA ACCTCGAAAT AGTTGAAGGT ATGCAGTTTG ACAGAGGTTA TGTATCACCA TACATGGTTA CTGACACTGA AAAGATGGAA GCTGTTCTTG ATGAGCCTTA CATCCTCATT ACAGACAAGA AAATAAGCAA TATCCAGGAC ATTCTCCCAT TGCTGGAACA GATAGTTCAG CAGGGCAAGA AACTGGTTAT CATTGCTGAG GATGTTGAGG GCGAAGCTCT TGCAACATTG CTTGTAAACA AATTAAGAGG TACATTCACA TGCGTTGCTG TTAAAGCACC TGGCTTTGGT GACAGAAGAA AAGCTATGCT TGAAGATATA GCAATTCTCA CCGGCGGTCA GGTTATCACA TCAGACCTCG GTCTTGAACT TAAGGATACT ACTGTTGAAC AGCTCGGTAG AGCAAGACAG GTTAAAGTTC AGAAAGAAAA CACAATTATT GTTGACGGTG CGGGAGATCC AAAAGAAATA CAGAAGAGAA TTGCATCCAT AAAGTCTCAA ATTGAAGAGA CGACTTCCGA CTTTGACAGA GAAAAACTTC AGGAAAGACT TGCAAAACTT GCCGGCGGCG TAGCTGTAAT CCAGGTTGGT GCTGCTACTG AAACAGAAAT GAAGGAAAAG AAATTGAGAA TCGAAGACGC TCTTGCTGCT ACAAAGGCTG CCGTTGAAGA AGGAATAGTA GCAGGCGGAG GAACAGCTCT GGTAAATGTT ATTCCGAAGG TTGCAAAGGT TCTCGATACT GTATCCGGAG ACGAAAAGAC CGGTGTACAG ATTATTTTGA GAGCTTTGGA AGAGCCGGTT AGACAAATTG CTGAAAATGC AGGTCTTGAA GGTTCCGTAA TAGTTGAAAA GGTTAAGGCC AGCGAACCTG GTATTGGATT TGACGCATAC AATGAAAAAT ATGTTAACAT GATTGAAGCC GGAATAGTTG ACCCTGCAAA AGTAACAAGG TCAGCTTTGC AAAATGCTGC ATCCGTTGCT TCAATGGTAC TTACCACTGA AAGTGTTGTT GCCGACATTC CTGAAAAAGA AACAAGCGGA GGCCCCGGTG GAGCGGGCAT GGGCGGAATG TACTAA
|
Protein sequence | MAKQIKFGEE ARRALERGVN QLADTVKVTL GPKGRNVVLD KKFGSPMITN DGVTIAKEIE LEDPFENMGA QLVKEVATKT NDVAGDGTTT ATLLAQAIIR EGLKNVAAGA NPMLLKKGIA KAVDAAVEGI KEISQKVKGK EDIARVASIS ANDEVIGELI ADAMEKVTND GVITVEEAKT MGTNLEIVEG MQFDRGYVSP YMVTDTEKME AVLDEPYILI TDKKISNIQD ILPLLEQIVQ QGKKLVIIAE DVEGEALATL LVNKLRGTFT CVAVKAPGFG DRRKAMLEDI AILTGGQVIT SDLGLELKDT TVEQLGRARQ VKVQKENTII VDGAGDPKEI QKRIASIKSQ IEETTSDFDR EKLQERLAKL AGGVAVIQVG AATETEMKEK KLRIEDALAA TKAAVEEGIV AGGGTALVNV IPKVAKVLDT VSGDEKTGVQ IILRALEEPV RQIAENAGLE GSVIVEKVKA SEPGIGFDAY NEKYVNMIEA GIVDPAKVTR SALQNAASVA SMVLTTESVV ADIPEKETSG GPGGAGMGGM Y
|
| |