Gene Cthe_2892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2892 
SymbolgroEL 
ID4809099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3417439 
End bp3419064 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content43% 
IMG OID640108311 
Productchaperonin GroEL 
Protein accessionYP_001039283 
Protein GI125975373 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGC AAATAAAATT TGGTGAAGAA GCAAGAAGAG CTCTGGAAAG AGGCGTTAAT 
CAATTAGCAG ATACAGTTAA AGTTACTCTC GGACCTAAGG GAAGAAACGT TGTACTTGAC
AAGAAATTCG GTTCACCGAT GATTACAAAT GACGGTGTAA CCATTGCTAA AGAAATTGAG
CTTGAAGATC CGTTTGAAAA CATGGGTGCG CAGCTTGTAA AAGAAGTTGC CACAAAAACC
AACGATGTTG CCGGTGACGG TACAACTACA GCAACACTTC TTGCACAGGC TATAATCAGA
GAAGGACTTA AGAACGTTGC AGCCGGTGCA AACCCGATGC TTCTTAAAAA GGGTATAGCA
AAAGCTGTTG ATGCGGCAGT TGAAGGTATC AAGGAAATCA GCCAGAAGGT TAAAGGAAAA
GAAGATATAG CAAGGGTTGC TTCAATTTCC GCCAATGACG AAGTTATTGG TGAATTGATA
GCCGATGCTA TGGAAAAAGT TACAAATGAC GGTGTTATCA CTGTTGAAGA AGCAAAGACA
ATGGGCACAA ACCTCGAAAT AGTTGAAGGT ATGCAGTTTG ACAGAGGTTA TGTATCACCA
TACATGGTTA CTGACACTGA AAAGATGGAA GCTGTTCTTG ATGAGCCTTA CATCCTCATT
ACAGACAAGA AAATAAGCAA TATCCAGGAC ATTCTCCCAT TGCTGGAACA GATAGTTCAG
CAGGGCAAGA AACTGGTTAT CATTGCTGAG GATGTTGAGG GCGAAGCTCT TGCAACATTG
CTTGTAAACA AATTAAGAGG TACATTCACA TGCGTTGCTG TTAAAGCACC TGGCTTTGGT
GACAGAAGAA AAGCTATGCT TGAAGATATA GCAATTCTCA CCGGCGGTCA GGTTATCACA
TCAGACCTCG GTCTTGAACT TAAGGATACT ACTGTTGAAC AGCTCGGTAG AGCAAGACAG
GTTAAAGTTC AGAAAGAAAA CACAATTATT GTTGACGGTG CGGGAGATCC AAAAGAAATA
CAGAAGAGAA TTGCATCCAT AAAGTCTCAA ATTGAAGAGA CGACTTCCGA CTTTGACAGA
GAAAAACTTC AGGAAAGACT TGCAAAACTT GCCGGCGGCG TAGCTGTAAT CCAGGTTGGT
GCTGCTACTG AAACAGAAAT GAAGGAAAAG AAATTGAGAA TCGAAGACGC TCTTGCTGCT
ACAAAGGCTG CCGTTGAAGA AGGAATAGTA GCAGGCGGAG GAACAGCTCT GGTAAATGTT
ATTCCGAAGG TTGCAAAGGT TCTCGATACT GTATCCGGAG ACGAAAAGAC CGGTGTACAG
ATTATTTTGA GAGCTTTGGA AGAGCCGGTT AGACAAATTG CTGAAAATGC AGGTCTTGAA
GGTTCCGTAA TAGTTGAAAA GGTTAAGGCC AGCGAACCTG GTATTGGATT TGACGCATAC
AATGAAAAAT ATGTTAACAT GATTGAAGCC GGAATAGTTG ACCCTGCAAA AGTAACAAGG
TCAGCTTTGC AAAATGCTGC ATCCGTTGCT TCAATGGTAC TTACCACTGA AAGTGTTGTT
GCCGACATTC CTGAAAAAGA AACAAGCGGA GGCCCCGGTG GAGCGGGCAT GGGCGGAATG
TACTAA
 
Protein sequence
MAKQIKFGEE ARRALERGVN QLADTVKVTL GPKGRNVVLD KKFGSPMITN DGVTIAKEIE 
LEDPFENMGA QLVKEVATKT NDVAGDGTTT ATLLAQAIIR EGLKNVAAGA NPMLLKKGIA
KAVDAAVEGI KEISQKVKGK EDIARVASIS ANDEVIGELI ADAMEKVTND GVITVEEAKT
MGTNLEIVEG MQFDRGYVSP YMVTDTEKME AVLDEPYILI TDKKISNIQD ILPLLEQIVQ
QGKKLVIIAE DVEGEALATL LVNKLRGTFT CVAVKAPGFG DRRKAMLEDI AILTGGQVIT
SDLGLELKDT TVEQLGRARQ VKVQKENTII VDGAGDPKEI QKRIASIKSQ IEETTSDFDR
EKLQERLAKL AGGVAVIQVG AATETEMKEK KLRIEDALAA TKAAVEEGIV AGGGTALVNV
IPKVAKVLDT VSGDEKTGVQ IILRALEEPV RQIAENAGLE GSVIVEKVKA SEPGIGFDAY
NEKYVNMIEA GIVDPAKVTR SALQNAASVA SMVLTTESVV ADIPEKETSG GPGGAGMGGM
Y