Gene Athe_2137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2137 
SymbolgroEL 
ID7408846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2271512 
End bp2273131 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content41% 
IMG OID643716502 
Productchaperonin GroEL 
Protein accessionYP_002573985 
Protein GI222530103 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.817524 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGCAA AGATGATATT ATTTGACGAA GAGGCAAGAA GGGCTTTAGA GCGTGGTGTT 
AACAAGCTTG CAGATACAGT TAAAGTAACA CTTGGGCCAA AAGGAAGAAA CGTTGTTCTT
GAAAAGAAAT TTGGTTCACC ACAGATTGTA AATGACGGTG TTACAATTGC AAAAGAGATA
GAGCTTGAAG ACCCATTTGA AAACATGGGT GCACAGATTG TAAGAGAGGT TGCATCCAAG
ACAAACGACA TTGCAGGTGA TGGTACAACA ACTGCAACAG TTCTGGCACA GGCAATGATA
AGAGAAGGTC TTAAGAACAT TGCAGCTGGT GCAAACCCAA TGATTTTAAG GAAAGGTATC
CAGAAAGCAG TTGATGTTGT TGTAGAAGAA ATTAGAAAAA TGAGCAAGAA GGTAAGAGGA
AAAGAAGACA TCACATATGT TGCTTCAATC TCAGCAGGTG ACGAAGAGAT TGGCAAACTT
GTTGCAGATG CAATGGAGAA AGTAACAAAT GACGGTGTTA TCACTGTTGA AGAGTCAAAG
ACAACAGAGA CAACTCTTGA GATAGTTGAA GGTATGCAGT TTGACAGAGG TTACATCTCT
GCATACATGG TAACAGACAC AGAGAGAATG GAAGCGGTAC TTGACGACCC GTACATCTTG
ATTACAGATA AGAAAATCTC AACAATCCAA GACATTCTGC CGCTTCTTGA ACAGATAGTT
CAGCAGGGAA GAAAACTTTT GATAATTGCT GAAGATGTTG AAGGCGAAGC ATTGGCAACA
CTTGTAGTAA ACAAGCTCAG AGGAACACTC CAGTGCGTTG CGGTAAAAGC ACCAGGATTT
GGTGACAGAA GAAAAGCAAT GCTTCAAGAC ATTGCAATAT TAACTGGTGG TCAAGTAATT
TCTGAAGAGC TTGGTCTTGA CTTAAGAGAG GTAAAACTCA GCCAGCTTGG TCGTGCAAGA
CAAGTAAAAG TTCAGAAAGA AAATACAATT ATTGTTGACG GTGCAGGCGA CCCAAGCGAA
ATCAAGGCGA GAATTCAGTC AATCAAAAAG CAGATTGAAG AGACAACATC TGACTTTGAC
AGAGAAAAAC TTCAGGAAAG ACTTGCAAAA CTTGCTGGTG GTGTTGCAGT AATTCATGTT
GGTGCTGCAA CTGAGACTGA ACTTAAAGAA AAGAAACTCA GAATTGAAGA TGCTCTTGCT
GCAACAAAGG CTGCAGTAGA AGAAGGAATT GTACCTGGCG GTGGTACAGC TTTAATTAAT
GCAATTCCAG CCCTTGATAA GCTTATTGAA AGCCTCACTG GCGATGAAAA GACAGGTGCA
ATGATTGTAA GAAAAGCTTT GGAAGAGCCA CTCAGACAAA TTGCTGAAAA CGCAGGTTTA
GATGGTTCAG TTATTGTTAA CAAAGTAAAA GAAAGCCCAG CTGGTGTTGG ATTTGACGCA
CTCAACGAGA GATTTGTTGA CATGTTCGAG GCAGGTATTG TTGACCCAAC AAAGGTTACA
AGAACGGCTA TTCAGAACGC TGCATCGGCT GCTGCTATGC TTCTGACAAC AGAAGCAGTT
GTTGCTGAAA AACCTGAAAA GGAAAAGAAT CCACCAGCTC CAGCACCTGA TATGTATTAA
 
Protein sequence
MAAKMILFDE EARRALERGV NKLADTVKVT LGPKGRNVVL EKKFGSPQIV NDGVTIAKEI 
ELEDPFENMG AQIVREVASK TNDIAGDGTT TATVLAQAMI REGLKNIAAG ANPMILRKGI
QKAVDVVVEE IRKMSKKVRG KEDITYVASI SAGDEEIGKL VADAMEKVTN DGVITVEESK
TTETTLEIVE GMQFDRGYIS AYMVTDTERM EAVLDDPYIL ITDKKISTIQ DILPLLEQIV
QQGRKLLIIA EDVEGEALAT LVVNKLRGTL QCVAVKAPGF GDRRKAMLQD IAILTGGQVI
SEELGLDLRE VKLSQLGRAR QVKVQKENTI IVDGAGDPSE IKARIQSIKK QIEETTSDFD
REKLQERLAK LAGGVAVIHV GAATETELKE KKLRIEDALA ATKAAVEEGI VPGGGTALIN
AIPALDKLIE SLTGDEKTGA MIVRKALEEP LRQIAENAGL DGSVIVNKVK ESPAGVGFDA
LNERFVDMFE AGIVDPTKVT RTAIQNAASA AAMLLTTEAV VAEKPEKEKN PPAPAPDMY