Gene Clim_0498 details


Gene Information

Locus tag: Clim_0498
Symbol: groEL
ID: 6354845
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 564527
End bp: 566170
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 55%
IMG OID: 642668131
Product: chaperonin GroEL
Protein accession: YP_001942570
Protein GI: 189346041
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
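
The start and end coordinates, gene length, and protein length in this record agree with one another if the coordinates are read as 1-based and inclusive and the coding sequence ends in a single untranslated stop codon. Both of these are assumptions, not statements made by the record. A minimal Python sketch of the arithmetic, with illustrative variable names:

```python
# Values copied from the Gene Information fields of this record.
start_bp = 564527
end_bp = 566170

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1644 547
```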


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000000658221
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGCTA AAGATATTAT TTTTGATTCC GACGCCAGAG CAAAGCTCAA AGTCGGTGTG 
GACAAACTGG CTAATGCCGT GAAAGTTACC CTCGGACCTG CCGGACGAAA TGTGCTTATC
GATAAAAAGT TCGGAGCTCC GACCTCGACC AAAGATGGCG TGACCGTAGC AAAAGAGATC
GAACTTGCCG ACGCCGTCGA GAACATGGGC GCACAGATGG TTCGCGAAGT CGCTTCGAAA
ACCAGTGATG TCGCCGGTGA CGGTACTACT ACCGCGACGG TGCTTGCTCA GGCAATCTAC
CGTGAGGGGC TGAAGAACGT TGCCGCAGGT GCACGTCCGA TCGATCTTAA AAGAGGTATC
GACCGTGCCG TGAAAGAGGT TGTCGCAGAA CTGCGCAACA TCAGCCGCAG CATCTCCGGC
AAAAAGGAGA TCGCACAGGT CGGCACCATT TCAGCCAACA ACGATCCTGA AATCGGCGAA
CTGATCGCAG AGGCCATGGA CAAGGTCGGC AAGGACGGCG TTATCACCGT CGAAGAGGCA
AAGGGCATGG ATACCGAGTT GAAGGTTGTC GAGGGCATGC AGTTCGATCG CGGCTACCTC
TCTCCGTATT TCGTGACCAA TCCCGAAACC ATGGAAGCCG AAATCGAAGA CCCGCTGATC
CTCATTCACG ACAAGAAGAT CGGCAACATG AAAGAACTGC TTCCGATCCT CGAAAAATCA
GCCCAGTCAG GTCGTCCTTT GCTCATCATT GCAGAGGATA TCGAAGGCGA AGCGCTGGCA
ACCCTTGTGG TCAACAAGCT CAGAGGTACT CTGAAAGTCT GTGCCGTGAA GGCTCCGGGA
TTCGGCGACC GTCGCAAGGC AATGCTCGAG GATATCGCCA TTCTTACCGG CGGTACCGTG
ATTTCCGAAG AGAAGGGCTA TAAACTCGAA AATGCAACGC TTGCCTACCT CGGTCAGGCA
GGTCGTGTCA ACATCGATAA GGATAACACC ACTATTGTCG AAGGCAAAGG CACACAGGAG
GACATCAAGG CCCGCATCAA CGAAATCAAA GGCCAGATCG ACAAATCGAC TTCCGACTAC
GATACCGAAA AGCTGCAGGA GCGTCTTGCA AAGCTTTCCG GCGGTGTCGC TGTGCTGAAC
ATCGGCGCAT CAACCGAAGT TGAAATGAAG GAAAAGAAAG CCCGCGTCGA AGATGCACTG
CATGCCACCC GCGCTGCAGT CCAGGAAGGC ATCGTGGTCG GCGGCGGTGT TGCTCTGATC
CGTGCGATCA AAGGTCTCGC CAATGCGGTT GCAGACAATG AAGACCAGAA AACCGGTATC
GAGATCATCC GCCGCGCGCT CGAAGAGCCG CTCCGTCAGA TTGTTGCCAA TACCGGCACC
ACCGACGGCG CTGTCGTGCT TGAGAAAGTC AAGGCAGCTG AAGGCGACTT CGGCTTCAAC
GCACGTACCG AGCAGTACGA GAACCTTGTT GAGGCAGGTG TTGTCGATCC GACCAAAGTA
ACCAGAAGCG CTCTTGAAAA TGCCGCTTCG GTTGCCAGCA TCCTTCTTAC CACCGAAGCA
GCCATTACCG ACATCAAGGA AGAGAAATCT GACATGCCGG CAATGCCTCC GGGCGGCATG
GGCGGCATGG GCGGCATGTA CTGA
 
Protein sequence
MTAKDIIFDS DARAKLKVGV DKLANAVKVT LGPAGRNVLI DKKFGAPTST KDGVTVAKEI 
ELADAVENMG AQMVREVASK TSDVAGDGTT TATVLAQAIY REGLKNVAAG ARPIDLKRGI
DRAVKEVVAE LRNISRSISG KKEIAQVGTI SANNDPEIGE LIAEAMDKVG KDGVITVEEA
KGMDTELKVV EGMQFDRGYL SPYFVTNPET MEAEIEDPLI LIHDKKIGNM KELLPILEKS
AQSGRPLLII AEDIEGEALA TLVVNKLRGT LKVCAVKAPG FGDRRKAMLE DIAILTGGTV
ISEEKGYKLE NATLAYLGQA GRVNIDKDNT TIVEGKGTQE DIKARINEIK GQIDKSTSDY
DTEKLQERLA KLSGGVAVLN IGASTEVEMK EKKARVEDAL HATRAAVQEG IVVGGGVALI
RAIKGLANAV ADNEDQKTGI EIIRRALEEP LRQIVANTGT TDGAVVLEKV KAAEGDFGFN
ARTEQYENLV EAGVVDPTKV TRSALENAAS VASILLTTEA AITDIKEEKS DMPAMPPGGM
GGMGGMY
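
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a hedged sketch, not part of the record, of how the gene sequence might be cleaned, its GC fraction computed, and its translation checked against the protein sequence. The helper names (clean, gc_fraction, translate) are hypothetical. The codon table holds the standard amino-acid assignments, which translation table 11 shares; alternative start codons are not handled.

```python
from itertools import product

# Amino-acid assignments in TCAG codon order. NCBI translation table 11
# uses the same assignments as the standard code; only start codons differ.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied from this record.
first_row = "ATGACTGCTA AAGATATTAT TTTTGATTCC GACGCCAGAG CAAAGCTCAA AGTCGGTGTG"
print(translate(first_row))  # MTAKDIIFDSDARAKLKVGV
```

The printed peptide corresponds to the first two blocks of the protein sequence listed above. Passing the full gene sequence to gc_fraction would allow a comparison with the GC content field.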