Gene Caci_1008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1008 
SymbolgroEL 
ID8332342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1146877 
End bp1148499 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content70% 
IMG OID644954157 
Productchaperonin GroEL 
Protein accessionYP_003111777 
Protein GI256390213 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.208595 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGA TGCTCGAATT CGACGAGGAC GCGCGCCGCA AGCTCGAGCG CGGCGTCAAC 
GCGCTGGCCG ACGCGGTCAA GGTGACCATC GGTCCCAAGG GCCGCAACGT CGTCATCGAC
AAGAAGTTCG GCGCCCCGAC GATCACCAAC GACGGCGTGA CCATCGCCCG CGAGGTGGAG
CTCGAGGACC CCTACGAGAA CCTCGGCGCG CAGCTGGCCA AGGAAGTGGC CACCAAGACC
AACGACATCG CCGGCGACGG CACCACCACC GCGACCGTGC TGGCCCAGGC CATGGTCCGC
GAGGGTCTGC GCAACGTGGC GGCCGGCGCG CAGCCGATCG CGCTCAAGCG CGGTATCGAC
GCGGCGGTCA AGGCCGTGGC CGACCAGCTG CTGTCCACCG CCAAGCAGGT GGAGAGCAAG
GAGTCGATCG CCCAGGTCGG CGCCATCTCC GCGCAGGACA AGGCGATCGG CGACCTGATC
GCCGAGGCCA TGGACAAGGT CGGCAAGGAC GGTGTCATCA CCGTCGAAGA GTCGAACACC
ATGGGCCTGG AGCTCGAGTT CACCGAGGGC ATGCAGTTCG ACAAGGGTTA CCTGTCGCCC
TACATGGTGA CCGACCAGGA GCGCATGGAG GCGGTCCTGG AGGACCCCTA CATCCTGATC
AACCAGGGCA AGATCAGCTC CCTGAACGAG CTGCTGCCGC TGCTGGAGAA GGTCGCGCAG
TCGCGCAAGC CGCTGCTGAT CATCGCCGAG GACGTCGACG GCGAGGCGCT GTCCACGCTG
GTCGTGAACA AGATCCGCGG CACCTTCACC TCCGTCGCGG TCAAGGCTCC GGCCTTCGGC
GACCGCCGCA AGGCGATCCT GGAGGACCTG GCGATCCTCA CCGGCGCGCA GGTCGTGGCG
CCCGAGGTCG GCCTGAAGCT GGACCAGGTC GGCGTCGAGG TGCTGGGCAC CGCCCGCCGC
GTGACCGTCA CCAAGGACGA CACCACCGTC GTCGACGGTG CCGGCGACAC CGCGGCCGTG
GCCGACCGCG TGAAGCAGAT CAAGGCCGCG ATCGACACCA CCGACTCGGA CTGGGACCGC
GAGAAGCTGC AGGAGCGCCT GGCCAAGCTG GCCGGCGGCG TCTGCGTCAT CCGCGTCGGC
GCGGCCACCG AGGTCGAGCT CAAGGAGAAG AAGCACCGTC TGGAGGACGC CATCTCCGCG
ACCCGCGCCG CGGTCGAGGA GGGCATCGTC TCCGGCGGCG GCTCCGCGCT GGTGCACGCC
GTCTCCGTGC TGGACGGCGA CCTGGGCCTG ACGGGCGACG AGGCCACCGG CGTGCGCGTC
GTGCGCAAGG CCGCGGTCGA GCCGCTGCGC TGGATCGCCG AGAACGCGGG CCTGGAGGGC
TACGTCGTGA CCGACAAGGT CTCGAACCTG CCCGTGGGCT CCGGCCTGAA CGCGGCCACC
GGCGAGTACG TCGACCTGGT CGCGGCCGGC GTCATCGACC CGGTCAAGGT CACCCGCTCC
GCGCTGGCCA ACGCCGCCTC CATCGCCTCG ATGCTGCTCA CGACCGAGAC CCTGGTCGTC
GAGAAGCCGG CGCCGAAGGA AGACGAGGGC CACTCGCACG GTGGCCACGG GCACTCCCAC
TAA
 
Protein sequence
MAKMLEFDED ARRKLERGVN ALADAVKVTI GPKGRNVVID KKFGAPTITN DGVTIAREVE 
LEDPYENLGA QLAKEVATKT NDIAGDGTTT ATVLAQAMVR EGLRNVAAGA QPIALKRGID
AAVKAVADQL LSTAKQVESK ESIAQVGAIS AQDKAIGDLI AEAMDKVGKD GVITVEESNT
MGLELEFTEG MQFDKGYLSP YMVTDQERME AVLEDPYILI NQGKISSLNE LLPLLEKVAQ
SRKPLLIIAE DVDGEALSTL VVNKIRGTFT SVAVKAPAFG DRRKAILEDL AILTGAQVVA
PEVGLKLDQV GVEVLGTARR VTVTKDDTTV VDGAGDTAAV ADRVKQIKAA IDTTDSDWDR
EKLQERLAKL AGGVCVIRVG AATEVELKEK KHRLEDAISA TRAAVEEGIV SGGGSALVHA
VSVLDGDLGL TGDEATGVRV VRKAAVEPLR WIAENAGLEG YVVTDKVSNL PVGSGLNAAT
GEYVDLVAAG VIDPVKVTRS ALANAASIAS MLLTTETLVV EKPAPKEDEG HSHGGHGHSH