Gene Acel_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0101 
SymbolgroEL 
ID4484544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp104887 
End bp106506 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content66% 
IMG OID639728864 
Productchaperonin GroEL 
Protein accessionYP_871863 
Protein GI117927312 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGA TGATCGCGTT TGACGAGGCG GCTCGTCGCG CTCTCGAGCG CGGCATGAAC 
CAGCTCGCTG ACGCTGTCAA GGTGACGCTC GGCCCGAAGG GTCGCAACGT CGTCCTGGAG
AAGAAATGGG GCGCACCCAC GATCACGAAT GACGGGGTGA GCATCGCCAA GGAGATCGAG
CTCGAGGACC CCTACGAGAA GATCGGGGCT GAGCTCGTCA AGGAAGTTGC CAAGAAGACC
GACGACGTCG CCGGTGACGG GACGACGACG GCAACTGTTC TCGCTCAGAC CCTGGTACGC
GAGGGTCTGC GGAACGTGGC CGCCGGCGCG AACCCGATGG CGCTCAAGCG CGGAATCGAA
GCGGCGACCG AGCGGGTCTG CCAGGCGCTG CTCGAGATAG CCAAGGACGT CGAGACCCGC
GAGCAGATCG CGTCGACTGC GTCGATCTCC GCCGGTGACA CTGCCGTCGG CGAGATGATT
GCCGAGGCGA TGGACAAGGT CGGCAAGGAA GGCGTCATCA CCGTTGAGGA GTCCAACACC
TTCGGTCTGG AGCTTGAGCT CACCGAGGGA ATGCGCTTCG ACAAGGGGTA CATCTCGCCG
TACTTCGTGA CCGACAGCGA GCGGATGGAA GCGGTCCTCG AAGACCCGTA CATTCTGATC
GCGAACCAGA AGATCTCGGC GGTCAAGGAC CTGCTGCCCG TCCTGGAGAA GGTCATGCAG
GCCGGGAAAC CGCTCGCGAT CATTGCCGAG GACGTCGAGG GCGAGGCGCT GGCGACGCTC
GTCGTCAACA AGATCCGCGG TACCTTCCGC TCGGTTGCCG TGAAGGCGCC CGGCTTCGGG
GATCGGCGGA AGGCGATGCT CGGCGACATC GCGGTACTCA CCGGCGGTCA GGTGATCAGC
GAAGAGGTCG GTCTGAAGCT GGAGAATGTC GGCCTCGACC TGCTCGGCCG GGCGCGCAAG
GTCGTCGTCA CCAAGGATGA GACGACGATC GTCGAGGGCG CTGGTGATCC GGAGCAGATC
GCGGGCCGGG TCAACCAGAT TCGCGCCGAG ATCGAGAAGA CCGACTCCGA CTACGACCGG
GAGAAGCTGC AGGAGCGGCT CGCGAAGCTC GCGGGCGGCG TCGCGGTGAT CAAGGTTGGT
GCCGCCACCG AGGTCGAGCT CAAGGAGCGC AAGCACCGCA TCGAGGACGC CGTCCGGAAT
GCCAAGGCCG CGGTGGAGGA GGGCATCGTC GCTGGCGGCG GTGTGGCTCT GCTCCAGGCG
GGCAAGACGG CCTTCGAGAA GCTCGACCTT GAGGGTGACG AGGCAACCGG CGCCCGGATC
GTTGAGCTGG CGCTCGAAGC CCCGCTGAAG CAGATCGCGA TCAACGCGGG TCTCGAGGGC
GGGGTCGTCG TCGAGAAGGT GCGGAGCCTT GAGCCTGGCT GGGGTCTCAA CGCCCAGACC
GGTGAGTACG TCGACATGAT CAAGGCGGGC ATCATCGACC CGGCCAAGGT CACCCGGTCG
GCGCTGCAGA ACGCTGCGTC CATCGCCGGA CTCTTCCTCA CCACCGAGGC GGTGGTGGCG
GAGAAGCCGG AGAAGGAGAA GACACCCGCA GCACCCGGCG GCGGCGACAT GGACTTCTGA
 
Protein sequence
MAKMIAFDEA ARRALERGMN QLADAVKVTL GPKGRNVVLE KKWGAPTITN DGVSIAKEIE 
LEDPYEKIGA ELVKEVAKKT DDVAGDGTTT ATVLAQTLVR EGLRNVAAGA NPMALKRGIE
AATERVCQAL LEIAKDVETR EQIASTASIS AGDTAVGEMI AEAMDKVGKE GVITVEESNT
FGLELELTEG MRFDKGYISP YFVTDSERME AVLEDPYILI ANQKISAVKD LLPVLEKVMQ
AGKPLAIIAE DVEGEALATL VVNKIRGTFR SVAVKAPGFG DRRKAMLGDI AVLTGGQVIS
EEVGLKLENV GLDLLGRARK VVVTKDETTI VEGAGDPEQI AGRVNQIRAE IEKTDSDYDR
EKLQERLAKL AGGVAVIKVG AATEVELKER KHRIEDAVRN AKAAVEEGIV AGGGVALLQA
GKTAFEKLDL EGDEATGARI VELALEAPLK QIAINAGLEG GVVVEKVRSL EPGWGLNAQT
GEYVDMIKAG IIDPAKVTRS ALQNAASIAG LFLTTEAVVA EKPEKEKTPA APGGGDMDF