Gene Dgeo_2231 details

Gene Information

Locus tag: Dgeo_2231
Symbol: groEL
ID: 4056906
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 2350311
End bp: 2351948
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 64%
IMG OID: 641231274
Product: chaperonin GroEL
Protein accession: YP_605694
Protein GI: 94986330
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
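
The coordinate and length fields above agree with one another if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic, using illustrative variable names that do not come from the database:

```python
# Values copied from the record above; the variable names are illustrative only.
start_bp = 2350311
end_bp = 2351948
gene_length_bp = 1638
protein_length_aa = 545

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons; the last codon is the stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)  # prints: 1638 546 0 545
```

Under that assumption, the span matches the stated gene length, and the 546 codons correspond to 545 residues plus one stop codon.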


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAAC AGCTCGTCTT TGATGAACAC GCCCGCCGCA GCCTGGAGCG CGGTGTCAAC 
GCCGTTGCCA ATGCCGTCAA GGTCACCCTG GGGCCGCGTG GCCGCAACGT GGTGATCGAG
AAGAAGTTCG GCAGCCCCAC CATCACCAAG GACGGCGTGA CCGTCGCCAA GGAAGTGGAG
CTGGAAGACA AGCTCGAGAA CATCGGCGCA CAGCTGCTCA AGGAAATCGC CTCCAAGACC
AATGACATCA CGGGTGACGG CACCACCACG GCGACCGTGC TGGGCCAAGC CATCGTGAAG
GAAGGGCTGC GCAACGTGGC TGCGGGCGCC AACCCGCTGG CCCTGAAGCG CGGCATCGAA
AAGGCCGTGG CTGCTGCCAC CGAGGAGATC AAGAAGCTGG CCGTCCCGGT TGAGGACAGC
AACGCGATCA AAAAGGTTGC GGGCATCAGC GCCAACGACG CGCAGGTCGG CGAGGAAATC
GCCAACGCGA TGGACAAGGT GGGCAAGGAA GGCGTGATCA CCATCGAAGA GTCGAAGAGC
TTCGACACTG AGGTGGACGT CGTGGAAGGG ATGCAGTTTG ACAAGGGCTA CATCAGCCCC
TACTTCATCA CCAACCCCGA CAAGATGGAG GCCGTCCTTG AAGACGCCTA CATCCTGATC
AACGAGAAGA AGATCAGCGC CCTCAAGGAT CTCCTGCCCG TGCTGGAAAA GGTCGCGCAA
ACCAGCCGTC CTCTCCTGAT CATTGCGGAA GACGTGGAAG GCGAGGCGCT CGCGACGCTG
ATCGTAAACA AGCTGCGCGG CACGCTGAAC ATCGCCGCCG TGAAGGCTCC CGGCTTCGGT
GACCGCCGCA AGGAAATGCT GCGCGACATC GCCGCCGTGA CGGGTGGTCA GGTGGTCAGC
GAGGATCTGG GCCACCGCCT GGAAAACGTC ACGCTCGACA TGCTGGGCCG CGCCAAGCGC
ATCCGCATCA CCAAGGATGA GACGACCATC ATCGACGGCA TGGGCAACCA GGCCGAGATC
GATGCCCGCG TCAACGCCAT CAAGGCCGAA CTGGAAACCA CCGACAGCGA CTACGCCCGC
GAGAAGCTCC AGGAGCGCCT CGCCAAGTTG GCGGGCGGCG TGGCCGTGAT CCGCGTGGGG
GCCGCAACCG AGACCGAACT CAAGGAGAAG AAGCACCGCT ACGAGGACGC CCTCTCCACC
GCCCGCTCGG CGGTTGAGGA AGGCATCGTC GCGGGTGGTG GGACCACGCT GCTGCGCGTG
ATTCCTGCCG TCAAGCAGCT GGCCGAGAGC CTGGAAGGTG ACGAGGCGAC CGGTGCGCGT
ATCCTGGTCC GCGCCCTGGA AGAACCCGCC CGCCAGATCG CGGCGAACGC GGGCGACGAG
GGCAGCGTGA TTGTGAACGC CGTTCTGAAC AGTGACAAGC CGCGCTACGG CTACAACGCT
GCGACCGGCG AGTTTGTGGA TGACATGGTC GCCGCCGGGA TCGTTGACCC CGCCAAGGTG
ACCCGGACCG CCCTCCAGAA CGCCGCTTCC ATCGGCGGGC TGATCCTGAC CACCGAAGCC
ATCGTCAGCG ACAAGCCCGA GAAGGAGAAG GCACCTGCGG CCGCTGGCGC CCCCGATATG
GGCGGGATGG ACTTCTAA
 
Protein sequence
MAKQLVFDEH ARRSLERGVN AVANAVKVTL GPRGRNVVIE KKFGSPTITK DGVTVAKEVE 
LEDKLENIGA QLLKEIASKT NDITGDGTTT ATVLGQAIVK EGLRNVAAGA NPLALKRGIE
KAVAAATEEI KKLAVPVEDS NAIKKVAGIS ANDAQVGEEI ANAMDKVGKE GVITIEESKS
FDTEVDVVEG MQFDKGYISP YFITNPDKME AVLEDAYILI NEKKISALKD LLPVLEKVAQ
TSRPLLIIAE DVEGEALATL IVNKLRGTLN IAAVKAPGFG DRRKEMLRDI AAVTGGQVVS
EDLGHRLENV TLDMLGRAKR IRITKDETTI IDGMGNQAEI DARVNAIKAE LETTDSDYAR
EKLQERLAKL AGGVAVIRVG AATETELKEK KHRYEDALST ARSAVEEGIV AGGGTTLLRV
IPAVKQLAES LEGDEATGAR ILVRALEEPA RQIAANAGDE GSVIVNAVLN SDKPRYGYNA
ATGEFVDDMV AAGIVDPAKV TRTALQNAAS IGGLILTTEA IVSDKPEKEK APAAAGAPDM
GGMDF
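
The sequences above are printed in space-separated blocks of ten characters. A small Python sketch of how such blocks could be joined and summarized is given below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last gene sequence lines are included for illustration; reproducing the record's whole-gene figures would require pasting every sequence line.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence shown above (illustration only).
first_line = clean_sequence(
    "ATGGCCAAAC AGCTCGTCTT TGATGAACAC GCCCGCCGCA GCCTGGAGCG CGGTGTCAAC"
)
last_line = clean_sequence("GGCGGGATGG ACTTCTAA")

# Stop codons of translation table 11 (the table named in the record).
STOP_CODONS = {"TAA", "TAG", "TGA"}

print(len(first_line))                  # bases on the first line
print(first_line.startswith("ATG"))     # does the gene open with ATG?
print(last_line[-3:] in STOP_CODONS)    # does the gene end with a stop codon?
print(round(gc_fraction(first_line), 2))
```

Applied to the full gene sequence, `len` and `gc_fraction` could be compared against the Gene Length and GC content fields of the record.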