Gene Dret_2176 details

Gene Information

Locus tag: Dret_2176
Symbol: groEL
ID: 8420027
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2473141
End bp: 2474790
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 59%
IMG OID: 645038770
Product: chaperonin GroEL
Protein accession: YP_003199038
Protein GI: 258406296
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
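
The coordinates and lengths listed above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (the usual convention for such records, but not stated on this page) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2473141, 2474790)
print(length, protein_length(length))  # 1650 549
```

Under these assumptions the result agrees with the reported gene length of 1650 bp and protein length of 549 aa.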


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.901834
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGCGA AGACCATCAA ATTTGGCGTA AAAGCCCGCG AACAGCTGCA ACAGGGTGTG 
GACCAACTGG CTCAGGCCGT AAAGGTCACC CTTGGCCCCA AAGGCCGTAA TGTCGTTATT
GAAAAGTCCT TCGGTTCCCC CACGATCACC AAGGACGGTG TCACCGTTGC CCGCGAGATC
GAACTCGAGG ACAAGTTTGA AAATATGGGC GCTCAGATGG TCAAGGAAGT GGCCAGCAAG
ACCAGCGACG TTGCTGGTGA CGGCACCACC ACCGCCACAA TCCTGGCCCA GAAAATTTTC
AGCGAAGGCT TGAAGCTTGT TGCCGCCGGC CGGAACCCCA TGGCCATCAA GCGCGGCATC
GACAAGGCCG TTGAAGCCAT CAACAAAGAA TTGGCCGACT TCGCCAAGCC GACCCGCGAC
CAGAAAGAGA TCGCCCAGAT CGGCACTATC TCCGCCAACA ACGATCCGAC CATCGGCAAC
ATCATTGCCG AGGCCATGAA CAAGGTTGGC AAGGAAGGCG TTATCACCGT GGAAGAGGCC
AAGGGCCTGG ACACCACCCT GGACGTGGTC GAAGGCATGC AGTTCGACCG CGGCTACCTC
TCCCCCTATT TCGTGACCGA CTCCGAAAAA ATGGTTGCCG AGTTGGAAGA TCCGCTCATC
CTCATCAATG AGAAGAAGAT CTCCAACATG AAAGACCTCC TGCCCGTGCT GGAGCAGGTG
GCCAAAATGA ACAAGCCGCT GATGATCATC GCCGAGGAAA TCGAAGGCGA AGCCCTGGCC
ACCCTCGTGG TCAACAAGCT GCGCGGCACC CTGCAGGTCG CTGCGGTCAA GGCCCCCGGC
TTTGGCGAAC GCCGTAAGGC CATGCTCCAG GACATCGCCG TTTTGACCGG CGGCAGCGTC
ATTTCCGAAG ATGTGGGCAC CAAGCTTGAA AATGCCACGG TCAACGACCT CGGCAGCGCC
AAGCGCATCA ACATCGACAA AGAAAACACC ACCATCGTGG ACGGCGCTGG CTCCTCCGAC
GACATCAAGG CCCGCATCAA GCAGATCCGC GCTGAGATCG ACGAAACCAC CTCCGATTAC
GATCGCGAAA AGCTCCAGGA GCGTTTGGCC AAGATCGTCG GCGGTGTGGC CGTGATCAAT
GTCGGCGCTG CGACCGAAAC CGAAATGAAA GAAAAGAAGG CCCGCGTCGA AGACGCCCTG
AACGCTACCC GCGCTGCCGT TGAAGAAGGC ATCGTGCCTG GTGGCGGCGT GGCCTTCATC
CGCACCCAGC ATGCCGCTAA TTCCGTCAAA CCGGCCGACG AAGACGAAAA GGCCGGTGTC
GATGTGGTCC GCGCCGCTGT GGTCGAACCC CTGCGTCAGA TTTGCGCCAA TGCTGGCTTC
GAAGGCGCCT TAATCGTGGA AAAAGTCCGC GAGCACAAGG ACGGCTACGG CTTTAACGCC
GCCACTGGCG AATTCGAAGA CCTGCTCAAG GCCGGTGTCA TTGATCCTAA AAAGGTCTCC
CGCACCGCCC TGCAGAACGC CGCTTCCGTC GCCTCCCTCT TGCTGACCAC GGAAGCTGCC
ATTGCCGACA AACCTGAAGA CAAGGACAGC GGTGGCGCTC CTGCCGGTGG CGGTATGCCC
GGCATGGGCG GCATGGGCGG CATGTACTAA
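
The GC content reported in the gene information can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is simply the percentage of G and C among the A/C/G/T bases; the database may round or compute it differently. `gc_percent` is a hypothetical helper, and only the first line of the sequence is used here for brevity.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G/C among A/C/G/T characters; spaces and newlines are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above; paste the full sequence to check the whole gene.
first_line = "ATGGCTGCGA AGACCATCAA ATTTGGCGTA AAAGCCCGCG AACAGCTGCA ACAGGGTGTG"
print(round(gc_percent(first_line), 1))
```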
 
Protein sequence
MAAKTIKFGV KAREQLQQGV DQLAQAVKVT LGPKGRNVVI EKSFGSPTIT KDGVTVAREI 
ELEDKFENMG AQMVKEVASK TSDVAGDGTT TATILAQKIF SEGLKLVAAG RNPMAIKRGI
DKAVEAINKE LADFAKPTRD QKEIAQIGTI SANNDPTIGN IIAEAMNKVG KEGVITVEEA
KGLDTTLDVV EGMQFDRGYL SPYFVTDSEK MVAELEDPLI LINEKKISNM KDLLPVLEQV
AKMNKPLMII AEEIEGEALA TLVVNKLRGT LQVAAVKAPG FGERRKAMLQ DIAVLTGGSV
ISEDVGTKLE NATVNDLGSA KRINIDKENT TIVDGAGSSD DIKARIKQIR AEIDETTSDY
DREKLQERLA KIVGGVAVIN VGAATETEMK EKKARVEDAL NATRAAVEEG IVPGGGVAFI
RTQHAANSVK PADEDEKAGV DVVRAAVVEP LRQICANAGF EGALIVEKVR EHKDGYGFNA
ATGEFEDLLK AGVIDPKKVS RTALQNAASV ASLLLTTEAA IADKPEDKDS GGAPAGGGMP
GMGGMGGMY
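
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table by hand rather than relying on any library. It uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG). `translate` is an illustrative helper, not part of any tool referenced by this record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.upper().split())  # drop the spaces and newlines used for layout
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above.
print(translate("ATGGCTGCGA AGACCATCAA ATTTGGCGTA AAAGCCCGCG AACAGCTGCA ACAGGGTGTG"))
# MAAKTIKFGVKAREQLQQGV
print(translate("GGCATGGGCG GCATGGGCGG CATGTACTAA"))
# GMGGMGGMY
```

Both fragments match the corresponding start and end of the protein sequence listed above.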