Gene Sare_4231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4231 
Symbol 
ID5704402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4803939 
End bp4805576 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content70% 
IMG OID641273650 
Productchaperonin GroEL 
Protein accessionYP_001539003 
Protein GI159039750 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.732523 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.119821 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGA TCCTGAGCTT CTCGGACGAC GCTCGGCACC AGCTGGAGCA CGGTGTCAAC 
GCCCTCGCGG ACGCGGTCAA GGTCACCCTC GGCCCCCGCG GGCGCAACGT CGTCCTAGAC
AAGAAGTTTG GTGCACCCAC GATCACCAAC GACGGCGTGA CGATCGCCAA GGAGATCGAG
CTCACCAACC CATACGAGAA TCTCGGCGCG CAGTTGGTCA AGGAGGTGGC GACCAAGACC
AACGACGTCG CCGGCGACGG GACCACCACC GCCACCGTGC TGGCCCAGGC GCTGGTCCGG
GAGGGCCTGC GCAACGTGGC GGCCGGCGCC AACCCGACCG GCCTCAAGCG GGGCATCGAC
GCGGCGGCGA CCAAGGTCTC CGAGGAACTG CTCGGCAAGG CCGTTGACGT GTCCGACAAG
GCGGCGATCG CCCACGTCGC GACCGTCTCC GCGCAGGACT CCACGATCGG CGAGCTCATC
GCCGAGGCGA TGGAGCGGGT CGGCCGCGAC GGTGTCATCA CCGTCGAGGA GGGCTCCACC
CTCGCCACCG AACTGGACGT GACCGAGGGT CTCCAGTTCG ACAAGGGCTT CATCTCCCCC
AACTTCGTCA CCGACGCGGA GGGGCAGGAG TCGGTCCTGG AGGACGCGTA CATCCTGATC
ACCACGCAGA AGATCTCGGC GATCGAGGAG CTGCTGCCGC TGCTGGAGAA GGTCCTCCAG
GAGAGCAAGC CGCTGCTCAT CATCGCCGAG GACGTCGAGG GGCAGGCGCT GTCCACCCTG
GTGGTCAACG CGCTCCGCAA GACCATGAAG GTCTGCGCGG TGAAGGCTCC CGGCTTCGGC
GACCGCCGCA AGGCGATGCT GCAGGACATG GCGATCCTGA CCGGTGCCGA GTTGGTCGCC
CCCGAGCTGG GCTACAAGCT CGACCAGGTT GGGCTCGAGG TGCTCGGCAC CGCCCGCCGC
GTGGTGGTCG ACAAGGAGAA CACCACCATC GTCGACGGCG GCGGCCAGGC ATCCGACGCC
GAGGACCGGG TCGCCCAGAT CCGCAAGGAG ATCGAGGCTT CGGACTCCGA GTGGGACCGG
GAGAAGCTCG CCGAGCGGCT GGCCAAGCTC TCCGGCGGCA TCGCCGTGAT CCGGGCGGGC
GCGGCGACCG AGGTCGAGAT GAAGGAGCGT AAGCACCGCA TCGAGGACGC CATCGCCGCC
ACCAAGGCCG CGGTCGAGGA GGGTACGGTG CCCGGCGGCG GTGCCGCCCT GGCCCAGGTC
CTGCCGGCGC TCGACGGCGA CCTCGGCCTC ACCGGGGACG AGCAGGTCGG TGTCTCGATC
GTGCGTAAGG CGCTGATCGA GCCGCTGCGC TGGATCGCGC AGAACGCCGG CCACGACGGC
TACGTGGTGG TGCAGAAGGT CGCCGGCAAG GACTGGGGCC ACGGTCTGGA CGCGGCCACG
GGCGAGTACG TCGACCTGGC CAAGGCCGGC ATCCTCGACC CGGTGAAGGT GACCCGCAAC
GCGGTCGCCA ACGCCGCGTC GATCGCGGGC CTGCTGCTCA CCACCGAGAG CCTCGTGGTG
GAGAAGCCGC AGGAGCCGGA GCCGGCCGCG GCTGGCCACG GCCACGGCCA CGGTCATCAG
CACGGCCCGG GCTTCTGA
 
Protein sequence
MAKILSFSDD ARHQLEHGVN ALADAVKVTL GPRGRNVVLD KKFGAPTITN DGVTIAKEIE 
LTNPYENLGA QLVKEVATKT NDVAGDGTTT ATVLAQALVR EGLRNVAAGA NPTGLKRGID
AAATKVSEEL LGKAVDVSDK AAIAHVATVS AQDSTIGELI AEAMERVGRD GVITVEEGST
LATELDVTEG LQFDKGFISP NFVTDAEGQE SVLEDAYILI TTQKISAIEE LLPLLEKVLQ
ESKPLLIIAE DVEGQALSTL VVNALRKTMK VCAVKAPGFG DRRKAMLQDM AILTGAELVA
PELGYKLDQV GLEVLGTARR VVVDKENTTI VDGGGQASDA EDRVAQIRKE IEASDSEWDR
EKLAERLAKL SGGIAVIRAG AATEVEMKER KHRIEDAIAA TKAAVEEGTV PGGGAALAQV
LPALDGDLGL TGDEQVGVSI VRKALIEPLR WIAQNAGHDG YVVVQKVAGK DWGHGLDAAT
GEYVDLAKAG ILDPVKVTRN AVANAASIAG LLLTTESLVV EKPQEPEPAA AGHGHGHGHQ
HGPGF