Gene Sare_0311 details


Gene Information

Locus tag: Sare_0311
Symbol: groEL
ID: 5703979
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 345675
End bp: 347297
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 67%
IMG OID: 641269837
Product: chaperonin GroEL
Protein accession: YP_001535232
Protein GI: 159035979
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
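
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a stop codon that is not counted in the protein length. The function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 345675
END_BP = 347297


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1623 540
```

Under these assumptions the result agrees with the listed Gene Length (1623 bp) and Protein Length (540 aa).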


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000447633
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGCCAAGA TGATCGCGTT CGACGAGGAG GCTCGCCGCG GCCTCGAGCG GGGCATGAAC 
CAGCTCGCTG ACGCCGTGAA GGTGACCCTC GGCCCCAAGG GCCGCAACGT CGTGCTCGAG
AAGAAGTGGG GCGCCCCCAC CATCACCAAC GATGGTGTGA GCATCGCCAA GGAGATCGAG
CTCGAGGACC CGTACGAGAA GATCGGCGCC GAGCTGGTCA AGGAGGTCGC GAAGAAGACC
GACGACGTCG CCGGTGACGG CACGACGACG GCGACCGTCC TGGCCCAGGC CATGGTCCGC
GAGGGCCTGC GTAACGTGGC CGCCGGCGCC AACCCGATGG CCCTGAAGCG GGGCATCGAG
ACCGCGGTCG CCAGCGTCTC GGAGGAGCTG CTGAAGCTCG CCAAGGACGT CGAGACCAAG
GAGCAGATCG CCTCCACCGC CTCCATCTCC GCTGGTGACC CCAGCGTCGG CGAGATCATC
GCCGAGGCGA TGGACAAGGT CGGCAAGGAA GGCGTCATCA CCGTCGAGGA GAGCAACACC
TTCGGCCTGG AGCTCGAGCT CACCGAGGGT ATGCGCTTCG ACAAGGGCTA CATCTCCGCC
TACTTCATGA CCGATCCGGA GCGGATGGAG GCCGTCTTCG ACGACCCGTA CATCCTGATC
GTCAACAGCA AGATCTCCTC GGTCAAGGAC CTGCTCCCGA TCCTGGAGAA GGTCATGCAG
TCGGGTAAGC CGCTGCTGAT CATCGCCGAG GACATCGAGG GTGAGGCCCT GGCGACCCTG
GTCGTCAACA AGGTCCGTGG CACCTTCAAG TCCGTCGCCG TCAAGGCTCC GGGCTTCGGT
GACCGCCGCA AGGCCATGCT CGGTGACATT GCCATCCTCA CCGGTGGCCA GGTCATCAGC
GAGGAGGTCG GCCTCAAGCT CGACGCCGTC AACCTCGACA TGGTGGGCCG GGCCCGCAAG
GTCGTGGTGA CCAAGGACGA GACCACCATC GTCGATGGCG CCGGCGACGC CGAGCAGATC
CAGGGTCGGG TCAACCAGAT CCGGGCCGAG ATCGACAAGA GCGACTCCGA CTACGACCGG
GAGAAGCTGC AGGAGCGGCT GGCCAAGCTG GCCGGCGGCG TTGCGGTGAT CAAGGTCGGT
GCGGCCACCG AGGTCGAGCT GAAGGAGCGC AAGCACCGCA TCGAGGACGC CGTTCGCAAC
GCGAAGGCCG CCGTCGAGGA GGGCATCGTC CCGGGTGGTG GCGTCGCGCT GGTGCAGGCC
GGCAAGACCG CCTTCGACAA GCTCGACCTG ACCGGTGACG AGGCGACCGG CGCCCAGATC
GTCAAGATCG CGCTGGACGG CCCGCTGCGG CAGATCGCCG TCAACGCCGG CCTCGAGGGT
GGCGTCGTCG TCGAGCACGT GCGTGGCATC GAGGCGGGTC ATGGCCTGAA CGCCGCGACC
GGTGGGTACG TGGACCTGAT GGCCGCGGGC ATCATCGACC CGGCCAAGGT GACCCGGTCG
GCGCTGCAGA ACGCGTCGTC GATCGCGGCG CTGTTCCTCA CCACCGAGGC CGTCGTGGCG
GACAAGCCGG AGAAGACCCC GGCCGCCCCG GCTGCTCCGG GCGGCGGGGA AATGGACTTC
TGA
 
Protein sequence
MAKMIAFDEE ARRGLERGMN QLADAVKVTL GPKGRNVVLE KKWGAPTITN DGVSIAKEIE 
LEDPYEKIGA ELVKEVAKKT DDVAGDGTTT ATVLAQAMVR EGLRNVAAGA NPMALKRGIE
TAVASVSEEL LKLAKDVETK EQIASTASIS AGDPSVGEII AEAMDKVGKE GVITVEESNT
FGLELELTEG MRFDKGYISA YFMTDPERME AVFDDPYILI VNSKISSVKD LLPILEKVMQ
SGKPLLIIAE DIEGEALATL VVNKVRGTFK SVAVKAPGFG DRRKAMLGDI AILTGGQVIS
EEVGLKLDAV NLDMVGRARK VVVTKDETTI VDGAGDAEQI QGRVNQIRAE IDKSDSDYDR
EKLQERLAKL AGGVAVIKVG AATEVELKER KHRIEDAVRN AKAAVEEGIV PGGGVALVQA
GKTAFDKLDL TGDEATGAQI VKIALDGPLR QIAVNAGLEG GVVVEHVRGI EAGHGLNAAT
GGYVDLMAAG IIDPAKVTRS ALQNASSIAA LFLTTEAVVA DKPEKTPAAP AAPGGGEMDF
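
The sequences above are displayed in blocks of 10 characters, 60 per line. A minimal sketch of how such a listing might be cleaned up before analysis, for example to recompute a GC fraction, is shown below. The helpers clean_sequence and gc_fraction are hypothetical names introduced for illustration, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGGCCAAGA TGATCGCGTT CGACGAGGAG GCTCGCCGCG GCCTCGAGCG GGGCATGAAC"

seq = clean_sequence(first_line)
print(len(seq), seq[:3])          # length of the cleaned line and its first codon
print(round(gc_fraction(seq), 2))  # GC fraction of this line only, not the whole gene
```

Applying the same two functions to the full gene sequence would give a value that can be compared with the GC content field of the record.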