Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4231 |
Symbol | |
ID | 5704402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4803939 |
End bp | 4805576 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641273650 |
Product | chaperonin GroEL |
Protein accession | YP_001539003 |
Protein GI | 159039750 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.732523 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.119821 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAAGA TCCTGAGCTT CTCGGACGAC GCTCGGCACC AGCTGGAGCA CGGTGTCAAC GCCCTCGCGG ACGCGGTCAA GGTCACCCTC GGCCCCCGCG GGCGCAACGT CGTCCTAGAC AAGAAGTTTG GTGCACCCAC GATCACCAAC GACGGCGTGA CGATCGCCAA GGAGATCGAG CTCACCAACC CATACGAGAA TCTCGGCGCG CAGTTGGTCA AGGAGGTGGC GACCAAGACC AACGACGTCG CCGGCGACGG GACCACCACC GCCACCGTGC TGGCCCAGGC GCTGGTCCGG GAGGGCCTGC GCAACGTGGC GGCCGGCGCC AACCCGACCG GCCTCAAGCG GGGCATCGAC GCGGCGGCGA CCAAGGTCTC CGAGGAACTG CTCGGCAAGG CCGTTGACGT GTCCGACAAG GCGGCGATCG CCCACGTCGC GACCGTCTCC GCGCAGGACT CCACGATCGG CGAGCTCATC GCCGAGGCGA TGGAGCGGGT CGGCCGCGAC GGTGTCATCA CCGTCGAGGA GGGCTCCACC CTCGCCACCG AACTGGACGT GACCGAGGGT CTCCAGTTCG ACAAGGGCTT CATCTCCCCC AACTTCGTCA CCGACGCGGA GGGGCAGGAG TCGGTCCTGG AGGACGCGTA CATCCTGATC ACCACGCAGA AGATCTCGGC GATCGAGGAG CTGCTGCCGC TGCTGGAGAA GGTCCTCCAG GAGAGCAAGC CGCTGCTCAT CATCGCCGAG GACGTCGAGG GGCAGGCGCT GTCCACCCTG GTGGTCAACG CGCTCCGCAA GACCATGAAG GTCTGCGCGG TGAAGGCTCC CGGCTTCGGC GACCGCCGCA AGGCGATGCT GCAGGACATG GCGATCCTGA CCGGTGCCGA GTTGGTCGCC CCCGAGCTGG GCTACAAGCT CGACCAGGTT GGGCTCGAGG TGCTCGGCAC CGCCCGCCGC GTGGTGGTCG ACAAGGAGAA CACCACCATC GTCGACGGCG GCGGCCAGGC ATCCGACGCC GAGGACCGGG TCGCCCAGAT CCGCAAGGAG ATCGAGGCTT CGGACTCCGA GTGGGACCGG GAGAAGCTCG CCGAGCGGCT GGCCAAGCTC TCCGGCGGCA TCGCCGTGAT CCGGGCGGGC GCGGCGACCG AGGTCGAGAT GAAGGAGCGT AAGCACCGCA TCGAGGACGC CATCGCCGCC ACCAAGGCCG CGGTCGAGGA GGGTACGGTG CCCGGCGGCG GTGCCGCCCT GGCCCAGGTC CTGCCGGCGC TCGACGGCGA CCTCGGCCTC ACCGGGGACG AGCAGGTCGG TGTCTCGATC GTGCGTAAGG CGCTGATCGA GCCGCTGCGC TGGATCGCGC AGAACGCCGG CCACGACGGC TACGTGGTGG TGCAGAAGGT CGCCGGCAAG GACTGGGGCC ACGGTCTGGA CGCGGCCACG GGCGAGTACG TCGACCTGGC CAAGGCCGGC ATCCTCGACC CGGTGAAGGT GACCCGCAAC GCGGTCGCCA ACGCCGCGTC GATCGCGGGC CTGCTGCTCA CCACCGAGAG CCTCGTGGTG GAGAAGCCGC AGGAGCCGGA GCCGGCCGCG GCTGGCCACG GCCACGGCCA CGGTCATCAG CACGGCCCGG GCTTCTGA
|
Protein sequence | MAKILSFSDD ARHQLEHGVN ALADAVKVTL GPRGRNVVLD KKFGAPTITN DGVTIAKEIE LTNPYENLGA QLVKEVATKT NDVAGDGTTT ATVLAQALVR EGLRNVAAGA NPTGLKRGID AAATKVSEEL LGKAVDVSDK AAIAHVATVS AQDSTIGELI AEAMERVGRD GVITVEEGST LATELDVTEG LQFDKGFISP NFVTDAEGQE SVLEDAYILI TTQKISAIEE LLPLLEKVLQ ESKPLLIIAE DVEGQALSTL VVNALRKTMK VCAVKAPGFG DRRKAMLQDM AILTGAELVA PELGYKLDQV GLEVLGTARR VVVDKENTTI VDGGGQASDA EDRVAQIRKE IEASDSEWDR EKLAERLAKL SGGIAVIRAG AATEVEMKER KHRIEDAIAA TKAAVEEGTV PGGGAALAQV LPALDGDLGL TGDEQVGVSI VRKALIEPLR WIAQNAGHDG YVVVQKVAGK DWGHGLDAAT GEYVDLAKAG ILDPVKVTRN AVANAASIAG LLLTTESLVV EKPQEPEPAA AGHGHGHGHQ HGPGF
|
| |