Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0311 |
Symbol | groEL |
ID | 5703979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 345675 |
End bp | 347297 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641269837 |
Product | chaperonin GroEL |
Protein accession | YP_001535232 |
Protein GI | 159035979 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000447633 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCCAAGA TGATCGCGTT CGACGAGGAG GCTCGCCGCG GCCTCGAGCG GGGCATGAAC CAGCTCGCTG ACGCCGTGAA GGTGACCCTC GGCCCCAAGG GCCGCAACGT CGTGCTCGAG AAGAAGTGGG GCGCCCCCAC CATCACCAAC GATGGTGTGA GCATCGCCAA GGAGATCGAG CTCGAGGACC CGTACGAGAA GATCGGCGCC GAGCTGGTCA AGGAGGTCGC GAAGAAGACC GACGACGTCG CCGGTGACGG CACGACGACG GCGACCGTCC TGGCCCAGGC CATGGTCCGC GAGGGCCTGC GTAACGTGGC CGCCGGCGCC AACCCGATGG CCCTGAAGCG GGGCATCGAG ACCGCGGTCG CCAGCGTCTC GGAGGAGCTG CTGAAGCTCG CCAAGGACGT CGAGACCAAG GAGCAGATCG CCTCCACCGC CTCCATCTCC GCTGGTGACC CCAGCGTCGG CGAGATCATC GCCGAGGCGA TGGACAAGGT CGGCAAGGAA GGCGTCATCA CCGTCGAGGA GAGCAACACC TTCGGCCTGG AGCTCGAGCT CACCGAGGGT ATGCGCTTCG ACAAGGGCTA CATCTCCGCC TACTTCATGA CCGATCCGGA GCGGATGGAG GCCGTCTTCG ACGACCCGTA CATCCTGATC GTCAACAGCA AGATCTCCTC GGTCAAGGAC CTGCTCCCGA TCCTGGAGAA GGTCATGCAG TCGGGTAAGC CGCTGCTGAT CATCGCCGAG GACATCGAGG GTGAGGCCCT GGCGACCCTG GTCGTCAACA AGGTCCGTGG CACCTTCAAG TCCGTCGCCG TCAAGGCTCC GGGCTTCGGT GACCGCCGCA AGGCCATGCT CGGTGACATT GCCATCCTCA CCGGTGGCCA GGTCATCAGC GAGGAGGTCG GCCTCAAGCT CGACGCCGTC AACCTCGACA TGGTGGGCCG GGCCCGCAAG GTCGTGGTGA CCAAGGACGA GACCACCATC GTCGATGGCG CCGGCGACGC CGAGCAGATC CAGGGTCGGG TCAACCAGAT CCGGGCCGAG ATCGACAAGA GCGACTCCGA CTACGACCGG GAGAAGCTGC AGGAGCGGCT GGCCAAGCTG GCCGGCGGCG TTGCGGTGAT CAAGGTCGGT GCGGCCACCG AGGTCGAGCT GAAGGAGCGC AAGCACCGCA TCGAGGACGC CGTTCGCAAC GCGAAGGCCG CCGTCGAGGA GGGCATCGTC CCGGGTGGTG GCGTCGCGCT GGTGCAGGCC GGCAAGACCG CCTTCGACAA GCTCGACCTG ACCGGTGACG AGGCGACCGG CGCCCAGATC GTCAAGATCG CGCTGGACGG CCCGCTGCGG CAGATCGCCG TCAACGCCGG CCTCGAGGGT GGCGTCGTCG TCGAGCACGT GCGTGGCATC GAGGCGGGTC ATGGCCTGAA CGCCGCGACC GGTGGGTACG TGGACCTGAT GGCCGCGGGC ATCATCGACC CGGCCAAGGT GACCCGGTCG GCGCTGCAGA ACGCGTCGTC GATCGCGGCG CTGTTCCTCA CCACCGAGGC CGTCGTGGCG GACAAGCCGG AGAAGACCCC GGCCGCCCCG GCTGCTCCGG GCGGCGGGGA AATGGACTTC TGA
|
Protein sequence | MAKMIAFDEE ARRGLERGMN QLADAVKVTL GPKGRNVVLE KKWGAPTITN DGVSIAKEIE LEDPYEKIGA ELVKEVAKKT DDVAGDGTTT ATVLAQAMVR EGLRNVAAGA NPMALKRGIE TAVASVSEEL LKLAKDVETK EQIASTASIS AGDPSVGEII AEAMDKVGKE GVITVEESNT FGLELELTEG MRFDKGYISA YFMTDPERME AVFDDPYILI VNSKISSVKD LLPILEKVMQ SGKPLLIIAE DIEGEALATL VVNKVRGTFK SVAVKAPGFG DRRKAMLGDI AILTGGQVIS EEVGLKLDAV NLDMVGRARK VVVTKDETTI VDGAGDAEQI QGRVNQIRAE IDKSDSDYDR EKLQERLAKL AGGVAVIKVG AATEVELKER KHRIEDAVRN AKAAVEEGIV PGGGVALVQA GKTAFDKLDL TGDEATGAQI VKIALDGPLR QIAVNAGLEG GVVVEHVRGI EAGHGLNAAT GGYVDLMAAG IIDPAKVTRS ALQNASSIAA LFLTTEAVVA DKPEKTPAAP AAPGGGEMDF
|
| |