Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3854 |
Symbol | groEL |
ID | 5672217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4576310 |
End bp | 4577947 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242732 |
Product | chaperonin GroEL |
Protein accession | YP_001508152 |
Protein GI | 158315644 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAAGA TCATTGCCTT CGATGAAGAG GCACGGCGCG GCCTGGAGCG CGGCATGAAC CAGCTGGCCG ACGCGGTCAA GGTCACGCTC GGCCCCAAGG GTCGCAACGT CGTGCTGGAG AAGAAGTGGG GCGTCCCCAC GATCACCAAC GACGGCGTCA GCATCGCCAA GGAGATCGAG CTCGAGGACC CGTACGAGAA GATCGGCGCG GAGCTCGTCA AGGAAGTCGC GAAGAAGACC AACGACGTCG CGGGTGACGG CACCACCACC GCCACGATCC TCGCCCAGGC GCTCGTCCGC GAGGGCCTGC GCAACGTCGC CGCCGGTGCG AACCCGATCG GCCTGAAGAA GGGCATCGAG GCCGCCGTCG AGCGTGTCTC CGAGGAGCTC TCCAGCGTCG CCAAGGACGT GGAGACCAAG GAGCAGATCG CCTCGGCCGC CTCCATCTCC GCCGGTGACC CGGCCATCGG CGCCATGATC GCCGAGGCGA TGGACAAGGT CGGCAAGGAA GGCGTCATCA CCGTCGAGGA GAGCAACACC TTCGGGCTCG AGCTCGAGCT CACCGAGGGC ATGCGCTTCG ACAAGGGCTA CATCTCGCCC TACTTCGTCA CCGACACCGA CCGCATGGAA GCCGTCCTCG ACGACCCGTA CATCCTGATC ACCAACAGCA AGATCTCCGC GGTCAAGGAC CTCCTCCCGA TCCTGGAGAA GGTCATGCAG GCCGGCAAGC AACTCGTGAT CCTCTCCGAG GACGTCGAGG GCGAGGCCCT GGCCACCCTG GTCGTGAACA AGATCCGCGG CACGTTCAAG AGCGCCGCGG TCAAGGCGCC CGGCTTCGGT GACCGCCGCA AGGCCATGCT GACCGACATC GCCGTCCTCA CCGGCGGCCA GGTCATCTCC GAGGACATCG GCCTCAAGCT CGAGGGCACC ACCGTCGACC TGCTCGGCCG GGCCCGCAAG GTCGTCATCA CCAAGGACGA GACCACCATC GTCGAGGGTG CCGGCGACGC GGACCAGATC GCGGGGCGGG TCAACCAGAT CCGCAACGAG ATCGAGAAGT CCGACTCCGA CTACGACCGC GAGAAGCTCC AGGAGCGGCT GGCCAAGCTC GCCGGCGGCG TCGCGGTCAT CAAGGTCGGC GCGGCCACCG AGGTCGAGCT CAAGGAGAAG AAGCACCGCA TCGAGGACGC CGTCTCGAAC GCGAAGGCCG CGGTCGAGGA GGGCATCGTC GCCGGCGGTG GCGTCGCGCT CCTGCAGGCC TCCACCAGCG CCTTCGAGAA GCTCGACCTC TCCGGCGACG AGGCCACCGG CGCCCTGATC GTCGAGCGCG CGCTCGCCGC ACCGCTACGC CAGATCGCCA CCAACGCCGG GCTCGAGGGC GGCGTCGTGG TCGAGAAGGT CCGCGGTCTC CCGACCGGGC ACGGCCTGAA CGCCGCCACC GGCGAGTACG TCGACATGAT CGCCGCCGGG ATCATCGACC CGGTGAAGGT CACCCGCTCG GCGCTACAGA ACGCCGCGTC CATCACCGGC CTCTTCCTCA CCATCGAGGT CGTCGTCGCG GACAAGCCGG AGCCCGCGGG CGCCGGCGGT GGTGCTGACG CGGCCGCCAT GGGCGGCATG GGCGGCATGG GCTTCTGA
|
Protein sequence | MPKIIAFDEE ARRGLERGMN QLADAVKVTL GPKGRNVVLE KKWGVPTITN DGVSIAKEIE LEDPYEKIGA ELVKEVAKKT NDVAGDGTTT ATILAQALVR EGLRNVAAGA NPIGLKKGIE AAVERVSEEL SSVAKDVETK EQIASAASIS AGDPAIGAMI AEAMDKVGKE GVITVEESNT FGLELELTEG MRFDKGYISP YFVTDTDRME AVLDDPYILI TNSKISAVKD LLPILEKVMQ AGKQLVILSE DVEGEALATL VVNKIRGTFK SAAVKAPGFG DRRKAMLTDI AVLTGGQVIS EDIGLKLEGT TVDLLGRARK VVITKDETTI VEGAGDADQI AGRVNQIRNE IEKSDSDYDR EKLQERLAKL AGGVAVIKVG AATEVELKEK KHRIEDAVSN AKAAVEEGIV AGGGVALLQA STSAFEKLDL SGDEATGALI VERALAAPLR QIATNAGLEG GVVVEKVRGL PTGHGLNAAT GEYVDMIAAG IIDPVKVTRS ALQNAASITG LFLTIEVVVA DKPEPAGAGG GADAAAMGGM GGMGF
|
| |