Gene Information |
Locus tag | Elen_2820 |
Symbol | |
ID | 8417151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3274386 |
End bp | 3276023 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645025800 |
Product | chaperonin GroEL |
Protein accession | YP_003183156 |
Protein GI | 257792550 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
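The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes a single terminal stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information table for Elen_2820.
length = gene_length_bp(3274386, 3276023)
print(length)                     # 1638
print(protein_length_aa(length))  # 545
```

Under these assumptions the computed values match the Gene Length (1638 bp) and Protein Length (545 aa) rows.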
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000232523 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.159069 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAAAG AGATTAAGTT CGAAGCCGAC GCGCGCAGCG CGCTTGCTGC CGGCGTCAAC AAGCTGGCCG ACGCCGTCAA GGTGACGCTC GGGCCCAAGG GTCGTTACGT GGCGCTGGAG AAGTCCTACG GTGCTCCGCT GATCACGAAC GACGGCGTAA CCGTGGCCAA GGAAGTCGAG CTCGAGGACC CCATCGAGAA CATGGGCGCC CAGCTGGTTC GCGAAGTGGC CGTGAAGACG AACGACGTCG CCGGCGACGG CACCACGACG GCCACGCTGC TGGCTGACGT CATCGTGTCC GAGGGCCTGC GCAACGTGAC CGCCGGCGCC GATGCCCTGG GCATCCGTCG CGGCATCCAG AAGGCCACCG ACGCTGTGGT CGAGGCCATC AAGGCCGACG CCACCCCCGT CTCCACCAAG GAGCAGATCG CGAACGTCGG CACCATTTCC GCCGGCGACG CCGAGATCGG CGAGAAGATC GCCGAGGCCA TGGATGCCGT GGGCAAGGAC GGCGCCATCT CCGTCGAGGA GAGCCAGACG TTCGGCATCG ACATGGACAT CGTCGAGGGC ATGCAGTACG AGCGCGGCTA CATCTCGCCG TACATGGCCA CCGACATGGA GAAGATGGAG GCCGTCCTCA GCGATCCCTA CATCCTCCTC ACCGACCAGA AGGTCACCAA CATCCAGGAC ATGGTGCCCC TGCTGGAGGA GATCATGAAG TCCGGTCGTC CGCTGTTCAT CGTTGCCGAG GACGTCGAGG GCGAAGCGCT GGCCACCATC CTGCTGAACA AGCTGCGCGG CACGTTCAAC TGCGTCGCCA TCAAGGCTCC CGGCTTCGGC GATCGCCGCA AGCGCATCCT CGAGGACATC GCCGCCGTCA CGGGCGCGCA GGTCATCGAC AAGGACTTCG GCATGACCAT GGCCGACGCC CGCATCGACA TGCTGGGCCA TGCCAAGACG GTCAAGGTCA CGAAGGACTC CGCCCTCATC GTGGACGGCG CCGGCGACAA GAAGGCCATC GACGATCGCA TCGGCCAGAT CAAGGCCGAG CTCGAGCGCG TCGACTCCGA CTTCGACCGC GAGAAGCTCC AGGAGCGCCT GGCGAAGCTG TCCGGCGGCG TGGCGGTGCT CAAGGTGGGC GCTGCGACCG AGTCCGAGCT CAAGGAGAAG AAGTCCCGCA TCGAGGACGC CCTGCAGGCG ACCCGCGCGG CGGTCGAAGA GGGCATCGTC GCCGGCGGCG GCGTGGCTCT GGTGGGCGCT CTGCCCGCGC TCGACAAGGT GGAAGCGGCC GACAAGGACG AAGAGGTCGG CGTCGCCATC ATCCGCAAGG CGCTCGAAGC TCCGATGCGC GCTATCGCGC AGAACGCCGG CTTCGAGGGC AGCGTCGTAG TCGAGCGCGT CAAGGGCATG GCTACGGGCG AAGGCCTGAA CTGCGCCAAC GGCGAGTACG GCAACATGAT CGAGATGGGC GTCAACGACC CGGTGAAGGT CACCCGCACG GCTCTGCAGT CTGCGGCCTC CGTCGCGGCT CTCATTCTCA TCACCGAGGC CACCATCAAC GAGATCCCGA AGGATCCGGA TCCGGCGGCT CTGGCGGCTA TGGCCGGTGC CGGCGGTGGC ATGGGCGGCA TGATGTAA
|
Protein sequence | MAKEIKFEAD ARSALAAGVN KLADAVKVTL GPKGRYVALE KSYGAPLITN DGVTVAKEVE LEDPIENMGA QLVREVAVKT NDVAGDGTTT ATLLADVIVS EGLRNVTAGA DALGIRRGIQ KATDAVVEAI KADATPVSTK EQIANVGTIS AGDAEIGEKI AEAMDAVGKD GAISVEESQT FGIDMDIVEG MQYERGYISP YMATDMEKME AVLSDPYILL TDQKVTNIQD MVPLLEEIMK SGRPLFIVAE DVEGEALATI LLNKLRGTFN CVAIKAPGFG DRRKRILEDI AAVTGAQVID KDFGMTMADA RIDMLGHAKT VKVTKDSALI VDGAGDKKAI DDRIGQIKAE LERVDSDFDR EKLQERLAKL SGGVAVLKVG AATESELKEK KSRIEDALQA TRAAVEEGIV AGGGVALVGA LPALDKVEAA DKDEEVGVAI IRKALEAPMR AIAQNAGFEG SVVVERVKGM ATGEGLNCAN GEYGNMIEMG VNDPVKVTRT ALQSAASVAA LILITEATIN EIPKDPDPAA LAAMAGAGGG MGGMM
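The gene and protein sequences can be cross-checked against each other and against the GC content field. The following is a minimal sketch in plain Python, not the method the database uses; the helper names are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), and it assumes the gene sequence is listed 5' to 3' on the coding strand, as the leading ATG suggests. Only the first and last blocks of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon table in the usual NCBI ordering (bases T, C, A, G; third base varies fastest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last two blocks of the gene sequence from the record.
head = "ATGGCTAAAG AGATTAAGTT CGAAGCCGAC"
tail = "ATGGGCGGCA TGATGTAA"

print(translate(head))  # MAKEIKFEAD
print(translate(tail))  # MGGMM
```

The two translated fragments agree with the start and end of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content row.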