Gene Information |
Locus tag | Elen_2964 |
Symbol | |
ID | 8417296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3438093 |
End bp | 3439679 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645025941 |
Product | chaperonin GroEL |
Protein accession | YP_003183296 |
Protein GI | 257792690 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
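
The coordinate and length fields above are consistent with each other, and the check can be sketched in a few lines. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive (which matches the reported 1587 bp) and that the CDS includes its stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(3438093, 3439679)
print(length)                     # 1587
print(protein_length_aa(length))  # 528
```

The strand (here "-") does not affect this arithmetic; it only determines which strand of the replicon is read.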
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCCAAAG ACATTGCGTT CGACAACGAA ACCCGTACGA AGATGGCTGC CGGCGTCAAC AAGCTGGCCG ACGCCGTGAG GGTGACGATC GGCCCGAAGG GGCGCTACGT CGCCATGCAG AAGGAGCACG AGAAGCCCAA CGTTTCCAAC GACGGCGCCA CGGTGGCCGC CAATGTCGAT CTTGAAGATC CCATCGAGAA CATGGGCATG AAGATCGTGC GCGAGGCCGC CATCGCCGCG AACAACGACG CGGGCGACGG CACCACGACG GCGACCATCC TTTCCGACGC TATCGTGAGC GAGGGAGTAC GCTGCGTCAT CTCCGGTTCC GATCCGCTGG CGCTGCGCCG CGGCATCCAG CGCGCTGCCG ACGTGGTGGC CGACGAGGTT CTCAAGAACG CCGTCGAGGT TACCACGCGT GAGCAGATCG CCGAGATCGC GACGGTTTCC GCGGGCGACC GCCAGATCGG CGAGAAGATC GCGGAGGCTA TGGATGCCAT CGGCCGTGAC GGCGTCATCT CGGTCGAGAA GTCTCAGAAC TTCGGCATCG AGGTGAAAAT CCTCAAGGGC ATGATGTTCG ACAACGGTTT CATCTCCCCG TACATGGCCG ACGACCCGGC GCGCCTGGAA GGCGAGCTCA CCGAGCCCTA CATCCTGTTG ACCGACCAGC GTCTGGGCGA CAACTTCGCA GACATCGTGC CCGTGCTGGA AGAGGTCATG CAGTCCGGTC ATCCGTTGCT GATCGCCGCT GAAGACGTGC GCGGCGAGGC GCTGAACACG TTGCTGGTGA ACCGCCGTCG CGGCACGCTG ACCAGCGTTG CGGTGAAGGC GCCGGCTCTG GGCGATCGAC GCAAGGCTGA GCTCGAGGAT CTGGCCATCC TCACCGGCGG CGAGGTTATC ACGCCCGATC GCGGCCTGAC GCTGGCCGAT GCGCGCAAGA GCATGCTCGG CCGCGCCGCC AGCGTGCAGA TCACCAAGGA CCGTACCACC ATCTTGGGCG GCAAGGGCAA GCCCGAGGCT ATCGAGCAGC GTTGCGATCA GCTGCGCGCG CAGATCGAAA CGGAGAAGAT CGACTACGAC CGCGACGTTC TGCGCGAGCG TCTGGCCAAG CTGTCCAGCG GTATCGCGGT CATGGAAGTG GGCGCCGCCA CGGAGTCCGA GATGAACGAG ATCCGCAGCC GTATCCAGGA TGCTTTGCTG GCTACCCGCT CGGCCGCAGA GCAGGGTCTG GTGGCCGGTG GCGGCGTGGC GCTGCTGCAG GCCGCGTCCG CGCTGGACGG CCTCGTATGC GAGAACGCCG AGGAGCAGCT GGGTATCGAC ATTCTGCGCA AGGCGCTGGA GGTGCCGCTG CGCGCCCTGG CGGAGAACGC GGGCTATCGC GGCGACGTGG CCGTTGAGAA GGTCAAGGAG CTGCCGCTGG GCCAGGGTCT GGATTGTATG ACCGGCGAGT ACGGCGACAT GATCGGCCGT GGCATCGCCG ACCCGGCGAA GGTTACGGTC ACGGCGCTTC AGGCCGCCGC TTCTGTGGCG TCGCTGATCC TGATCACGAA CGCCTCCGTC AGCGAGACGG TGCCCGAGGA AGACTAA
Protein sequence | MSKDIAFDNE TRTKMAAGVN KLADAVRVTI GPKGRYVAMQ KEHEKPNVSN DGATVAANVD LEDPIENMGM KIVREAAIAA NNDAGDGTTT ATILSDAIVS EGVRCVISGS DPLALRRGIQ RAADVVADEV LKNAVEVTTR EQIAEIATVS AGDRQIGEKI AEAMDAIGRD GVISVEKSQN FGIEVKILKG MMFDNGFISP YMADDPARLE GELTEPYILL TDQRLGDNFA DIVPVLEEVM QSGHPLLIAA EDVRGEALNT LLVNRRRGTL TSVAVKAPAL GDRRKAELED LAILTGGEVI TPDRGLTLAD ARKSMLGRAA SVQITKDRTT ILGGKGKPEA IEQRCDQLRA QIETEKIDYD RDVLRERLAK LSSGIAVMEV GAATESEMNE IRSRIQDALL ATRSAAEQGL VAGGGVALLQ AASALDGLVC ENAEEQLGID ILRKALEVPL RALAENAGYR GDVAVEKVKE LPLGQGLDCM TGEYGDMIGR GIADPAKVTV TALQAAASVA SLILITNASV SETVPEED
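
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of that cleanup plus two simple sanity checks (GC fraction and a rough complete-CDS test). The helper names are hypothetical. The stop codons listed are those of translation table 11, and the start-codon check accepts only ATG, which is a simplification because table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start, recognised stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence shown above, used only as a short demo.
demo = clean_sequence("ATGTCCAAAG ACATTGCGTT")
print(len(demo))  # 20
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared against the Gene Length field, and `gc_fraction` against the reported GC content.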