Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2129 |
Symbol | |
ID | 3833280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2227221 |
End bp | 2228831 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637830054 |
Product | chaperonin GroEL |
Protein accession | YP_430964 |
Protein GI | 83590955 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000000897626 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAAC AAGTAGTTTT CGACCGCGAA GCCAGGGAGG CCCTGGAAAA AGGTATTACC AAACTCACCG AAGCCGTCCG GGTCACCCTG GGACCCCGGG GACGCAACGT GGTCTTGGAA AAGAAATTCG GGGCCCCTAC CATTACCAAC GACGGCGTGA CCATTGCCAA GGAGGTTGAG CTGGAAGACC CCCTTGAGAA TGTGGGCGCG CTGCTGGTGC GGGAAGTAGC CTCCAAGACC AACGATGTCG CCGGCGATGG GACTACTACC GCCTGTGTTC TGGCCCAGGC TATTGTCCGG GAGGGCATGA AAAACGTGGC CGCCGGGGCC AATCCTATGT TCATGAAGCG GGGTATTGAA AAGGCGGTGG CGGCCGTGGT GGAGAACCTT AAGGCCCAGG CCCGGCCGGT GGAAACCAAG GACTCCATCA GCCAGGTAGC CTCCATTTCT GCCAATGACC CCCAGATCGG CGCCCTGGTG GCCGACGCCA TGGAAAAGGT GGGGAAGGAC GGCGTCATAA CCGTGGAGGA ATCCAAAGGT ATGGAAACCG CCGTGGACGT CGTGGAAGGC ATGCAGTTCG ATCGCGGCTA TATCTCTCCG TATATGGTCA CCGACAACGA GCGCATGGAA GCCGTCCTGG AAGAGCCCTA CATCCTCATC ACCGATAAAA AGATTACCGC CGTGGCCGAC CTGGTACCCG TCCTGGAACG GGTAGTACGG ACGGGTAAAC CCCTGCTTAT AATCTGCGAG GATATGGAGG GTGAAGCTCT GGCCACCCTG GTGGTCAATA AGATCCGGGG CACCTTTACC TGCGTGGCCG TCAAGGCGCC GGCCTTCGGC GATCGGCGCA AGGCCATGCT ACAGGACATC GCCATCCTGA CGGGTGGCCA GGTTATTACC GAAGAAGCCG GCTTGAAACT GGAGAACACC ACCCTGGATA TGCTGGGCCA GGCGCGCCAG GTCCGGGTGG GTAAAGAGGA AACCACCATC GTCGAAGGTC GCGGCAAAGA AGAGGCCATT GAAGCCCGGA TAGCCCAGAT TCGCCGCGAG TACGAGGAGT CGACCTCGGA CTACGACCGG GAGAAACTCC AGGAACGCCT GGCCAAACTG GCCGGCGGTG TGGCGGTCAT TAAAGTCGGG GCCGCTACCG AGACGGAAAT GAAAGAAAAG AAAATGCGCA TCGAAGACGC CCTCGCAGCC ACCCGGGCGG CGGTGGAGGA GGGCATTGTC CCCGGCGGCG GCACCGCCCT GGTACGCGCC CAGACGGCCC TGGATGGTGT CCAGGCCCAG GGTGACGAAC TGACGGGGGT GCGTCTGGTC TACCGCGCCC TGGAAGAACC CATGCGCCAG ATTGCGGCCA ATGCCGGCGT TGATGGATCG GTGGTAGTGG AGAAGGTGCG CCAGAGCGGT GACAGCATGG GCTTTAACGC CGCTACCCGG GAGTATGTCA ACCTCTTTGA AGCGGGTATT GTCGATCCCT TGAAGGTGAC CCGTTCCGCC CTGGAGAATG CTGCCAGCAT TGCCTCCCTG GTCCTGACCA CTGAGAGTCT AATAGCCGAC ATTCCGGAGG AAGAACCGCC CGTTCCCGGC GGCGGTATGC CTCCCATGTA A
|
Protein sequence | MAKQVVFDRE AREALEKGIT KLTEAVRVTL GPRGRNVVLE KKFGAPTITN DGVTIAKEVE LEDPLENVGA LLVREVASKT NDVAGDGTTT ACVLAQAIVR EGMKNVAAGA NPMFMKRGIE KAVAAVVENL KAQARPVETK DSISQVASIS ANDPQIGALV ADAMEKVGKD GVITVEESKG METAVDVVEG MQFDRGYISP YMVTDNERME AVLEEPYILI TDKKITAVAD LVPVLERVVR TGKPLLIICE DMEGEALATL VVNKIRGTFT CVAVKAPAFG DRRKAMLQDI AILTGGQVIT EEAGLKLENT TLDMLGQARQ VRVGKEETTI VEGRGKEEAI EARIAQIRRE YEESTSDYDR EKLQERLAKL AGGVAVIKVG AATETEMKEK KMRIEDALAA TRAAVEEGIV PGGGTALVRA QTALDGVQAQ GDELTGVRLV YRALEEPMRQ IAANAGVDGS VVVEKVRQSG DSMGFNAATR EYVNLFEAGI VDPLKVTRSA LENAASIASL VLTTESLIAD IPEEEPPVPG GGMPPM
|
| |