Gene Information |
Locus tag | Sfum_0121 |
Symbol | groEL |
ID | 4460431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | + |
Start bp | 146644 |
End bp | 148275 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639700877 |
Product | chaperonin GroEL |
Protein accession | YP_844259 |
Protein GI | 116747572 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
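
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that Gene Length counts the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Sfum_0121.
length = gene_length(146644, 148275)
print(length)                  # 1632
print(protein_length(length))  # 543
```

Both results agree with the Gene Length (1632 bp) and Protein Length (543 aa) fields in the record.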
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0222262 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAAAC AGCTGATCTA CGATGTCAAA GCTCGCGAAG CTTTGCTGAG CGGAGTGAAT ATTCTGGCGG ACGCAGTGAA GGTAACGCTT GGTCCCAAAG GCCGCAACGT GGTGATTGAA AAGGCATTCG GCGGGCCTAC CGTGACCAAG GACGGCGTGA CCGTTGCCAA GGAAATCGAG CTCGAGGACA AGTTCGAGAA CATGGGTGCC CAGATGGTGA AGGAAGTCGC CAGCAAGACC AGCGACGTTG CCGGTGACGG GACCACCACG GCGACCATAC TCGCTCAGTC CATCTATTAC GAAGGTTCCA AGCTGGTTGC CGCCGGTGCC AATCCCATGG CGCTCAAGCG CGGCATCGAA AAGGCTGTGC AGGTGGTGGT GGACGAACTG AAGAAGATCA GCAAGCCCAC CAAGGACCAG AAGGAAATCG CCCAGGTCGG CACCATTTCG GCGAACAACG ATCCCACCAT CGGCAACATC ATCGCCGAAG CCATGAACAA GGTGGGCAAG GAAGGTGTGA TCACGGTCGA GGAAGCCAAG GCCATGGAGA CAACCCTGGA AGTTGTCGAG GGCATGCAGT TCGATCGCGG CTATATTTCC CCGTATTTCG TGACCGACCC CGAAAAGATG GAAGTCCTTC TGAACGAACC CCTGATTCTC ATCAACGAGA AGAAGATCAG CAACATGAAG GACCTGCTGC CGGTCCTGGA GCAGATCGCC AAGATGGGCA GACCGTTGCT GATCATTGCC GAAGACGTCG AAGGCGAAGC GCTGGCCACC CTGGTGGTGA ACAAGCTGCG CGGAACGCTG CATGTGTGCG CCGTGAAGGC ACCGGGATTC GGCGATCGCC GCAAGGCCAT GCTCGACGAC ATCGCCATTC TGACCGGCGG CCAGGTGATC AGTGAAGAGA AGGGCATCAA GCTCGAGTCC GTCGGGCTGA ACGATCTCGG GAAAGCAAAG ACCATCCGGA TCGACAAGGA CAACACCACC ATCGTCGACG GCGCGGGCGA TCGCAAGGCG CTCGAAGGGC GGGTGCGCCA GATTCGGACC CAGATCGACG AGACCACCAG CGATTACGAC CGTGAGAAGC TGCAGGAGCG GCTGGCAAAA ATGGTCGGCG GAGTGGCGGT CATCAGTGTC GGCGCGGCCA CCGAAACCGA AATGAAAGAG AAGAAGGCGC GCGTCGAGGA CGCTTTGAAC GCTACCCGGG CCGCAGTGGA GGAAGGCATC GTTCCCGGCG GCGGAGTGGC CTATCTGCGC TGCCTCGGGG CACTGGGAGC GGTGAACCTG GAAGGCGACG AAAAACTGGG GCTCAACATC GTCAAACGCG CACTTGAAGA GCCCGCCCGC CAGATTGCCA TGAATGCCGG TGAGGAAGGC TCCGTAATCG TGCAGAGGGT CAAGTCCGAA ACGGGCGCTT TCGGTTTCGA CGCGGAAACC AGCCAGTTCT GCGACCTCAT CGAAGCGGGT GTCATCGATC CGACCAAAGT GACCCGTACC GCTCTGCTCA ACGCGGCCAG CGTTTCGGCA TTGATGCTGA CCACCGAGTG CATGGTTTCG GAAATCCCGA AAGAGGACAA GGGAGCCCCT GCAGGGATGG GCGGAATGCC CCCCGGAGGC GGAATGTACT AA
|
Protein sequence | MAKQLIYDVK AREALLSGVN ILADAVKVTL GPKGRNVVIE KAFGGPTVTK DGVTVAKEIE LEDKFENMGA QMVKEVASKT SDVAGDGTTT ATILAQSIYY EGSKLVAAGA NPMALKRGIE KAVQVVVDEL KKISKPTKDQ KEIAQVGTIS ANNDPTIGNI IAEAMNKVGK EGVITVEEAK AMETTLEVVE GMQFDRGYIS PYFVTDPEKM EVLLNEPLIL INEKKISNMK DLLPVLEQIA KMGRPLLIIA EDVEGEALAT LVVNKLRGTL HVCAVKAPGF GDRRKAMLDD IAILTGGQVI SEEKGIKLES VGLNDLGKAK TIRIDKDNTT IVDGAGDRKA LEGRVRQIRT QIDETTSDYD REKLQERLAK MVGGVAVISV GAATETEMKE KKARVEDALN ATRAAVEEGI VPGGGVAYLR CLGALGAVNL EGDEKLGLNI VKRALEEPAR QIAMNAGEEG SVIVQRVKSE TGAFGFDAET SQFCDLIEAG VIDPTKVTRT ALLNAASVSA LMLTTECMVS EIPKEDKGAP AGMGGMPPGG GMY
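
The relationship between the two sequence fields can be illustrated with a minimal sketch that strips the 10-character grouping, computes GC fraction, and translates codons. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names are hypothetical, and alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
# Amino-acid assignments with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Assumes the sequence contains only A, C, G, and T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence in the record.
print(translate("ATGGCGAAAC AGCTGATCTA CGATGTCAAA"))  # MAKQLIYDVK
```

The printed residues match the first group of the Protein sequence field. Passing the full Gene sequence value to `gc_fraction` and `translate` allows the GC content and Protein sequence fields to be checked in the same way.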