Gene Sfum_0121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_0121 
SymbolgroEL 
ID4460431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp146644 
End bp148275 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content59% 
IMG OID639700877 
Productchaperonin GroEL 
Protein accessionYP_844259 
Protein GI116747572 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0222262 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAAC AGCTGATCTA CGATGTCAAA GCTCGCGAAG CTTTGCTGAG CGGAGTGAAT 
ATTCTGGCGG ACGCAGTGAA GGTAACGCTT GGTCCCAAAG GCCGCAACGT GGTGATTGAA
AAGGCATTCG GCGGGCCTAC CGTGACCAAG GACGGCGTGA CCGTTGCCAA GGAAATCGAG
CTCGAGGACA AGTTCGAGAA CATGGGTGCC CAGATGGTGA AGGAAGTCGC CAGCAAGACC
AGCGACGTTG CCGGTGACGG GACCACCACG GCGACCATAC TCGCTCAGTC CATCTATTAC
GAAGGTTCCA AGCTGGTTGC CGCCGGTGCC AATCCCATGG CGCTCAAGCG CGGCATCGAA
AAGGCTGTGC AGGTGGTGGT GGACGAACTG AAGAAGATCA GCAAGCCCAC CAAGGACCAG
AAGGAAATCG CCCAGGTCGG CACCATTTCG GCGAACAACG ATCCCACCAT CGGCAACATC
ATCGCCGAAG CCATGAACAA GGTGGGCAAG GAAGGTGTGA TCACGGTCGA GGAAGCCAAG
GCCATGGAGA CAACCCTGGA AGTTGTCGAG GGCATGCAGT TCGATCGCGG CTATATTTCC
CCGTATTTCG TGACCGACCC CGAAAAGATG GAAGTCCTTC TGAACGAACC CCTGATTCTC
ATCAACGAGA AGAAGATCAG CAACATGAAG GACCTGCTGC CGGTCCTGGA GCAGATCGCC
AAGATGGGCA GACCGTTGCT GATCATTGCC GAAGACGTCG AAGGCGAAGC GCTGGCCACC
CTGGTGGTGA ACAAGCTGCG CGGAACGCTG CATGTGTGCG CCGTGAAGGC ACCGGGATTC
GGCGATCGCC GCAAGGCCAT GCTCGACGAC ATCGCCATTC TGACCGGCGG CCAGGTGATC
AGTGAAGAGA AGGGCATCAA GCTCGAGTCC GTCGGGCTGA ACGATCTCGG GAAAGCAAAG
ACCATCCGGA TCGACAAGGA CAACACCACC ATCGTCGACG GCGCGGGCGA TCGCAAGGCG
CTCGAAGGGC GGGTGCGCCA GATTCGGACC CAGATCGACG AGACCACCAG CGATTACGAC
CGTGAGAAGC TGCAGGAGCG GCTGGCAAAA ATGGTCGGCG GAGTGGCGGT CATCAGTGTC
GGCGCGGCCA CCGAAACCGA AATGAAAGAG AAGAAGGCGC GCGTCGAGGA CGCTTTGAAC
GCTACCCGGG CCGCAGTGGA GGAAGGCATC GTTCCCGGCG GCGGAGTGGC CTATCTGCGC
TGCCTCGGGG CACTGGGAGC GGTGAACCTG GAAGGCGACG AAAAACTGGG GCTCAACATC
GTCAAACGCG CACTTGAAGA GCCCGCCCGC CAGATTGCCA TGAATGCCGG TGAGGAAGGC
TCCGTAATCG TGCAGAGGGT CAAGTCCGAA ACGGGCGCTT TCGGTTTCGA CGCGGAAACC
AGCCAGTTCT GCGACCTCAT CGAAGCGGGT GTCATCGATC CGACCAAAGT GACCCGTACC
GCTCTGCTCA ACGCGGCCAG CGTTTCGGCA TTGATGCTGA CCACCGAGTG CATGGTTTCG
GAAATCCCGA AAGAGGACAA GGGAGCCCCT GCAGGGATGG GCGGAATGCC CCCCGGAGGC
GGAATGTACT AA
 
Protein sequence
MAKQLIYDVK AREALLSGVN ILADAVKVTL GPKGRNVVIE KAFGGPTVTK DGVTVAKEIE 
LEDKFENMGA QMVKEVASKT SDVAGDGTTT ATILAQSIYY EGSKLVAAGA NPMALKRGIE
KAVQVVVDEL KKISKPTKDQ KEIAQVGTIS ANNDPTIGNI IAEAMNKVGK EGVITVEEAK
AMETTLEVVE GMQFDRGYIS PYFVTDPEKM EVLLNEPLIL INEKKISNMK DLLPVLEQIA
KMGRPLLIIA EDVEGEALAT LVVNKLRGTL HVCAVKAPGF GDRRKAMLDD IAILTGGQVI
SEEKGIKLES VGLNDLGKAK TIRIDKDNTT IVDGAGDRKA LEGRVRQIRT QIDETTSDYD
REKLQERLAK MVGGVAVISV GAATETEMKE KKARVEDALN ATRAAVEEGI VPGGGVAYLR
CLGALGAVNL EGDEKLGLNI VKRALEEPAR QIAMNAGEEG SVIVQRVKSE TGAFGFDAET
SQFCDLIEAG VIDPTKVTRT ALLNAASVSA LMLTTECMVS EIPKEDKGAP AGMGGMPPGG
GMY