Gene Sfum_0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_0043 
Symbol 
ID4460983 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp57077 
End bp59407 
Gene Length2331 bp 
Protein Length776 aa 
Translation table11 
GC content61% 
IMG OID639700795 
ProductATP-dependent Clp protease, ATP-binding subunit clpA 
Protein accessionYP_844181 
Protein GI116747494 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID[TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00409723 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.565561 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAACA GAGAGCTTGA AATAACGCTG TCGGCCGCGA TCAAGGAGGC CAAGACCAGA 
CGGCATGAAT TCTTCACCCT GGAGCACGTG CTGTTCGCGA TGCTGCACGA TGTAACGGGA
CGGCGCGTCC TTTTCCATTG CGGAGCGGAC CTGGAGACGT TGAAAGGGAA GCTGCTCAAG
TTTTTTGAGG AGCGGGCGGA GAAGCTTCCG GAGGGCATCG ACCAGGATCC GATTCAGACC
TTGAGCGTTC AGAGGGTGTT GCAGCGGGCG GTGATGCACG TGCACGGGGC CGAAAAGAAG
GAAGTGGATG CCGGGGACAT ACTGGCCGCG ATGTTCTACG AAGAGAACTC TTACGCCGTC
TATTTCCTGA AGTCCGAGGG CGTGAGCCGC CTGGATGTGC TGAGCTATAT TTCGCACGGC
ATCTCCAAAG TCGGGTATCC CGAAGAGGAT GAGGGGGCCC GGGACGAGCC TGCACACGAG
GTCGCGCCCG GGGAGGAAGG CGAACCGCGG AGGACGTCGG CCCTGGAAAG CTTCGCCGTG
AACCTCATCG AAAAGGCTGC CCAGGGACGG ATCGATCCAC TCATCGGGCG TGAGGAAGAG
ATCCTTAGGA CGCTCCAGAT CCTTGGACGC AGGAGCAAGA ACAACCCGAT CTTCGTCGGA
GACCCCGGAG TGGGGAAGAC GGCGATTGCC GAGGGGCTGG CGCTGAAAAT CCAGAAGGGA
GAGGTGCCCG AGGTCTTCCA GAACATAGAG ATTTTCGCCT TGGACATGGG AGCGCTTCTT
GCGGGCACGA AGTTCCGGGG CGATTTCGAG GCGCGCCTCA AAAGCGTGAT CCAGGAGCTC
AAGAAGAAGG AAGGCGCGAT CCTGTTCATC GACGAGATCC ACACGGTGGT GGGGGCTGGT
GCGACGAGCG GCGGCTCGAT GGACGCCTCC AATATCCTCA AGCCCGTTCT GGTCACCGGA
GGGCTGCGGT GTATCGGGTC CACCACCTAT GAGGAATACA AGAACCACTT TGAGAAAGAC
CGTGCGCTGA GTCGCCGGTT CCAGAAGGTC GAAATCCACG AGCCCTCCGT GGAGGAGACC
TACCGAATCC TGCTGGGACT GCGGCAATAC TACGAGAAGC ACCATGGGAT CCGGTACACG
GACGCCGCAT TGAGGGCGGC GACGGAACTC TCCAACCGGT TCATCAACGA TCGCTATCTG
CCGGACAAGG CCATCGACGT CATTGACGAG ACGGGGGCCA GCCTGCGGCT GAAGAAAGAC
CGGCGGGTGC GGCGAGTGGT CGGGCCGAAG GACATCGAGC TGGTGGTGGC CAGGATCGCG
AAGATTCCTC CCCGGTCGGT CTCCGCCTCC GACCAGTTGC GGCTTCACAG CCTGGACGAG
GAGCTCAAGG GCCAGGTGTT TGGCCAGGAC GGGGCCATCG ATGTCCTGGT CAAGGCGATC
AAGCGGTCTC GAGCGGGTCT GAGGATTCCG GAGCGGCCCA TCGGGTCTTT CCTTTTCATC
GGGCCTACGG GAGTCGGAAA AACCGAAGTC GCCAAGCAAC TGGCACGCAT CCTCGGGGTC
AATTTCCTGC GCTTCGATAT GAGCGAATAT ATGGAGAAGC ACACGGTCGC ACGGCTGATC
GGCGCGCCTC CGGGGTATAT CGGATTCGAC CAGGGAGGTC TGCTGACCGA CGCCATCCGC
AAGCAGCCTT ACACGGTCCT GCTGCTGGAC GAGATCGAGA AGGCCCACCC CGACCTTTTC
AGCATTCTCC TGCAGGTGAT GGACCATGCG ACGCTCACGG ACAACAACGG CAAGAAGGCG
GATTTTCGCA ACGTCATCCT GCTCATGACC TCAAACGCCG GGGCGCGGGA GATGAGCATG
TCGTCCATCG GTTTCGGCGG AGGCACGGTG GAACCCGACC ACCGCGCGAT CAAGGGCCTG
AAGGCCGTGG AGAGCCTCTT CAGCCCGGAA TTCCGCAATC GCCTGGACGG GATCGTCCTG
TTCAACGGTC TGAACCTCGA GATCATGGAG CAGATTGTCG ACAAGTTCGT CGTCGAGATG
GAAACGCAAC TGGGCGAGAA GAAGATCCGC ATGGAATTGA GCGCCGAGGC GAGGCGCTGG
CTGGGCGGGC AGGGCTACGA CGTGACGTTC GGAGCACGGC CGCTTGCGCG GTTGATCCAG
ACCGAAGTCA AGGATGTGCT GGCTGACGAA ATTCTTTTCG GGAGGCTCAT GCACGGGGGC
CGCGTGCAGG TCTTCAGACC TGACGAGGCC GTACCGGCAG GCGAGGAGAT TCTGAAGACG
TCCAACCTGG TCTTCGTTTT TCCGGACGTT CCGGGCGTGC CTGCGGCATA G
 
Protein sequence
MINRELEITL SAAIKEAKTR RHEFFTLEHV LFAMLHDVTG RRVLFHCGAD LETLKGKLLK 
FFEERAEKLP EGIDQDPIQT LSVQRVLQRA VMHVHGAEKK EVDAGDILAA MFYEENSYAV
YFLKSEGVSR LDVLSYISHG ISKVGYPEED EGARDEPAHE VAPGEEGEPR RTSALESFAV
NLIEKAAQGR IDPLIGREEE ILRTLQILGR RSKNNPIFVG DPGVGKTAIA EGLALKIQKG
EVPEVFQNIE IFALDMGALL AGTKFRGDFE ARLKSVIQEL KKKEGAILFI DEIHTVVGAG
ATSGGSMDAS NILKPVLVTG GLRCIGSTTY EEYKNHFEKD RALSRRFQKV EIHEPSVEET
YRILLGLRQY YEKHHGIRYT DAALRAATEL SNRFINDRYL PDKAIDVIDE TGASLRLKKD
RRVRRVVGPK DIELVVARIA KIPPRSVSAS DQLRLHSLDE ELKGQVFGQD GAIDVLVKAI
KRSRAGLRIP ERPIGSFLFI GPTGVGKTEV AKQLARILGV NFLRFDMSEY MEKHTVARLI
GAPPGYIGFD QGGLLTDAIR KQPYTVLLLD EIEKAHPDLF SILLQVMDHA TLTDNNGKKA
DFRNVILLMT SNAGAREMSM SSIGFGGGTV EPDHRAIKGL KAVESLFSPE FRNRLDGIVL
FNGLNLEIME QIVDKFVVEM ETQLGEKKIR MELSAEARRW LGGQGYDVTF GARPLARLIQ
TEVKDVLADE ILFGRLMHGG RVQVFRPDEA VPAGEEILKT SNLVFVFPDV PGVPAA