Gene Sfum_3647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_3647 
Symbol 
ID4458037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp4453516 
End bp4454958 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content59% 
IMG OID639704419 
Productpeptidase M48, Ste24p 
Protein accessionYP_847752 
Protein GI116751065 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00143596 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.526227 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCAC TGACCAGAGC CGCACGGGTG TGTCTCGTGA TCGCAACCGT GGTTTGTTTC 
ACCTTCACGA ATGCGGTGGC GCCGCTCTCC CGAAGCTTCG CCTTCACGCT CAGCGAAGAG
AACGAACTGG GGCGAAAGCT TCTCGAAAAG ATCAGGCAGC ACATGCAACT GGTCGAGGAC
GGCGAAGTGC TCACGTACGT GCAGTCCGTG GGGAACCGGA TCGTCGAGCA TCTGGGGACC
ACTCCCTACG AATTCCGGTT CTTCGTCATC AATGAACCGG TGCCCAACGC GTTCGCTATC
CCCGGCGGTT ACGTTTTCGT CTTTCGGGGA CTCATCGAAG TGCTGGAAGA CGAAGGGGAG
TTGGCCTCCA TTCTGAGTCA TGAGTTGGCT CACGTGCAGG CACGTCACAT CGAACGCCGG
ATGAAGGAAG GCCGCATCCT GTCGGTTGCC TCGCTGGCGG GATTGTTGGC CGGGATTCTG
CTTGCCAGCA AGACCGGTGC GAGCCCCGCC CTGGCTCTCG GAGGTGCCGC GGGCGCTCAG
AGCGCCGCCT TGAAATACAG CCGCGAATTC GAAACCGAGG CGGACCAGAT CGGGCTGAGA
AACCTGTGCG AGGCGGGGTA CGACCCGAAG GACATGGCTG AAGCGATGCA ACGGCTCGAG
CAGTTCAGAT TTCTGAACAA CGCAAGGATA CCGAGTTACC TGTCTACGCA CCCCGCTGTG
GCGGAGCGGG TCCAGTACCT GAAAGAGCTG GCGTACTCTC ACAAGGAATG GTTCGGGAAG
AAGAGGGTGT CGCCCACCGC GATGGATTTC CCGCTCATGA AGGCGGCGCT CATTGCCGAT
TACTCAGAGC CTGCACGAGC CATGGAACGG TTCACGGCCG GGGTCAAGAA AAGAGAGGCG
GAAGCCTTCT ACGGCATGGG AAGGCTTTAC ATGCGGCAGG GAGACAACGC ACAGGCCCTG
TCCATGCTGC AGGAAGCGGT CAGGCTTCGG CCGGGCAGCC CGTTCATCCT GAGCAGCCTC
GGCAAGGTGT ATCACCAGCT GGGGAGGCTT CCCGAGGCGC AGAAAGCGCT TCAGACCGCC
CTGTTGATGG ATTCTTCGAC GATCATCGCT CAATACAGGC TGGCGCTTGT CCTCCAGGAC
CAGGGGAAGC GGGAGGAGGC CCTGGAGCAC CTTCAGAGCA TTCAGAGATA TGCCCTGTCT
TTTCCCGATA TCGACTACCA GATGGGGATC ATTCTGGGGC AAAACAACCG CATCGGACTG
GCTCACTACC ATCTGGGGCA CTATTACGAA AGCAAGCAGG ACCTGAAGCT GGCCTTGTTC
CACTACCAGA AGGCCCGAGT GCTGCTCAGG GATTCGGTTG AAAAGATGAA TGAGCTGGAC
AAGACCATCA AAGTGCTTGA GAAGGAGAAG AAGGAGAGCG TGTCACATCA GGCCCGCCGG
TAA
 
Protein sequence
MKPLTRAARV CLVIATVVCF TFTNAVAPLS RSFAFTLSEE NELGRKLLEK IRQHMQLVED 
GEVLTYVQSV GNRIVEHLGT TPYEFRFFVI NEPVPNAFAI PGGYVFVFRG LIEVLEDEGE
LASILSHELA HVQARHIERR MKEGRILSVA SLAGLLAGIL LASKTGASPA LALGGAAGAQ
SAALKYSREF ETEADQIGLR NLCEAGYDPK DMAEAMQRLE QFRFLNNARI PSYLSTHPAV
AERVQYLKEL AYSHKEWFGK KRVSPTAMDF PLMKAALIAD YSEPARAMER FTAGVKKREA
EAFYGMGRLY MRQGDNAQAL SMLQEAVRLR PGSPFILSSL GKVYHQLGRL PEAQKALQTA
LLMDSSTIIA QYRLALVLQD QGKREEALEH LQSIQRYALS FPDIDYQMGI ILGQNNRIGL
AHYHLGHYYE SKQDLKLALF HYQKARVLLR DSVEKMNELD KTIKVLEKEK KESVSHQARR