Gene Sfum_1661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_1661 
Symbol 
ID4460033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp2037576 
End bp2040506 
Gene Length2931 bp 
Protein Length976 aa 
Translation table11 
GC content59% 
IMG OID639702430 
Productpeptidase M16C associated domain-containing protein 
Protein accessionYP_845783 
Protein GI116749096 
COG category[R] General function prediction only 
COG ID[COG1026] Predicted Zn-dependent peptidases, insulinase-like 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.902377 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTCC ACAACGGCTT CGAACTGTTG AAACAGCAGT ACGTCCCCGA GATAAGCACG 
GAGATCAAGG TGTTGCGGCA TGTCGGCACA GGCGCGCAGG TCCTGTCGCT GATCAACGAC
GACGAGAACA AGGTCTTCGG GATTTCATTT CGCACGCCGC CCGAAGACTC AACCGGCGTG
GCCCATATCC TGGAACATTC CGTTCTTTGC GGGTCGCGCA AGTTTCCCGT GAAGGAGCCC
TTCGTCGAGT TGCTCAAAGG GTCTTTGAAG ACTTTTCTCA ATGCATTCAC CTATCCCGAC
AAGACCTGCT ACCCCGTGGC AAGCCAGAAT GACAAGGACT TCTACAACCT CATCGACGTC
TACCTGGATG CGGTCTTCCA TCCGCTGATC ACGCCCTATA TCTTCCAGCA GGAGGGCTGG
CATTACGAGC TCGAGTCCGA GGACTCCTCG CTTTCTTACA AGGGAGTGGT CTTCAATGAA
ATGAAAGGGG CCTATTCTTC GCCGGATAAC CTTCTTGCGG AATACTCACA GCAGTCGCTC
TTTCCGGAAA GCACCTACGG GTTGGATTCT GGCGGGAATC CGGAGAAGAT ACCCGATTTG
ACCTACGAGC GATTCAAGGC ATTTCATGAA AGACACTATC ACCCTTCCAA CGCCTATATA
TACTTTTACG GCAATGACGA TCCGGAGAAG CGTCTGCGCT TTTTGAGGAC CTACCTGGAC
GATTTTTCGG CCGTCCCGGC GGATTCATCC GTGGGACTGC AGCCGTTTTT CGACGCGCCC
CGGCGCATCC GTCGCGGCTT TGCATCCGGA ACCGAGGCGG GTCATGGCCG GGGAGCCAAA
CCGCGCGGCA TGATGACGCT CAACTGGCTG CTTCCGGAAA CCTCGAATGC GACCTTGAAT
CTTTCCCTGC AGGTCTTGCG GCACATCCTG ATCGGCATGC CGGGTTCGCC GCTGCGCAAG
GCACTGATCG ACTCGGGACT CGGCGACGAC CTGGCCGGAA CGGGACTCGA GAACGAGCTC
CGGCAGGCAT ACTTCTCCAC CGGGCTGAAG GGCATCGACA CCGACCAGGC CGATCATGTC
GAAAAGCTCA TTCTGGACAC CCTCGCAGGA CTAGCGGGGG ACGGGATCGC GCCTGAATTC
GTCGAAGCGG CTCTGAATAC CGTCGAGTTC CGGCTGCGCG AAAACAACGC AGGAGGTTAC
CCACGCGGCC TGGTCCTCAT GCTCCGGGCG CTTTCCACCT GGCTCTACGA TGGTGACCCG
GCGGCGCTGC TCGCATTCGA GGCCCCGCTG GAGGCCGTCA AGTCTTCGGC CGCCGCAGGG
AAGCGGTATT TTGAAGGAAT GATCGAGCGG CATTTTTTGC AGAATCCGCA CCGCACCACC
CTGATCCTGA AACCCGACCC AACCCGGGCG GATGCGGAGG AAGCCCGGGA ACGCGAACGT
CTTGCCGCAG TCCGTTCCAC CATGAGCGCG GAGCAACTGC GGGCCGTCGT CGAAAACACT
CGCGAGTTGC GACGCAGACA GGAAGCGCCG GATTCGCCCG AAGCCCTTGC CGCCATTCCC
ACCCTGAAAC GGGAGGACCT GGAAAGAACC AACAAGAAGA TCCCCATGGA AGAAACGTTC
CCGGAAGGCT CGAGGCTGCT CTTTCACGAT ATCCATACCA ACGGCATCTT CTATCTTGAC
ATGGCATTCG ACATCCACTC TCTGCCGCAG CACGCCCTGC CTTTCGCTCC GCTGTTCGGC
CGGGCGCTCG TCGAAATCGG TACCGAAACG GAGGATTTCG TGTCTCTGTC CACGAGAATC
AGCCGTCGGA CCGGCGGTAT CCGACCGGAT GTGTTCACGT CGGCGGTGAG AAGCAGCCCG
CACGGCGCTG CACGGCTCAT TCTTCGCGGC AAGAGCACGG TCCCGCGGGC CGGCGAGCTT
TTCTCCATAT TGCGGGACGT TTTGCTCACG GTCAAACTCG ACGATCGGGA ACGGTTCCGG
CAGATGGTGC TCGAAGAAAA GGCACGACAA GAGCAAAGGC TCATTCCCGG CGGGCATCAG
ATGGTGAATC TGCGCCTGCG CGCCCACTTC GGCGAAGCGG ACTGGGCGGC CGAACAGACC
TCCGGGATCA GTTACCTCAC GTTCCTGCGC AAGCTGGTCT CCGACATCGA CGAAAACTGG
TCCGGCATTC TTGCAACGCT CGAAGATCTC CGCCACGTTC TGATCAACCG GACCGGCATG
ATCTTCAACG TGACCGCGGA TCGGTCCGAC TGGAGCCGGG TCCGCGGCGA TTTCGAACAA
TTCGTCCGGG AACTTCCGGC TCGGCCTCCC GGCAGGTGCG ACTGGCATCC GAAGCACAAC
CCCGAGCTTG AAGGCTTGCT CATCCCCTCG CAGGTCAACT ATGTAGGCAA GGGACTCGAC
CTCTACCGGC TGGGATACCG TTTCCACGGA TCGGTCCAGG TGATCACCGC CTACCTGAGA
AATTCCTGGT TGTGGGAGCA GGTTCGCGTG CAGGGAGGAG CCTACGGGGC AATGTGCCTG
TTCGATCGGA TTTCGGGGAT ACTCACCTTT GTGTCCTATC GTGACCCCAA TCTCGATCGA
ACCCTGGAAG CCTTTGACCG CGCCGCGGAT TTCCTGCGGA CCGTCAATTT GAGCGAGGAC
GAGCTCACCA AGGCGATCGT CGGCGCCATC GGCACCTTGG ATACCTATCT GCTTCCGGAC
GCCCGGGGAT ACGTCTCCAT GCTGCGGACC ATTACCGGCG ACATGGAAGA AGATCGCCAG
AGAATGCGGG ACGAAATCCT CGCGACCACC ACCCGGGATT TCAGAGATTT CGCCGAAGTC
CTGGATGCCG TCAGACATCA TGCGATCGTC AAGGTGCTCG GGTCAAAAGC CGCTGTCGAT
GACTCTCCCA TCGGCAGAAG CGGGAAGATC GAGCTCGTGA CGGTCCTGTA G
 
Protein sequence
MTVHNGFELL KQQYVPEIST EIKVLRHVGT GAQVLSLIND DENKVFGISF RTPPEDSTGV 
AHILEHSVLC GSRKFPVKEP FVELLKGSLK TFLNAFTYPD KTCYPVASQN DKDFYNLIDV
YLDAVFHPLI TPYIFQQEGW HYELESEDSS LSYKGVVFNE MKGAYSSPDN LLAEYSQQSL
FPESTYGLDS GGNPEKIPDL TYERFKAFHE RHYHPSNAYI YFYGNDDPEK RLRFLRTYLD
DFSAVPADSS VGLQPFFDAP RRIRRGFASG TEAGHGRGAK PRGMMTLNWL LPETSNATLN
LSLQVLRHIL IGMPGSPLRK ALIDSGLGDD LAGTGLENEL RQAYFSTGLK GIDTDQADHV
EKLILDTLAG LAGDGIAPEF VEAALNTVEF RLRENNAGGY PRGLVLMLRA LSTWLYDGDP
AALLAFEAPL EAVKSSAAAG KRYFEGMIER HFLQNPHRTT LILKPDPTRA DAEEARERER
LAAVRSTMSA EQLRAVVENT RELRRRQEAP DSPEALAAIP TLKREDLERT NKKIPMEETF
PEGSRLLFHD IHTNGIFYLD MAFDIHSLPQ HALPFAPLFG RALVEIGTET EDFVSLSTRI
SRRTGGIRPD VFTSAVRSSP HGAARLILRG KSTVPRAGEL FSILRDVLLT VKLDDRERFR
QMVLEEKARQ EQRLIPGGHQ MVNLRLRAHF GEADWAAEQT SGISYLTFLR KLVSDIDENW
SGILATLEDL RHVLINRTGM IFNVTADRSD WSRVRGDFEQ FVRELPARPP GRCDWHPKHN
PELEGLLIPS QVNYVGKGLD LYRLGYRFHG SVQVITAYLR NSWLWEQVRV QGGAYGAMCL
FDRISGILTF VSYRDPNLDR TLEAFDRAAD FLRTVNLSED ELTKAIVGAI GTLDTYLLPD
ARGYVSMLRT ITGDMEEDRQ RMRDEILATT TRDFRDFAEV LDAVRHHAIV KVLGSKAAVD
DSPIGRSGKI ELVTVL