Gene Sfum_2035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_2035 
Symbol 
ID4459666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp2492469 
End bp2495942 
Gene Length3474 bp 
Protein Length1157 aa 
Translation table11 
GC content59% 
IMG OID639702801 
Productpeptidase C11, clostripain 
Protein accessionYP_846153 
Protein GI116749466 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.762463 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.874852 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCGTA TACGCCCGAG AAAGTGGTTT CTGATCTGTG CCGTTCTGGG GTTCGTATTC 
CCTTCGCAGA TTTTTGCGGC CAGGATTGTC TCCATCACAA CCTGCGAAAG TGTCGAACCG
TCGAGCATGA AAGCCGTGGG GGTGAGAAGT CGGTTCACGC AGGACACCCG CGAAATACAT
CTTCTGGTTC AGTTTGAAGG GCTCAAGGCC GGCAGCAAGG TGAAGGGAAC ATGGATTTCC
GTGGATGCGA TCAAGACCCC CAACTACGAA ATTGACTCCT CGGAATCCCA GTTTTCGAAG
GATGGAGAAG GTACCGCGCA TTTTTCCCTC TCCAGGCCCA ATCAAGGATG GCCCGTCGGG
AACTACAAGG TGGATCTTCA CCTTGACGGA GCCCTTTTGA CGTCCGTACC CTTCGCTGTC
GCGGCATCAC CCATGGCCGC CCCGGCGTCT TCCCAATCCT CCCCTCAGTC ACCTTCCCCC
AAACTGGTTT CCCTTGTTTC ATGCGAGGCC GTGGCAAGCC CGGGACTCAG CCCGATCAAC
GTCACCGACA CCTTCGATGC CACAACCCCC GAGCTTCACG TGCTCGCAAA GGTTGAAGGC
GCCGTTCCCG GGACGAAGCT CAAGGGAATC TGGAGCGTTC TGGGACAACC GGGGGGCTCC
GACCGCGAGA TCAACGCCAC CGAGGCCGAA TTCGGGCAGG AAGGCGGGGA CACCGTGCAT
TTCCTGGTGG AAAAACCGGA GGGAGGATGG CCTGCGGGAA ACTACAGATT CGACCTGTAC
ATCAACGGCA AGACGGCGGG AAAGATTGCC TTCTCGGTCC GGGGGGGCGA TCCGGCCAGG
CCGCCTTTCA GCCTCGATCT GGGCCCCGAA CGCAGAGACC CGCAACGAAT CTGGACCATA
GCCGTGTATC TGGACGGAGA TAACGATCTC GAACCCTTCG CCCTGAAGGA CTTGAAGGAA
ATGGAGCGCG GAATTTCCGA AGAAGGCGTC GAATGCATCG TGTTTCTGGA TCGCGCCGGG
GGGGAAGGCC AGGTCGTGCG AATCCGCAAG GACATTGCGG GCTCCACCAA ATCGGAGGTC
ATCACGACCG CCGGTGAAGT GAACATGGCG GATCCCCAGG TTCTCCACAC CTTTTTGGCC
TCGGTGCTCA AGGCCTTTCC CGCACAGCAT CACGCGCTGA TTCTGTGGGA TCACGGCGGG
GGCTGGGCCG CCCACCTCAT CGACGAAAAG GCGCCGGGCG CCCCCAAAGG CCGCGACAAA
ATGACGCTCC CAAGACTGAG GGAGGCCGTC GCGGGGGCAC TGAAGGACAC CGGCCGACAA
CGGATCGATA TCATTGGGTT TGACATGTGC CTCATGGCCC AGCTCGAAGT GGCCTGCGAG
CTCTCGGGAC TTGCCGAAAT CATGTTGGCA TCGCAGGCTG TGGAACCCGG GGACGGCTGG
CCCTATGACC AGATACTGCC CTATTTCGGG AAGGGAACCA TGGGCGCCAG GCGGTTGGCC
GCTCAGATCG TCGAAAGTTA CGGAAAGTAT TACGGAGAGC GCAAAGAGCG TGTTGCCACG
CTCTCAGCCG TGGATCTCGG CGAGACGGAC AAGCTGGTGA GCTCTCTGAG CGACTTTGCG
CGAAAGATGT CGGAATCGGT TCCGGGTGCA TGGCCGGAGG TCTCGCGGTC CTTATACTAT
TCGGAGGGAT ACGCCGACCG GACGGACATT CGCAGGAAGT CAGGAGTGCT TGCGAGTGTG
GACCTGATGG ATGTATTGAA CCGCCTTCGC CTTTCAACCC CCAACTTTCC TGCACAGAAG
GAATACCAGG ACATCGTCGC GATCATGGAC AAGGCCATTA TCGCAAAACA TGCCTCGCCC
CGCCACCGGC TCAGTCACGG ACTCGCCGTA TACGCTCCCG TACGGGGCGC TCAATTCGAT
CCCGACTATC TGCAAATCCG GATGAGCAAG ACCTGTGCGT GGCAGGGCAT GTTGTCCGCC
CTCCATAAAG CTCAGGAACA GCAGCTCTCC CCTCCCAGGG TCACCGATAT CCGGGTTGTG
GACGCTCAAA CAGGCAACCC TGTGAAGGGA GGTAAACCGG GCGGGGGGTT CAAGGTTGAG
GCGACCGTGG AAGGGGAAAA TATCCTGTGG GTGCAATACA TGCAGGCGGT CAGAGACGAC
AAAAGAAACG GCTTTGCCCT TCTCGAGAAA GGCTATGTGT ACGATCCCGA ATTCTACAAG
AAGAAGGAGG GGGCCGTGGC CGATGTGGTG GACTTGATCA TGCCCGAGTT CAAGGGCAAT
CGCAACAAGG TGTCGAAGGA ATTCATCGGG CGGCATCTGA AAGTGACCGA CGGGCGGCAT
GCAGCCCGGG CCACCATCGA CGCCTCGAGC CTTTCCGACC TCCAGCATGC CGCGGTCCCC
GTCGCTCTGA AACGGCAGGG AGCGGAACAG TGTTTTGCGA CCCTGTTTTT CAACGCCGTC
ACCTGGACGG TCGCGAATGT GGTCGCCGAG CTGCCTCAAA AGGACGGAAC CGTGGCGTAT
CGTCAAGTCA AGCTGCAACC CGAAGATGAG GTCACTTTGC TCTTCGAGTT CATCTCCGAC
GACGGCAAGG CCGGCTATGT GGCCGGGGAG ACGTTGAAGT GGGGAAAGGG GCTCGAGCTT
GTCATCGACA CCGATGAACC GTCCAACCTG ACGGTGGCGA TGCGGGCGGA ATCCATAGCG
GGACAGTCGT CGTTCGCCAC CACGCAAATC AAGCTGGAAG CATTGACCAA AGAGGAGCAA
TCCTTCGTGG AGAACGCTCG AAAGGTCAAA CTCAAGGACC TCGTCGGGAC CTGGCAATGG
CACGGGTTGA AAGACGGGCA GTGGAACTCG ATCCCGCCGT TCACCGAGAT TGCGCCATCC
CCTTCCAACC CGGAAGTTCT GATCGCCAAG ATACAAAACC CGGGCGATTC CAGCTGGAAA
GTCACGCCCA TGGCGGTTCT TCTCGACACA CGGCTGATGC CGACTCTGAG GCTCATCTCC
TTCGATGACG AGGGCCGGCC GGTCGAGGCG ATGAACTTCA CGATGCTCGT GTCGCGATGG
GACCAGGGCG CACCAAGAAT GATCCTCAAA TACCTGGTCC CGAAAGGCTG GCTCCTTCTC
TGGGCGAAGC GGCAGGCTCC GCAAGGCACA GCCACCGGGC CGGCTCCCGG ACCCGGCATG
ACGGTCCCTC CTCCCGTTCC TTCCCAGGTC CAGCCGCCCT CACCGAGCGC ATCTCTCGCC
GGCCTGTGGT ACGGCCCCGA CAGAGAAGTC CTGAAGATGG GCGAGTCCAC CTATGAGCTT
TATGAGTTCA ACATGCTGGA GGACAAGGGG GTTTATGAGA TCCGCGGGAA GCAGTTGATC
ACCAGAAGCG CGATAACCGG GGAGGTGGAA CGCTTCAGTT TCAGCCTGTC GGGTCAACAG
TTGACCCTGA GAGACTCAGG AGGCCAGACG TTCCGGTTCC GCAGAAAGCA GTAG
 
Protein sequence
MSRIRPRKWF LICAVLGFVF PSQIFAARIV SITTCESVEP SSMKAVGVRS RFTQDTREIH 
LLVQFEGLKA GSKVKGTWIS VDAIKTPNYE IDSSESQFSK DGEGTAHFSL SRPNQGWPVG
NYKVDLHLDG ALLTSVPFAV AASPMAAPAS SQSSPQSPSP KLVSLVSCEA VASPGLSPIN
VTDTFDATTP ELHVLAKVEG AVPGTKLKGI WSVLGQPGGS DREINATEAE FGQEGGDTVH
FLVEKPEGGW PAGNYRFDLY INGKTAGKIA FSVRGGDPAR PPFSLDLGPE RRDPQRIWTI
AVYLDGDNDL EPFALKDLKE MERGISEEGV ECIVFLDRAG GEGQVVRIRK DIAGSTKSEV
ITTAGEVNMA DPQVLHTFLA SVLKAFPAQH HALILWDHGG GWAAHLIDEK APGAPKGRDK
MTLPRLREAV AGALKDTGRQ RIDIIGFDMC LMAQLEVACE LSGLAEIMLA SQAVEPGDGW
PYDQILPYFG KGTMGARRLA AQIVESYGKY YGERKERVAT LSAVDLGETD KLVSSLSDFA
RKMSESVPGA WPEVSRSLYY SEGYADRTDI RRKSGVLASV DLMDVLNRLR LSTPNFPAQK
EYQDIVAIMD KAIIAKHASP RHRLSHGLAV YAPVRGAQFD PDYLQIRMSK TCAWQGMLSA
LHKAQEQQLS PPRVTDIRVV DAQTGNPVKG GKPGGGFKVE ATVEGENILW VQYMQAVRDD
KRNGFALLEK GYVYDPEFYK KKEGAVADVV DLIMPEFKGN RNKVSKEFIG RHLKVTDGRH
AARATIDASS LSDLQHAAVP VALKRQGAEQ CFATLFFNAV TWTVANVVAE LPQKDGTVAY
RQVKLQPEDE VTLLFEFISD DGKAGYVAGE TLKWGKGLEL VIDTDEPSNL TVAMRAESIA
GQSSFATTQI KLEALTKEEQ SFVENARKVK LKDLVGTWQW HGLKDGQWNS IPPFTEIAPS
PSNPEVLIAK IQNPGDSSWK VTPMAVLLDT RLMPTLRLIS FDDEGRPVEA MNFTMLVSRW
DQGAPRMILK YLVPKGWLLL WAKRQAPQGT ATGPAPGPGM TVPPPVPSQV QPPSPSASLA
GLWYGPDREV LKMGESTYEL YEFNMLEDKG VYEIRGKQLI TRSAITGEVE RFSFSLSGQQ
LTLRDSGGQT FRFRRKQ