Gene Sfum_0222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_0222 
Symbol 
ID4461298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp263940 
End bp265433 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content60% 
IMG OID639700977 
Productpeptidase C1A, papain 
Protein accessionYP_844358 
Protein GI116747671 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4870] Cysteine protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.107237 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0912112 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTCAC CCGGAAAGTA TGCAGCGTGG AGGATACTCT GCACCGTCAG CCTGGTGGTC 
GTCATCGGTG TTTTCTTCGC GGTGTCGCCG GGCGTGTCGC AGGCGCAACA GCTGCGGCAG
GCACCTTTGA ACCCGGATTT TCTGAAATAC AGGGAGCAGC TCGGACAGGG AAGAAGCCCT
CTGAGAGTCA CCGGGGAGGG ACATGCCCTG GGGTACATTC CCCCTCCCAT CGACCTTTCT
TATGCGAAGA ACCTCCCGGA TGCCGACAGC CGGGCGGCCG CGGTCGAGGC GGTGACCTAT
CCGGCGACCT ACGACCTGCG CACCCAGCAT CGGGTCACCT CGGTCAGGGA CCAGGGGGAC
TGCGGTTCGT GCTGGGCATT CGCCACCTAC GCTTCCGTGG AATCCAAGCT CAAGGGCGCT
TCCGGAACCG GTCCGTCGAC CAACTATTCC GAACAGCATC TCAACGCGAC CCACGGTTTT
GACTGGGCGG AATGCGACGG CGGGAACGAT TTCATTTCCA TGGCCTACCT GGGACGCTGG
AGCGGTCCCG TCAATGAAGC GGATGTTCCG TATCCGTACG CCGTCGATGA GACGGCGGCC
GTGGTGAGGA AGCATATCCA GGACGTGGAC CAGATCAAGG ACCGCGCCAA TTATACCGAC
AACAACAGGA TCAAGGCGGC CGTAACCAAT CACGGCGCTC TTTACATTTC CTTCAAATGG
CTCGATACCC GCTACAATGC GGCCAAATAC GCCTACTGGA ACAACGGGAC GAGCGGCGAA
GGTCATGCCG TGGCCATCAT CGGCTGGAAC GACAATTATT CCAGAACGAA CTTCAAGAAG
GCCGGCACCG CACTGCCCGG CGGAAACGGC GCGTTTCTCG TGAAGAACAG CTGGGGAACC
ACCTGGGGCA ACAAGGGCTA CTTCTGGATG TCCTACTACG ACAAATCGTT GCAGACCGGC
ACGTCGTTCA GGGGGATTCA AGCGACATCT AACTACAAAC GCAGCTACCA GTACGATCCC
CTTGGCTGGG TCACCAGTCT GGGCGTCGAC ACAACGGTGT TGTGGGGCGC CAACATCTTC
ACCGCCAGAT CCGATGCCCC CGCGATCAAG GCCGTCAGCT TCTACACGGG AGCACCGAAT
ACCAAGGTGA CGATATATGT CCGCAAAAAC GTGTCGGCGA CCAACCCGAA AAGTGGGACC
CAGGTCGGAG CGGCGGTGAC GAAGACCATC CCGACCATGG GATACCACAC GGTCGTCTTC
TCCACGCCGA AGGCGGTCAC TGCCGGCAAG AAATTCTCGG TCCTGATCAA GTTCAATACC
CCGGGATTCA AATATCCTCT GCCGGTCGAA CTTGCCGCGG CCGACTATTC CAGCGGCGCC
ACCGCCTCCG CGAATCAGAG CTTCTACTCC ACGGACGGCG TTGCCTGGAC CGACATCACC
ACCTATGATT CGACCTGCAA CGTGTGCATC AAGGCGTTTG GCAGGGCTTC CTGA
 
Protein sequence
MRSPGKYAAW RILCTVSLVV VIGVFFAVSP GVSQAQQLRQ APLNPDFLKY REQLGQGRSP 
LRVTGEGHAL GYIPPPIDLS YAKNLPDADS RAAAVEAVTY PATYDLRTQH RVTSVRDQGD
CGSCWAFATY ASVESKLKGA SGTGPSTNYS EQHLNATHGF DWAECDGGND FISMAYLGRW
SGPVNEADVP YPYAVDETAA VVRKHIQDVD QIKDRANYTD NNRIKAAVTN HGALYISFKW
LDTRYNAAKY AYWNNGTSGE GHAVAIIGWN DNYSRTNFKK AGTALPGGNG AFLVKNSWGT
TWGNKGYFWM SYYDKSLQTG TSFRGIQATS NYKRSYQYDP LGWVTSLGVD TTVLWGANIF
TARSDAPAIK AVSFYTGAPN TKVTIYVRKN VSATNPKSGT QVGAAVTKTI PTMGYHTVVF
STPKAVTAGK KFSVLIKFNT PGFKYPLPVE LAAADYSSGA TASANQSFYS TDGVAWTDIT
TYDSTCNVCI KAFGRAS