Gene Sfum_4005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_4005 
Symbol 
ID4457651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp4861324 
End bp4863183 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content62% 
IMG OID639704776 
Productpeptidase C1A, papain 
Protein accessionYP_848106 
Protein GI116751419 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4870] Cysteine protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGCCG GGGAGGGGAG TATGCACTTG AAACGACTTG CTGTGGCTTG TTTCGCGTTG 
TTTTATGCCA CGTTTTTCGG TAGCGCAGGT GCGTTTGCCG AAGAACTGGC CGATGTGCAG
AAAAGAATAA AAGACAGGAA TCTCAAGTGG GTTGCTGAGC GACACCTCAA CCCCGAAAGG
AAGGGACTCG GACTCCTCAG GGACGGGTTC ACCGCCGCCG TCCCGCCTGC CGAAGGTGCA
GGCGATGTTC CCACAGGCCT GGCGACCGCC GTGGACTGGC GAAATATCGG CGCCGACAAC
GCGTTCGGGG TTGCCCCGGG CAACTACGTG TCCCGCGTGA AGAACCAGGG GAGCTGCGGC
AGTTGCTGGG CTTTCGCAAC GACGGCAATC CTTGAGTCCG CGACCCAGAT CGCCAACAAC
GATCCGATCG AACCCCTTGA TCCCGGCAGT GCCTACGATC TGTCCGAGCA GGTCATGCTC
ACCTGCAGCG GCGCCGGCAG CTGCAACGGC GGGTACGTCA CCACGGCCTC GAGCTATGCA
GCCACCACGG GATTGCTGCG GGAATTCCCC TCCGGTTGTT ACGCCTACAA CATCGGCAGC
ACAACCTGCC CCAACCCGGG TTCCTACCCC GACTGCGATC AAACCCGCTT CCGGATCGAC
GCCTGGAGCG GGGTATCGGC CACCGTCGAT GCGATGAAGA ACGCCCTGAA CACCCATGGC
CCTCTCGTGG CGACCTACGC GGTCTACAAT GACTTCTACC GCTACTACGG CAGCGGCATT
TACGAGGCCA TTTCCTGCGA TCAAACGGTC AACCCCCTCG TGGGCTATCA CGCGGTGGCG
CTGGTGGGCT ATCGGGATGC CGATGCCGCC GACCCGGTGG GGTATTTCAT CGTGAAGAAC
AGCTGGGGAG CCGCGTGGGG TGAATCGGGG TATTTCAGAA TCGCCTACTC TCAGGTCGGC
AACTGCGTGA AATTCGGGGG GACCACCCTG GCGTACTCCA AGACGGCCTG CAACGGGGCC
ATCACCGTGG ATTCTCCGGC CGAATCGGCC ACCTTGCAGG CGGGAACGAT GCACGCCATC
ACCTGGAGCG ACTCGGGGAG CATCGGCCCG TACGCGAGTA TCGATCTCTA CCAGGCAGGG
AATCGCGTCC GGACGATCCA GGCAAATGCC CTCCTGGCGG ATGGATCATT CTCGTGGCTG
GTCGACTCCG ATCTTCAGGG GCCGAACTTT TCGGTAATGG TCACGAGCAC GGCATGCAGT
TCCGCCTACG GCACGAGCGG ACCGTTTTCC ATCAACCCGG CGGCGGATTT CGAAGTGGCC
GGCACGGCGG TTTCCGGCAG TGTGGGACTT TCGGGAGTGA CCATCAGTTT CAGCAGAGTT
TCCGGGGCCG GCACGATCCC CGCCCCTGTC GTCACCGACT TCCAGGGCGG GTGGAGACAA
AGCGGGTTTC AGCAAGGGAC GGAATACCGG GCGACGCCAT CAAAAACGGG CTGCACGTTC
AGCCCGGCGT TCCTGGATTT CACCGATGCA GCGTCCAGCC TGAATTTCGC CGCGACGGAG
AACAAGATAA CCTCCGTCAT ATCTCCGACC GCCGGCTCCA TCGTGAAGGT GGGGGGGGCC
CTGCTCGTCA AGTGGACGTA CACGGGCAGC CCGGGTCCCT ATGTGACCAT CGAGGCCGTG
AACACCGCTA CCGGCGCTCG AACGGCGATC AGTTCCAAGG CCAAGATCGG CACCGGCGGG
ATCGGCTCTT ACAACTGGAG GATCGGCAAG CAGCAGGCGG CAGGGACCTA CCGGATCAAG
GTCGCGAGCA AGACCAACGG TTCCTCGGCG ACGAGCGCGG AATTCAGCAT CATAAAATAG
 
Protein sequence
MGAGEGSMHL KRLAVACFAL FYATFFGSAG AFAEELADVQ KRIKDRNLKW VAERHLNPER 
KGLGLLRDGF TAAVPPAEGA GDVPTGLATA VDWRNIGADN AFGVAPGNYV SRVKNQGSCG
SCWAFATTAI LESATQIANN DPIEPLDPGS AYDLSEQVML TCSGAGSCNG GYVTTASSYA
ATTGLLREFP SGCYAYNIGS TTCPNPGSYP DCDQTRFRID AWSGVSATVD AMKNALNTHG
PLVATYAVYN DFYRYYGSGI YEAISCDQTV NPLVGYHAVA LVGYRDADAA DPVGYFIVKN
SWGAAWGESG YFRIAYSQVG NCVKFGGTTL AYSKTACNGA ITVDSPAESA TLQAGTMHAI
TWSDSGSIGP YASIDLYQAG NRVRTIQANA LLADGSFSWL VDSDLQGPNF SVMVTSTACS
SAYGTSGPFS INPAADFEVA GTAVSGSVGL SGVTISFSRV SGAGTIPAPV VTDFQGGWRQ
SGFQQGTEYR ATPSKTGCTF SPAFLDFTDA ASSLNFAATE NKITSVISPT AGSIVKVGGA
LLVKWTYTGS PGPYVTIEAV NTATGARTAI SSKAKIGTGG IGSYNWRIGK QQAAGTYRIK
VASKTNGSSA TSAEFSIIK