Gene Information |
Locus tag | Sfum_4005 |
Symbol | |
ID | 4457651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | + |
Start bp | 4861324 |
End bp | 4863183 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639704776 |
Product | peptidase C1A, papain |
Protein accession | YP_848106 |
Protein GI | 116751419 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4870] Cysteine protease |
TIGRFAM ID | |
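The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration, and it rests on two assumptions not stated in the record: that Start bp and End bp are 1-based inclusive coordinates, and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4861324
END_BP = 4863183
GENE_LENGTH_BP = 1860
PROTEIN_LENGTH_AA = 619


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING the stop codon is counted
    in the gene length and contributes no amino acid."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions the record is internally consistent: the span 4861324 to 4863183 covers 1860 bp, which is 620 codons, one of them the stop codon, leaving 619 residues.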
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGTGCCG GGGAGGGGAG TATGCACTTG AAACGACTTG CTGTGGCTTG TTTCGCGTTG TTTTATGCCA CGTTTTTCGG TAGCGCAGGT GCGTTTGCCG AAGAACTGGC CGATGTGCAG AAAAGAATAA AAGACAGGAA TCTCAAGTGG GTTGCTGAGC GACACCTCAA CCCCGAAAGG AAGGGACTCG GACTCCTCAG GGACGGGTTC ACCGCCGCCG TCCCGCCTGC CGAAGGTGCA GGCGATGTTC CCACAGGCCT GGCGACCGCC GTGGACTGGC GAAATATCGG CGCCGACAAC GCGTTCGGGG TTGCCCCGGG CAACTACGTG TCCCGCGTGA AGAACCAGGG GAGCTGCGGC AGTTGCTGGG CTTTCGCAAC GACGGCAATC CTTGAGTCCG CGACCCAGAT CGCCAACAAC GATCCGATCG AACCCCTTGA TCCCGGCAGT GCCTACGATC TGTCCGAGCA GGTCATGCTC ACCTGCAGCG GCGCCGGCAG CTGCAACGGC GGGTACGTCA CCACGGCCTC GAGCTATGCA GCCACCACGG GATTGCTGCG GGAATTCCCC TCCGGTTGTT ACGCCTACAA CATCGGCAGC ACAACCTGCC CCAACCCGGG TTCCTACCCC GACTGCGATC AAACCCGCTT CCGGATCGAC GCCTGGAGCG GGGTATCGGC CACCGTCGAT GCGATGAAGA ACGCCCTGAA CACCCATGGC CCTCTCGTGG CGACCTACGC GGTCTACAAT GACTTCTACC GCTACTACGG CAGCGGCATT TACGAGGCCA TTTCCTGCGA TCAAACGGTC AACCCCCTCG TGGGCTATCA CGCGGTGGCG CTGGTGGGCT ATCGGGATGC CGATGCCGCC GACCCGGTGG GGTATTTCAT CGTGAAGAAC AGCTGGGGAG CCGCGTGGGG TGAATCGGGG TATTTCAGAA TCGCCTACTC TCAGGTCGGC AACTGCGTGA AATTCGGGGG GACCACCCTG GCGTACTCCA AGACGGCCTG CAACGGGGCC ATCACCGTGG ATTCTCCGGC CGAATCGGCC ACCTTGCAGG CGGGAACGAT GCACGCCATC ACCTGGAGCG ACTCGGGGAG CATCGGCCCG TACGCGAGTA TCGATCTCTA CCAGGCAGGG AATCGCGTCC GGACGATCCA GGCAAATGCC CTCCTGGCGG ATGGATCATT CTCGTGGCTG GTCGACTCCG ATCTTCAGGG GCCGAACTTT TCGGTAATGG TCACGAGCAC GGCATGCAGT TCCGCCTACG GCACGAGCGG ACCGTTTTCC ATCAACCCGG CGGCGGATTT CGAAGTGGCC GGCACGGCGG TTTCCGGCAG TGTGGGACTT TCGGGAGTGA CCATCAGTTT CAGCAGAGTT TCCGGGGCCG GCACGATCCC CGCCCCTGTC GTCACCGACT TCCAGGGCGG GTGGAGACAA AGCGGGTTTC AGCAAGGGAC GGAATACCGG GCGACGCCAT CAAAAACGGG CTGCACGTTC AGCCCGGCGT TCCTGGATTT CACCGATGCA GCGTCCAGCC TGAATTTCGC CGCGACGGAG AACAAGATAA CCTCCGTCAT ATCTCCGACC GCCGGCTCCA TCGTGAAGGT GGGGGGGGCC CTGCTCGTCA AGTGGACGTA CACGGGCAGC CCGGGTCCCT ATGTGACCAT CGAGGCCGTG AACACCGCTA CCGGCGCTCG AACGGCGATC AGTTCCAAGG CCAAGATCGG CACCGGCGGG ATCGGCTCTT ACAACTGGAG GATCGGCAAG CAGCAGGCGG CAGGGACCTA CCGGATCAAG 
GTCGCGAGCA AGACCAACGG TTCCTCGGCG ACGAGCGCGG AATTCAGCAT CATAAAATAG
|
Protein sequence | MGAGEGSMHL KRLAVACFAL FYATFFGSAG AFAEELADVQ KRIKDRNLKW VAERHLNPER KGLGLLRDGF TAAVPPAEGA GDVPTGLATA VDWRNIGADN AFGVAPGNYV SRVKNQGSCG SCWAFATTAI LESATQIANN DPIEPLDPGS AYDLSEQVML TCSGAGSCNG GYVTTASSYA ATTGLLREFP SGCYAYNIGS TTCPNPGSYP DCDQTRFRID AWSGVSATVD AMKNALNTHG PLVATYAVYN DFYRYYGSGI YEAISCDQTV NPLVGYHAVA LVGYRDADAA DPVGYFIVKN SWGAAWGESG YFRIAYSQVG NCVKFGGTTL AYSKTACNGA ITVDSPAESA TLQAGTMHAI TWSDSGSIGP YASIDLYQAG NRVRTIQANA LLADGSFSWL VDSDLQGPNF SVMVTSTACS SAYGTSGPFS INPAADFEVA GTAVSGSVGL SGVTISFSRV SGAGTIPAPV VTDFQGGWRQ SGFQQGTEYR ATPSKTGCTF SPAFLDFTDA ASSLNFAATE NKITSVISPT AGSIVKVGGA LLVKWTYTGS PGPYVTIEAV NTATGARTAI SSKAKIGTGG IGSYNWRIGK QQAAGTYRIK VASKTNGSSA TSAEFSIIK