Gene Information |
Locus tag | Sfum_2005 |
Symbol | |
ID | 4459671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | - |
Start bp | 2454394 |
End bp | 2455821 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639702771 |
Product | protease Do |
Protein accession | YP_846123 |
Protein GI | 116749436 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
| |
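The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2454394
END_BP = 2455821


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1428, matches "Gene Length"
print(protein_length(length_bp))  # 475, matches "Protein Length"
```

The strand field does not enter this calculation: the length is the same whichever strand the gene lies on.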
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGTCG CGGCCTTCTT TGTGGGCGGT TTGGCGGCTT CATCGGGATT GAATCCCGGT AGACTTCTCG CGGTGCCGGA TGCGGCCCAG GCAGCGCCCG CTGAAGGGGA TCAGCCAGGA GTAACCCCAA CCAGCCCGTT CGCCACGCTG GCGGCAAAAC TCACGCCCGT GGTGGTCAAC GTCAGGGTGA CCAAAATCGA ACGGGCGGAG TTCCCCGATT TTGAAGGACC GGAGCAGCCG TTCGGAGACT TTTTCAGGCA TTTTTTCGGC GACCGGCGGG GATTTCCGAA TGTCCCGGCG CAGGGCGCAG GTTCGGGAGT GATCATCCGC GGCGACGGGT ATGTCCTGAC CAACAATCAC GTGGTTGAAG GCGCCAGGGA AGTGACCGTG ACGCTTTCCG ACAAGCAGGA ACACAAAGCG CGAATCGTCG GGCGGGATGC CAAGACCGAC CTCGCGCTTC TCAAAATTGA AGCGGGCAAA AGCCTGCCTG CCGCCAGCCT GGGCGATTCC GACCAACTCA AGGTCGGGGA TTGGGTGATG GCCATCGGCA ACCCGTTCGG TCTCAGTGAA ACGGTCACTT CCGGGATCGT CAGCGCCAAA GGCCGCGTCA TTGGGGCGGG CCCCTATGAC GACTTCATCC AGACCGATGC CTCGATCAAC CCGGGCAATT CGGGAGGACC GCTTTTCAAT ATGAAGGGCG AAGTCGTGGG GATCAACACC GCCATCATCC CGAACGCCCA GGGAATCGGA TTCGCCATTC CCGTCAACAC GGCCAAGCCG CTGATTCCTC AGCTGGAAAC CAAAGGCGAA GTGACTCGGG GGTACCTGGG AGTCAGCATC CAGTCGATCA CGCCCGATCT TGCCTCGGCA ATGGGGCTGG GTGACGGGAA GGGAGCGCTG GTGGCGGACG TCGTTGAAGG CGGTCCCGCC GACAGGGCCG GGATCCGGCG CGGGGACGTG ATCCTCGCCT TTGGAGGCAA GGACGTCAAA GACAGTCACG ATCTCTCGTT CATGGTCGCC GCGGCCCCGG TGGGCAGGGA ATCCGCGGTG ACGATCATGC GGGAGGGCGT CGAGCGGCGG CTGGACGTCA AGATCGGAAA ACAGGAATCC GAGGAAGGGG CGAAGGAGGA ATTTTCGAAA CAGGCTCACG GCAAATGGGG CCTCCAGCTC CGGGATGTGC CTCCCCGGGT TGCGGAAGAG CTCGGCCTCG AGTCGGAGCG CGGGGCACTC GTGGCCGGCG TTCTCCCGGG AAGCCCGGCG GATCGGGCCG CCCTGCGGCA GGGGGATGTC ATCCTGGAGG TCAATCGTCA GCCCGTAACA TCGGCGAGCG AGCTCAAAGA AAGGATTGCC GGGGCGGGCG AGCGGGGTGC CCTGGTTCTC CTCGTGCAGA GCAGTCGGGG GACGAGGTAC GTCGTGCTGA AGGGCTGA
|
Protein sequence | MVVAAFFVGG LAASSGLNPG RLLAVPDAAQ AAPAEGDQPG VTPTSPFATL AAKLTPVVVN VRVTKIERAE FPDFEGPEQP FGDFFRHFFG DRRGFPNVPA QGAGSGVIIR GDGYVLTNNH VVEGAREVTV TLSDKQEHKA RIVGRDAKTD LALLKIEAGK SLPAASLGDS DQLKVGDWVM AIGNPFGLSE TVTSGIVSAK GRVIGAGPYD DFIQTDASIN PGNSGGPLFN MKGEVVGINT AIIPNAQGIG FAIPVNTAKP LIPQLETKGE VTRGYLGVSI QSITPDLASA MGLGDGKGAL VADVVEGGPA DRAGIRRGDV ILAFGGKDVK DSHDLSFMVA AAPVGRESAV TIMREGVERR LDVKIGKQES EEGAKEEFSK QAHGKWGLQL RDVPPRVAEE LGLESERGAL VAGVLPGSPA DRAALRQGDV ILEVNRQPVT SASELKERIA GAGERGALVL LVQSSRGTRY VVLKG
|
| |
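The gene sequence, protein sequence, translation table and GC content fields can be cross-checked with a short script. The sketch below assumes the gene sequence is listed 5' to 3' in coding orientation (it begins with ATG even though the gene is on the minus strand) and uses the amino acid assignments of NCBI translation table 11. The helper names are hypothetical, and the code does not handle ambiguous bases or alternative start codons.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments of NCBI translation table 11, codons in TCAG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGGTGGTCG CGGCCTTCTT TGTGGGCGGT"
print(translate(first_blocks))  # MVVAAFFVGG, the first block of the protein
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the result to be compared with the protein sequence and the GC content listed in the record.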