Gene Information |
Locus tag | DvMF_0308 |
Symbol | |
ID | 7172190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | - |
Start bp | 355405 |
End bp | 356850 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643538804 |
Product | protease Do |
Protein accession | YP_002434733 |
Protein GI | 218885412 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 0.582048 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACGCT CACTTTCCCG TACCCTCGCC GCCGCTCTGG CGCTGTGCCT TGTGCTTGCA GCAGCCGCAC AGGCCGCGCC CATGCTGCCC GACTTTCGCG AACTGGCGAA GCAGTCGGGC AATGCCGTGG TCAACATCAG CACCGAAAAA ACAGTGCAGG CCGCGGAAAA TCCGTTCAAC GAACTGTTCC GCAACATGCC GCCCGGCACC CCCTTCGACA AGTTCTTCGA CCAGTTCGAA AAATTCCATG GCCGCCAGCA GCGCCCGCAG AAGCAGCGTT CGCTGGGTTC CGGCTTCATC ATTTCCACGG ACGGCTACAT CGTCACCAAC AACCACGTGG TTGCTGAGGC GGACGTGATC CGCGTGAACC TGCAAGGCGC CAGCGGCAAG TCCAACTCGT ACGTGGCCAA CGTCATCGGC ACCGACGAGG AGACCGACCT CGCCCTGCTG AAGATCAACG CGGGCAACAC CTTGCCGGTG CTGCCCTTCG GCGATTCCGA CAAGCTGGAA GTGGGCGAAT GGCTGCTGGC CATCGGCAAC CCCTTCGGCC TCGACCACTC GGTGACCGCG GGCATACTGA GCGCCAAGGG GCGAGACATC CGCTCCGGCC CGTTCGACAA TTTCCTGCAG ACCGACGCCT CCATCAACCC CGGCAACAGC GGCGGCCCGC TGTTGAACAT GAATGGCCAG GTCATCGGCA TCAACACCGC CATCATCGCC TCCGGCCAGG GCATCGGCTT TGCCATTCCC AGCAACATGG CCGAGCGGGT CATCGCCCAG CTGCGCGCCG AGGGCAAGGT GCGGCGCGGC TGGATCGGCG TGACCATCCA GGACGTGGAC GAGGCAACCG CACGTGCCCT GGGCCTTGGC GAGCCGCGCG GCGCGCTGGT GGGCTCGGTG ATGCCCGGCG AACCCGCCGA CAAGGCGGGG CTGAAGCCCG GCGACATCGT GCTGAAGGTT GAAGGCGACG ACGTGTCCGA TTCCAGCCAA CTGCTGCGCC GCATCGCCGC GCTGAAGCCC GGCGACACCA CCAAGCTGAC CCTGTGGCGC AACGGCCAGA CCAAGACCGT CAACCTTACC CTTGGCGAAC GCACGGCGGA ACACCTGACC GCCCAGCGCG GCGATGCCGC CCCGGAAAAG AGCGGCAAGG AACAGGCTTC CGCCGGGCTT GGCATGAGCG TGCGCCCCGT CAGCGCGGAA GACGCCCGCA ACCTGAAGCT GGAAGAGGCG CGCGGCCTGC TGGTGGTTTC CGTCGAGGGC GGCAAGCCCG CGGCCGAGGC GGACATCCGC GCCGGTGACA TCATCCTGCT GGCCAACCTG AAGCCGGTGA ACACCGCTGC CGACCTCACC AAGGTCATCG AGCAGGACGG CAAGAAGCGC GGCGCGGTGA TGCTGCAACT GATGCGCCGC GGCCAGACCT TCTTCCGCAC CGTGCCCCTG GAATAG
|
Protein sequence | MARSLSRTLA AALALCLVLA AAAQAAPMLP DFRELAKQSG NAVVNISTEK TVQAAENPFN ELFRNMPPGT PFDKFFDQFE KFHGRQQRPQ KQRSLGSGFI ISTDGYIVTN NHVVAEADVI RVNLQGASGK SNSYVANVIG TDEETDLALL KINAGNTLPV LPFGDSDKLE VGEWLLAIGN PFGLDHSVTA GILSAKGRDI RSGPFDNFLQ TDASINPGNS GGPLLNMNGQ VIGINTAIIA SGQGIGFAIP SNMAERVIAQ LRAEGKVRRG WIGVTIQDVD EATARALGLG EPRGALVGSV MPGEPADKAG LKPGDIVLKV EGDDVSDSSQ LLRRIAALKP GDTTKLTLWR NGQTKTVNLT LGERTAEHLT AQRGDAAPEK SGKEQASAGL GMSVRPVSAE DARNLKLEEA RGLLVVSVEG GKPAAEADIR AGDIILLANL KPVNTAADLT KVIEQDGKKR GAVMLQLMRR GQTFFRTVPL E
|
| |
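
The fields in this record can be cross-checked against one another. The sketch below is only an illustration, not part of the source database. It assumes that the listed gene sequence is already given in coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand) and that translation table 11 uses the same amino-acid assignments as the standard code, differing only in permitted start codons. The function names and the 30-nucleotide prefix are chosen here for illustration.

```python
# Values copied from the Gene Information table of this record.
START_BP = 355405
END_BP = 356850
GENE_LENGTH_BP = 1446
PROTEIN_LENGTH_AA = 481

# First 30 nucleotides of the gene sequence and first 10 residues of the
# protein sequence, with the display spaces removed.
GENE_PREFIX = "ATGGCACGCTCACTTTCCCGTACCCTCGCC"
PROTEIN_PREFIX = "MARSLSRTLA"

# Standard amino-acid assignments in TCAG order (assumed to apply to
# translation table 11 as well); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range."""
    return end - start + 1


def translate(dna):
    """Translate a coding-strand DNA string codon by codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string."""
    dna = dna.replace(" ", "").upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # Coordinates should span exactly the reported gene length.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # Codon count minus the stop codon should equal the protein length.
    print(GENE_LENGTH_BP // 3 - 1 == PROTEIN_LENGTH_AA)
    # The start of the gene should translate to the start of the protein.
    print(translate(GENE_PREFIX) == PROTEIN_PREFIX)
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the reported GC content and the complete protein sequence.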