Gene Information |
Locus tag | Dvul_1611 |
Symbol | |
ID | 4665004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 1909457 |
End bp | 1910905 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639819849 |
Product | protease Do |
Protein accession | YP_967055 |
Protein GI | 120602655 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
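The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either convention, but the numbers agree with both. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1909457
END_BP = 1910905


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, "bp")                  # matches the Gene Length field
    print(protein_length(n_bp), "aa")  # matches the Protein Length field
```

Under these assumptions the sketch gives 1449 bp and 482 aa, the same values listed in the Gene Length and Protein Length fields.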
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.4051 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTACGTA CCCCCCGCTA TTTCGCATTG CTGCTGCTCA TGGTGGCGGT GGTCCTGTCA TCCACCGCAC AGGCGGCAAG TCTTCCGGAT TTCAGGGAAC TGGCGAAGAA CGCCGGTGCC GCTGTCGTCA ACATCAGCAC GGAGAAGACC GTGCAGGCCC CGGAAAACCC GTTCGGAGAC ATGCTCCGCA ACGCCCCGCA AGGGACGCCC TTCGACAGGT TCTTCGAGCA GTTCGAAAGA TTCCACGGAA AGATGCGCCC GCAGAAGCAG CGTTCGCTCG GTTCCGGCTT CATCATCTCG GCGGACGGCT ACATCGTCAC CAACAACCAC GTCATCGCCG ATGCGGATGT CATCCACGTC AACATCGAGA ACGAGACGGG CAAGAGTGCG TCTTACGACG CCAAGGTTAT CGGCACTGAC GAAGAGACCG ACCTCGCCCT GCTCAAGATC GACGCCAAGC GGCAACTGCC CGTGCTGCGC TTCGGCGACT CTGACAGCCT CGAAGTGGGT GAATGGCTGA TGGCCATCGG CAACCCCTTC GGCCTCGACC ACAGCGTGAC GGCGGGCATC CTCAGTGCCA AGGGGCGCGA CATCCGCTCC GGGCCCTTCG ACAACTTCCT CCAGACCGAT GCCTCCATCA ACCCCGGCAA CAGCGGCGGC CCCCTCATCA ACATGAAGGG TGAGGTCATC GGCATCAACA CGGCCATCGT CGCCAGCGGT CAGGGCATCG GCTTCGCCAT CCCCAGCAAC ATGGCAGCCC GCATCATCGA CCAGCTCAAG AGCGACAAGA AGGTGCGCCG TGGCTGGATA GGCGTGACCA TTCAGGATGT CGACGAGAAC ACGGCCCGTG CGCTCGGTCT CGGTGAACCG CGAGGTGCTC TCGTCGGTTC CGTGATGCCC GGAGAACCCG CCGACAAGGC CGGTATCAAG GCCGGGGACA TCCTGCTCAA GGTCGAAGGT GAGGACATAG CCGACTCCGG TCGCCTGCTG CGCCGCGTCG CAGCACTCAA GCCCGGTGAG ACGGCCAAGA TAACCCTCTG GCGCAACGGC CAGACCAAGA CCGTCAACCT CACCCTTGGC GAACGCACGG CAGAGCATCT CGCCGCACAG GGCGGCACAC CGCGCCAGAC TCCCGAATCG AAGCAGCAGG CGTCGAGCAG CCTCGGCCTT ACCGTACGCC CGCCCAACGC CGAAGAAGCC CGCGCGCTCA AGCTTGACAG GCCGCAGGGT CTCCTCGTCA TCGCCGTCGA AGAGGGCAGG CCCGCCGCCG ACGCAGACAT CCGCGCCGGA GACGTGGTGC TTTCCGCCAA CCTGCACCCC GTCAACAGCA CCGCCGACCT CGCCAAGGTC GTGCAGGAGG ACGCCAAGCG CAGGGGTGCC GTGATGTTGC AGATTCAGCG TCGCGGTCAG ACGTTCTTCC GCACCGTTCC CATCGAAGCC GAAAAGTAG
|
Protein sequence | MVRTPRYFAL LLLMVAVVLS STAQAASLPD FRELAKNAGA AVVNISTEKT VQAPENPFGD MLRNAPQGTP FDRFFEQFER FHGKMRPQKQ RSLGSGFIIS ADGYIVTNNH VIADADVIHV NIENETGKSA SYDAKVIGTD EETDLALLKI DAKRQLPVLR FGDSDSLEVG EWLMAIGNPF GLDHSVTAGI LSAKGRDIRS GPFDNFLQTD ASINPGNSGG PLINMKGEVI GINTAIVASG QGIGFAIPSN MAARIIDQLK SDKKVRRGWI GVTIQDVDEN TARALGLGEP RGALVGSVMP GEPADKAGIK AGDILLKVEG EDIADSGRLL RRVAALKPGE TAKITLWRNG QTKTVNLTLG ERTAEHLAAQ GGTPRQTPES KQQASSSLGL TVRPPNAEEA RALKLDRPQG LLVIAVEEGR PAADADIRAG DVVLSANLHP VNSTADLAKV VQEDAKRRGA VMLQIQRRGQ TFFRTVPIEA EK
|
| |
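The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before use. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It is not the method the database used. The amino acid string is the standard NCBI ordering for translation table 11, alternative start codons are not handled (this gene starts with ATG), and the helper names are illustrative. To keep the example short, only the first three blocks and the last block of the gene sequence are used.

```python
BASES = "TCAG"
# Amino acids for codons in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned, non-empty sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Excerpts copied from the Gene sequence field above.
FIRST_BLOCKS = "ATGGTACGTA CCCCCCGCTA TTTCGCATTG"
LAST_BLOCK = "GAAAAGTAG"

if __name__ == "__main__":
    head = clean(FIRST_BLOCKS)
    print(translate(head))               # first ten residues of the protein
    print(translate(clean(LAST_BLOCK)))  # last two residues, stop codon dropped
    print(round(gc_fraction(head), 3))   # GC fraction of the excerpt only
```

Passing the full Gene sequence field through `clean`, then `gc_fraction` and `translate`, should allow the same comparison against the GC content and Protein sequence fields of the record.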