Gene Dvul_1611 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1611 
Symbol 
ID4665004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1909457 
End bp1910905 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content64% 
IMG OID639819849 
Productprotease Do 
Protein accessionYP_967055 
Protein GI120602655 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.4051 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTACGTA CCCCCCGCTA TTTCGCATTG CTGCTGCTCA TGGTGGCGGT GGTCCTGTCA 
TCCACCGCAC AGGCGGCAAG TCTTCCGGAT TTCAGGGAAC TGGCGAAGAA CGCCGGTGCC
GCTGTCGTCA ACATCAGCAC GGAGAAGACC GTGCAGGCCC CGGAAAACCC GTTCGGAGAC
ATGCTCCGCA ACGCCCCGCA AGGGACGCCC TTCGACAGGT TCTTCGAGCA GTTCGAAAGA
TTCCACGGAA AGATGCGCCC GCAGAAGCAG CGTTCGCTCG GTTCCGGCTT CATCATCTCG
GCGGACGGCT ACATCGTCAC CAACAACCAC GTCATCGCCG ATGCGGATGT CATCCACGTC
AACATCGAGA ACGAGACGGG CAAGAGTGCG TCTTACGACG CCAAGGTTAT CGGCACTGAC
GAAGAGACCG ACCTCGCCCT GCTCAAGATC GACGCCAAGC GGCAACTGCC CGTGCTGCGC
TTCGGCGACT CTGACAGCCT CGAAGTGGGT GAATGGCTGA TGGCCATCGG CAACCCCTTC
GGCCTCGACC ACAGCGTGAC GGCGGGCATC CTCAGTGCCA AGGGGCGCGA CATCCGCTCC
GGGCCCTTCG ACAACTTCCT CCAGACCGAT GCCTCCATCA ACCCCGGCAA CAGCGGCGGC
CCCCTCATCA ACATGAAGGG TGAGGTCATC GGCATCAACA CGGCCATCGT CGCCAGCGGT
CAGGGCATCG GCTTCGCCAT CCCCAGCAAC ATGGCAGCCC GCATCATCGA CCAGCTCAAG
AGCGACAAGA AGGTGCGCCG TGGCTGGATA GGCGTGACCA TTCAGGATGT CGACGAGAAC
ACGGCCCGTG CGCTCGGTCT CGGTGAACCG CGAGGTGCTC TCGTCGGTTC CGTGATGCCC
GGAGAACCCG CCGACAAGGC CGGTATCAAG GCCGGGGACA TCCTGCTCAA GGTCGAAGGT
GAGGACATAG CCGACTCCGG TCGCCTGCTG CGCCGCGTCG CAGCACTCAA GCCCGGTGAG
ACGGCCAAGA TAACCCTCTG GCGCAACGGC CAGACCAAGA CCGTCAACCT CACCCTTGGC
GAACGCACGG CAGAGCATCT CGCCGCACAG GGCGGCACAC CGCGCCAGAC TCCCGAATCG
AAGCAGCAGG CGTCGAGCAG CCTCGGCCTT ACCGTACGCC CGCCCAACGC CGAAGAAGCC
CGCGCGCTCA AGCTTGACAG GCCGCAGGGT CTCCTCGTCA TCGCCGTCGA AGAGGGCAGG
CCCGCCGCCG ACGCAGACAT CCGCGCCGGA GACGTGGTGC TTTCCGCCAA CCTGCACCCC
GTCAACAGCA CCGCCGACCT CGCCAAGGTC GTGCAGGAGG ACGCCAAGCG CAGGGGTGCC
GTGATGTTGC AGATTCAGCG TCGCGGTCAG ACGTTCTTCC GCACCGTTCC CATCGAAGCC
GAAAAGTAG
 
Protein sequence
MVRTPRYFAL LLLMVAVVLS STAQAASLPD FRELAKNAGA AVVNISTEKT VQAPENPFGD 
MLRNAPQGTP FDRFFEQFER FHGKMRPQKQ RSLGSGFIIS ADGYIVTNNH VIADADVIHV
NIENETGKSA SYDAKVIGTD EETDLALLKI DAKRQLPVLR FGDSDSLEVG EWLMAIGNPF
GLDHSVTAGI LSAKGRDIRS GPFDNFLQTD ASINPGNSGG PLINMKGEVI GINTAIVASG
QGIGFAIPSN MAARIIDQLK SDKKVRRGWI GVTIQDVDEN TARALGLGEP RGALVGSVMP
GEPADKAGIK AGDILLKVEG EDIADSGRLL RRVAALKPGE TAKITLWRNG QTKTVNLTLG
ERTAEHLAAQ GGTPRQTPES KQQASSSLGL TVRPPNAEEA RALKLDRPQG LLVIAVEEGR
PAADADIRAG DVVLSANLHP VNSTADLAKV VQEDAKRRGA VMLQIQRRGQ TFFRTVPIEA
EK