Gene DvMF_0308 details

Gene Information

Locus tag: DvMF_0308
Symbol:
ID: 7172190
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris str. 'Miyazaki F'
Kingdom: Bacteria
Replicon accession: NC_011769
Strand:
Start bp: 355405
End bp: 356850
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 66%
IMG OID: 643538804
Product: protease Do
Protein accession: YP_002434733
Protein GI: 218885412
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
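
The start, end, and length values in this record can be cross-checked with a short Python sketch. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 355405
end_bp = 356850

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1446 bp) and protein length (481 aa).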


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value: 0.582048
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACGCT CACTTTCCCG TACCCTCGCC GCCGCTCTGG CGCTGTGCCT TGTGCTTGCA 
GCAGCCGCAC AGGCCGCGCC CATGCTGCCC GACTTTCGCG AACTGGCGAA GCAGTCGGGC
AATGCCGTGG TCAACATCAG CACCGAAAAA ACAGTGCAGG CCGCGGAAAA TCCGTTCAAC
GAACTGTTCC GCAACATGCC GCCCGGCACC CCCTTCGACA AGTTCTTCGA CCAGTTCGAA
AAATTCCATG GCCGCCAGCA GCGCCCGCAG AAGCAGCGTT CGCTGGGTTC CGGCTTCATC
ATTTCCACGG ACGGCTACAT CGTCACCAAC AACCACGTGG TTGCTGAGGC GGACGTGATC
CGCGTGAACC TGCAAGGCGC CAGCGGCAAG TCCAACTCGT ACGTGGCCAA CGTCATCGGC
ACCGACGAGG AGACCGACCT CGCCCTGCTG AAGATCAACG CGGGCAACAC CTTGCCGGTG
CTGCCCTTCG GCGATTCCGA CAAGCTGGAA GTGGGCGAAT GGCTGCTGGC CATCGGCAAC
CCCTTCGGCC TCGACCACTC GGTGACCGCG GGCATACTGA GCGCCAAGGG GCGAGACATC
CGCTCCGGCC CGTTCGACAA TTTCCTGCAG ACCGACGCCT CCATCAACCC CGGCAACAGC
GGCGGCCCGC TGTTGAACAT GAATGGCCAG GTCATCGGCA TCAACACCGC CATCATCGCC
TCCGGCCAGG GCATCGGCTT TGCCATTCCC AGCAACATGG CCGAGCGGGT CATCGCCCAG
CTGCGCGCCG AGGGCAAGGT GCGGCGCGGC TGGATCGGCG TGACCATCCA GGACGTGGAC
GAGGCAACCG CACGTGCCCT GGGCCTTGGC GAGCCGCGCG GCGCGCTGGT GGGCTCGGTG
ATGCCCGGCG AACCCGCCGA CAAGGCGGGG CTGAAGCCCG GCGACATCGT GCTGAAGGTT
GAAGGCGACG ACGTGTCCGA TTCCAGCCAA CTGCTGCGCC GCATCGCCGC GCTGAAGCCC
GGCGACACCA CCAAGCTGAC CCTGTGGCGC AACGGCCAGA CCAAGACCGT CAACCTTACC
CTTGGCGAAC GCACGGCGGA ACACCTGACC GCCCAGCGCG GCGATGCCGC CCCGGAAAAG
AGCGGCAAGG AACAGGCTTC CGCCGGGCTT GGCATGAGCG TGCGCCCCGT CAGCGCGGAA
GACGCCCGCA ACCTGAAGCT GGAAGAGGCG CGCGGCCTGC TGGTGGTTTC CGTCGAGGGC
GGCAAGCCCG CGGCCGAGGC GGACATCCGC GCCGGTGACA TCATCCTGCT GGCCAACCTG
AAGCCGGTGA ACACCGCTGC CGACCTCACC AAGGTCATCG AGCAGGACGG CAAGAAGCGC
GGCGCGGTGA TGCTGCAACT GATGCGCCGC GGCCAGACCT TCTTCCGCAC CGTGCCCCTG
GAATAG
 
Protein sequence
MARSLSRTLA AALALCLVLA AAAQAAPMLP DFRELAKQSG NAVVNISTEK TVQAAENPFN 
ELFRNMPPGT PFDKFFDQFE KFHGRQQRPQ KQRSLGSGFI ISTDGYIVTN NHVVAEADVI
RVNLQGASGK SNSYVANVIG TDEETDLALL KINAGNTLPV LPFGDSDKLE VGEWLLAIGN
PFGLDHSVTA GILSAKGRDI RSGPFDNFLQ TDASINPGNS GGPLLNMNGQ VIGINTAIIA
SGQGIGFAIP SNMAERVIAQ LRAEGKVRRG WIGVTIQDVD EATARALGLG EPRGALVGSV
MPGEPADKAG LKPGDIVLKV EGDDVSDSSQ LLRRIAALKP GDTTKLTLWR NGQTKTVNLT
LGERTAEHLT AQRGDAAPEK SGKEQASAGL GMSVRPVSAE DARNLKLEEA RGLLVVSVEG
GKPAAEADIR AGDIILLANL KPVNTAADLT KVIEQDGKKR GAVMLQLMRR GQTFFRTVPL
E
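
The sequences above are grouped into 10-letter blocks separated by spaces and line breaks. The following is a minimal Python sketch for stripping that layout and computing a GC fraction, which could then be compared with the GC content listed in the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as a small example.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used here only as a short example.
first_line = "ATGGCACGCT CACTTTCCCG TACCCTCGCC GCCGCTCTGG CGCTGTGCCT TGTGCTTGCA"

print(len(clean_sequence(first_line)))
print(round(gc_fraction(first_line), 3))
```

To check the whole gene, the full gene sequence text would be passed to `gc_fraction` in place of `first_line`.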