Gene DvMF_3158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_3158 
Symbol 
ID7175104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp3982747 
End bp3984048 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content65% 
IMG OID643541694 
Productcarboxyl-terminal protease 
Protein accessionYP_002437562 
Protein GI218888241 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value0.0549247 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGTGA CGCTGTGGAC GGCCACGCTG CTCATTCTTG GCGTGCTCGC CATTTCCGGC 
GGGGCGGCGC TGGTGCCCGA CACCGTCGGT GCAGCCAGTG AGGAAGGCAA ATACGACTCG
CTGAAGCGCT TCAGCCAGGT GCTCGACCTG GTGGAGCGCT ATTACGTGCG CGACGTGCCC
CGCAAGGACC TGATCAACGG GGCCGTCAAG GGCATGCTGC AAGGGCTGGA CCCCCATTCG
ACGTTCCTGT CGGTGGAGGA GTTCAAGGAG ATGCAGGAAA GCACCTCCGG CGAATTCTTC
GGCATCGGCA TCGAAATTTC CAGCGAGAAC GGCCAGCTCA TCGTGGTGGC CCCCATCGAG
GACACCCCCG CCCACAAGGC GGGCCTGAAG AGCGGCGACA TCATCCTTGC CGTGGACGGC
GTGCCCACCC AGGACATGAC CACCCAGGAA GCGGTCACCC GCATCCGTGG CGCCAAGGGC
ACCGAAGTGG AGCTGTCCAT CCTGCACCGC GACGCCAAGG CCCCCGAAGT GGTGCGCCTG
GTGCGCGACG CCATTCCGCT CATCAGCGTC AAGTCCAAGA TGCTCGAGGA CGGCTACCAC
TGGGTGCGCC TGACCCGCTT CAGCGAACGC ACCACCGGTG AACTGGTGGA CGCGTTGAAG
GAAGCCAACA AGAAGGGCAT GAAGGGCATC ATCCTCGATC TGCGCAACAA CCCCGGCGGG
TTGCTGGACC AGGCCGTGAG CGTGTCCGAC ACCTTCCTGA AGGACGGGGT CATCGTGTCT
ATCCGTGGCC GCATGGAAGA CGCCAGCCGG GAATACCGGG CCAAGGCCCA GCCCGGCGAC
GTGACCGTGC CCATGGTGGT GCTGGTCAAC GCCGGTTCCG CCTCGGCCTC GGAAATCGTG
GCCGGTGCCC TGCGTGACCA CAACCGCGCG CTCATCCTGG GTGAACGCAC CTTCGGCAAG
GGTTCGGTGC AGAACGTCAT CCCGCTGTCC GACGGCGCGG GCCTGAAGCT GACCGTGGCC
CTGTACTACA CGCCTAATGG CCGCTCCATC CAGGCGGAAG GCGTGGAGCC CGACTTCGAG
GTGCCTTTCG AACTGCCGCG CGAGGAAGAA AAGGCCCACC GCCTGAACAT GGTGCGCGAA
AAGGATCTGA ACCGCCACCT CGAGAACGGT TCTTCCGGCA AGGATGCGCG TCCTTCGGCC
AAGGCCGCGG ACGACGTGAA GCAGGCCCTG GAAAAGGACA ACCAGCTGCG CATGGCGTTG
CAGTTCGTGA AGCGCCTGCC CCGCCTCAAG GATATCCAGT AG
 
Protein sequence
MRVTLWTATL LILGVLAISG GAALVPDTVG AASEEGKYDS LKRFSQVLDL VERYYVRDVP 
RKDLINGAVK GMLQGLDPHS TFLSVEEFKE MQESTSGEFF GIGIEISSEN GQLIVVAPIE
DTPAHKAGLK SGDIILAVDG VPTQDMTTQE AVTRIRGAKG TEVELSILHR DAKAPEVVRL
VRDAIPLISV KSKMLEDGYH WVRLTRFSER TTGELVDALK EANKKGMKGI ILDLRNNPGG
LLDQAVSVSD TFLKDGVIVS IRGRMEDASR EYRAKAQPGD VTVPMVVLVN AGSASASEIV
AGALRDHNRA LILGERTFGK GSVQNVIPLS DGAGLKLTVA LYYTPNGRSI QAEGVEPDFE
VPFELPREEE KAHRLNMVRE KDLNRHLENG SSGKDARPSA KAADDVKQAL EKDNQLRMAL
QFVKRLPRLK DIQ