Gene Dvul_0923 details

Gene Information

Locus tag: Dvul_0923
Symbol:
ID: 4663405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 1135160
End bp: 1136446
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 63%
IMG OID: 639819146
Product: carboxyl-terminal protease
Protein accession: YP_966371
Protein GI: 120601971
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
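
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that the start and end positions are inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1135160
end_bp = 1136446
gene_length_bp = 1287
protein_length_aa = 428

# Assumption: start and end are inclusive coordinates on the replicon.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it adds no residue.
coding_codons = gene_length_bp // 3 - 1

print(span_bp == gene_length_bp)           # span matches the gene length
print(gene_length_bp % 3 == 0)             # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop match the protein
```

Under these assumptions all three checks hold for this record: 1136446 - 1135160 + 1 = 1287, and 1287 / 3 - 1 = 428.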


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.315177
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.478971
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGTTA CGTTATGGGT GGTATCGCTG GGCCTGTGTG CCGCGGTCGC CTTTTCAGGT 
GGAGCAGTCT TCGCGACCAC CGAAGAGTCC AAGTATGACG CGCTGAAGCG CTTCAGTCAG
GTGCTCGACA TCGTCGAGCG CTACTACGTG CGCGACGTGC CCCGCAAGGA CCTCATGAAC
GGAGCGGTGA AGGGCATGTT GCAGGGGCTT GACCCCCACT CCACCTTCCT CTCCCCGGAA
GAATTCAAGG AGATGCAGGA GACCACCTCT GGTGAGTTCT TCGGCATCGG CATCGAGATA
TCCAGCGAGA ACGGGCAACT CACCGTTGTG TCGCCCATCG AGGACACTCC TGCGTTCAAG
GCGGGACTCA AGGCGGGCGA CCTCATTCTC GCCGTCGATG GGCAGCCCAC GCAGGAGATG
AGCACGCAGG AGGCCGTATC GCGCATTCGC GGGCCCAAGG GCAGCGAAGT GGAACTGCTC
ATCCTGCATC GCGAAGCCAA GGCCCCCAGC ACGGTGAAAA TCGTGCGCGA CGCCATCCCC
CTCGTCAGCG TCAAGTCGAA GCAGCTTGAG CAGGGGTACG TGTGGGTGCG CCTCACCCGC
TTCAGCGAAC GTACGACCAG CGACCTGCTG GAAGCACTGC GCGAGGCGAA CAAGCGCGGG
CCCGTCAAGG GCGTGGTTCT CGACCTGCGC AACAACCCCG GCGGTCTGCT TGACCAGGCC
GTGAGCGTGT CCGACGTGTT CCTGCGTGAC GGGGGCATCG TCTCCATCCG CGGGCGCGGC
GACGACACGG GGCGTGAGTA CAACGCCAAG GCGCAGTCCA CCGACGTGAC CGCGCCCATG
GTGGTGCTCA TCAACGCCGG GTCTGCCTCC GCTTCGGAGA TCGTCGCCGG GGCCCTGCGC
GACCAGAAGC GCGCGCTTCT GGTGGGTGAA CGCAGCTTCG GCAAGGGGTC GGTGCAGAAC
GTCATCCCGC TTTCCGACGG CGCGGGACTC AAGCTCACGG TTGCACTGTA CTACACGCCC
AATGGCCGTT CCATTCAGGC CGAGGGCATC GACCCTGACA TCGAGATTCC CTTCGAAGCC
CCGCGTGAGG ACGACGCCAA ACCCATGCAG CGTTTCAACA TGTTGCGGGA GAAGGATCTT
TCGCGTCACC TGGAGAACGG TGCCGGGGGC AAGCAGGGCA AGAACGACCA GTCTGCCGAG
GTGCGTGACC TGCTTGAACG CGACAACCAG TTGCGCATGG CATTGCAGTT CGTGAAGCGG
CTGCCCGCCT TGAAGGAAAT ACGCTAG
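
A GC content figure such as the one in the record can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in the layout used here (blocks of 10 bases separated by spaces and line breaks). The function name `gc_content` is hypothetical, and the example input is only the first row of the gene sequence, so its result is not the whole-gene value.

```python
def gc_content(seq_text: str) -> float:
    """Return the fraction of G and C bases in a whitespace-formatted sequence."""
    # Drop the spaces and line breaks that separate the 10-base blocks.
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence only; paste all rows for the whole-gene value.
first_row = "ATGCGCGTTA CGTTATGGGT GGTATCGCTG GGCCTGTGTG CCGCGGTCGC CTTTTCAGGT"
print(f"{gc_content(first_row):.1%}")
```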
 
Protein sequence
MRVTLWVVSL GLCAAVAFSG GAVFATTEES KYDALKRFSQ VLDIVERYYV RDVPRKDLMN 
GAVKGMLQGL DPHSTFLSPE EFKEMQETTS GEFFGIGIEI SSENGQLTVV SPIEDTPAFK
AGLKAGDLIL AVDGQPTQEM STQEAVSRIR GPKGSEVELL ILHREAKAPS TVKIVRDAIP
LVSVKSKQLE QGYVWVRLTR FSERTTSDLL EALREANKRG PVKGVVLDLR NNPGGLLDQA
VSVSDVFLRD GGIVSIRGRG DDTGREYNAK AQSTDVTAPM VVLINAGSAS ASEIVAGALR
DQKRALLVGE RSFGKGSVQN VIPLSDGAGL KLTVALYYTP NGRSIQAEGI DPDIEIPFEA
PREDDAKPMQ RFNMLREKDL SRHLENGAGG KQGKNDQSAE VRDLLERDNQ LRMALQFVKR
LPALKEIR
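
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an assumption-laden illustration, not the database's own pipeline: it builds the codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons). It does not handle alternative start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino-acid assignments
# (shared by translation table 11). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a whitespace-formatted DNA sequence up to the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence (60 bases, 20 codons).
first_row = "ATGCGCGTTA CGTTATGGGT GGTATCGCTG GGCCTGTGTG CCGCGGTCGC CTTTTCAGGT"
print(translate(first_row))  # MRVTLWVVSLGLCAAVAFSG
```

The output matches the first 20 residues of the protein sequence listed above.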