Gene DvMF_0403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_0403 
Symbol 
ID7172289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp473238 
End bp474485 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content70% 
IMG OID643538902 
Productmetalloendopeptidase, glycoprotease family 
Protein accessionYP_002434828 
Protein GI218885507 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value0.119633 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCTGTC TCGGCATAGA AACCTCGTGC GATGAAACCG CGCTGGCGCT TGTGCGCGAC 
GGCGTGGTGG TCGCCGACGT CATGTCCTCG CAGGCCGACG TGCACGCCCT GTTCGGCGGG
GTGGTGCCGG AACTGGCCTC GCGCGAGCAT TACAGGCTCA TCGGCGCCCT GTGCGACGAA
GTGCTGCGCC GCGCGGGCGT GACCGCCGCC GACATCGACG TGGTGGCCGT GGCGCGCGGG
CCGGGCCTGC TGGGCAGCCT GCTGGTGGGC ATGGGCTTTG CCAAGGGGCT TGCCCTGGCC
ACCGGCGCGC GGCTGGTGGG GGTCAACCAC CTGCATGCCC ACCTGCTGGC GGCGGGCATT
GCCGCAGAGA TGGCGTACCC CGCGCTGGGC CTTCTGGTGT CCGGCGGGCA TACCCACATC
TACCGCATGG AATCTCCGCG CCACATGCCG CTGCTGGGGC GCACCCTGGA CGACGCGGCG
GGCGAGGCCT TCGACAAGGT GGCCAAGCAA CTCGGCCTGG CCTATCCCGG CGGGCGCACC
ATCGACGAAC TGGGCCGCAA GGGCAGGGTG AACCCCACCA TGTTTCCCCG GCCCTACGTC
GATAACGACA ACCTCGATTT CAGCTTCAGC GGCCTGAAGA CCGCAGCCGG GCTGGTGATC
CAGGCCAACC CCGCCCTGGC CGCCGCCGCG CGCGCCGTGG CAACCGGTTC GGTCGCTGTG
GCGGGTTCTG TCGACGGTGC GGCGGATCAG GCCCTGCCCG ATGTATCCGG CCTCTGCGAC
ATGTGCGCCT CGTTCAACCA CGCCGTGGCC GACACCCTGC GCATCAAGCT GGAAAGGGCG
CTGGTGCGTG AAAACGGTCC GCTGCCCGGC ACGCCCACCG CACCGGCCAG GAAAAAGCGG
AAGAAAGGCG AAGCGGTTGC CACGGATACC GATCAGGCCG CACTTGCCGC GCATGCGGGG
CAGGGCGTCA CACCCCCGGA TCCTCACCGC CCCCCCGTGC GTGCGCTGGT GGCTGCCGGC
GGTGTGGCCG CAAACTCGTA TGTGCGCGAG GCCCTGGGTG ATCTTGCCCG GCGCGCGGGC
CTGCCGCTGC TGTTGCCGTC GCCCGCCCTG TGCACCGACA ACGGCATCAT GGTGGCCCAT
GCGGGCTGGC AGCTGGCATC CCTTGGACTG GGGCACGACC TTGCGTTGGA AGCCATCCCG
CGCGGGCGCT CGATTCCCGA TGATTTCACC ATGTTCCGCA GTGTGTAG
 
Protein sequence
MLCLGIETSC DETALALVRD GVVVADVMSS QADVHALFGG VVPELASREH YRLIGALCDE 
VLRRAGVTAA DIDVVAVARG PGLLGSLLVG MGFAKGLALA TGARLVGVNH LHAHLLAAGI
AAEMAYPALG LLVSGGHTHI YRMESPRHMP LLGRTLDDAA GEAFDKVAKQ LGLAYPGGRT
IDELGRKGRV NPTMFPRPYV DNDNLDFSFS GLKTAAGLVI QANPALAAAA RAVATGSVAV
AGSVDGAADQ ALPDVSGLCD MCASFNHAVA DTLRIKLERA LVRENGPLPG TPTAPARKKR
KKGEAVATDT DQAALAAHAG QGVTPPDPHR PPVRALVAAG GVAANSYVRE ALGDLARRAG
LPLLLPSPAL CTDNGIMVAH AGWQLASLGL GHDLALEAIP RGRSIPDDFT MFRSV