Gene DvMF_0414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_0414 
Symbol 
ID7172300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp482956 
End bp484266 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content64% 
IMG OID643538913 
ProductPeptidase M23 
Protein accessionYP_002434839 
Protein GI218885518 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value0.0487693 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCAG CTAAGAATCT TTCCATCGTG ATCGTCTTCG CGGCCCTGCT GGGGCTGCTC 
GGCATCGGCG GCTTCATGCT CTTCCAGGAC ATGGAAGGGC CGGAAGTCAT CCTTACCCCG
GACACGGGGC GCGCCTCGCC GCATCAGGAC CTGACCCTGA CCCTGCGCGA CAAAAAGTCG
GGCGTCCGGT CCGTCACCGT CACGGTGAAA AAGAATTCGC ACTCGCTGGT CGTGCTCGAC
AGCGCCTTCA CCGAAGGTCG CCGCGAACAA CGCGTGACCT TCAACCTGAA GGATGCGGGC
CTGAAGGACG GCGCCTTCGA CCTTGAAGTG CGCGCCACCG ACACCTCGCT GGCCGGGTTC
GGCAAGGGCA ACACCACCAC CAGGGTCTAT CCCCTGCGCC TGGACACCCT GCCCCCGCGC
GTTTCGGTAA AGAGCATGCC CCCCTACGTG CGGCGCGGCG GCACCGGCTC CATCCTCTAT
TCGGTCAACG AAGAGGTGGA ACGCACCGGC GTGAAGGTGG GCGACCTGTA CTTTGCGGGG
TTCAAGCAGC CCAGCGGCGA CTACCTGTGC TTCTTCGCCT TCCCGCACTT TCTCACCGTG
GCCCAGTACG CCCCGGAAAT CACCGCCGTG GACCTTTCCG GCAACGCCAT GGCCAGCCGC
CTGGTCATCC GCCCGCTGGA CCGGGTGTTC CGACATGACA ACATCAACAT TTCCGAAAAC
TTCCTGGCCA GCAAGATGCC CGAATTCGAA CAGGACGTAC CCGGCGATCT GAGCCCGCTG
GAGCGCTTCC TGAAGGTGAA CAACGAGTTG CGGGTGTCCA ACGAGCAGAA ACTGCTGGAA
ATCGGCAAGG ATACCGCCCC CGCCATGCTC TGGCACGGGG CCTTCCTGCA ACTGCCCAAC
TCGGCCACCA GGGCCGGATT TGCCGACAAC CGCAGCTACC TGCACAACGG CCAGAAGGTG
GACAACCAGA CCCATCTCGG TCTGGACTTC GCCTCGCTGG CCATGGCCGA GGTGCCCGCC
TCCAACAGCG GGCGCGTGGT CTTTGCCGGA ACTCTCGGCA TTTACGGCAA CCTCGTCGTC
ATCGACCACG GCCTTGGTCT GCAAACCCTG TACTCGCACC TCAGCGAGAT TTCGGCCAAC
GTGGGCCAGC AGGTGAAGAA GGGCGACATC ATCGGCAAGA CCGGCACCAC CGGCATGGCA
GGCGGCGACC ACCTGCACTT CGGGGTGACC ATCGGCGGGG TGCAGGTGCA GCCGCTGGAA
TGGCTGGACC CGCACTGGAT CGAGGACAAC GTGACGGCAC GCCTGAAGTA G
 
Protein sequence
MTAAKNLSIV IVFAALLGLL GIGGFMLFQD MEGPEVILTP DTGRASPHQD LTLTLRDKKS 
GVRSVTVTVK KNSHSLVVLD SAFTEGRREQ RVTFNLKDAG LKDGAFDLEV RATDTSLAGF
GKGNTTTRVY PLRLDTLPPR VSVKSMPPYV RRGGTGSILY SVNEEVERTG VKVGDLYFAG
FKQPSGDYLC FFAFPHFLTV AQYAPEITAV DLSGNAMASR LVIRPLDRVF RHDNINISEN
FLASKMPEFE QDVPGDLSPL ERFLKVNNEL RVSNEQKLLE IGKDTAPAML WHGAFLQLPN
SATRAGFADN RSYLHNGQKV DNQTHLGLDF ASLAMAEVPA SNSGRVVFAG TLGIYGNLVV
IDHGLGLQTL YSHLSEISAN VGQQVKKGDI IGKTGTTGMA GGDHLHFGVT IGGVQVQPLE
WLDPHWIEDN VTARLK