Gene DvMF_1929 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_1929 
Symbol 
ID7173847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp2379495 
End bp2380565 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content67% 
IMG OID643540445 
Productpeptidase M24 
Protein accessionYP_002436340 
Protein GI218887019 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones94 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCAC AACGCTACGA GGCGCGGCGC GAAACCCTGC GCGCCGCCAT GCGCGAAAAA 
GGCCTGTCCG CCCTGCTGGT AAGCCACGCG GCCAACCGGT TCTACCTTTC CGGCTTCGAA
CTGCACGACG TGCAGCTGAA CGAGAGCGCC GGGTACCTCA TCGTCACCGC CGACGGCAAC
GACTGGCTGT GCACCGACCC CCGCTACCTC GACGCGGCCC GCCGCCTGTG GCCCGAAGAG
CGCGTGTTCA TCTATTCCGG CGATGCGCCG GGCCAGATCA ACGGCCTGCT CAAGGACAAG
GTGCGCGGCA CCGTGGGCTT CGAGGCGCGT GCCGTGACCC TGGACTTCTT CGACAAGGTC
TCGCCCGGCC TGACCATGGA ACGGGCCGAC GGCATGGTCG AGGAAATGCG GGTGATCAAG
GAACCCGAGG AAATCGAGCT GATGCGCCGT TCCGCCGCGC TGAACCACCA GCTCATGGAA
TGGGTGCCCA GCATCCTCGT GCCCGGTCGC ACCGAAGCGG AAATCGCCTG GGACATCGAA
AAGTTCTTCC GCGAACATGG CGCCAGCGAA CTGGCCTTCT CCAGCATCGT GGGCGTTGGC
CCCAACGCCG CCCTGCCCCA CTACGCCCCC GGCGACGTGC CCCTGACCGA AAACTGCCCG
GTGCTGGTGG ACGTGGGCGC GCGGCTGGAC CTGTACAACT CGGACCAGAC CCGCACCTTC
TGGGTGGGCG ACAAGCCCGC CGACCACTTC ACCCGCGCGC TGGAACAGAC CAAGGCCGCC
CAGGCGGAGG CCATAAGGAT CATGCGCCCC GGCCTGCCAG TGGCCGACGC CTACCGCGCC
GCGCGCGCCC ACTTCGAGGC GCAGGGCGTG GCCGCCCACT TCACCCACGC GCTGGGGCAC
GGCATAGGCC TCGAAACCCA CGAGCCGCCC AGCCTGAACC CGCGCAACGA AATGATCCTG
AAGCCCGGCA TGATCGTGAC CGTGGAACCC GGCCTGTACT ACCCCGAATG GGGCGGCATC
CGCTGGGAAT ACATGGTGCT GGTCACCGGC GACGGGGTGG AGATTCTCTA G
 
Protein sequence
MNAQRYEARR ETLRAAMREK GLSALLVSHA ANRFYLSGFE LHDVQLNESA GYLIVTADGN 
DWLCTDPRYL DAARRLWPEE RVFIYSGDAP GQINGLLKDK VRGTVGFEAR AVTLDFFDKV
SPGLTMERAD GMVEEMRVIK EPEEIELMRR SAALNHQLME WVPSILVPGR TEAEIAWDIE
KFFREHGASE LAFSSIVGVG PNAALPHYAP GDVPLTENCP VLVDVGARLD LYNSDQTRTF
WVGDKPADHF TRALEQTKAA QAEAIRIMRP GLPVADAYRA ARAHFEAQGV AAHFTHALGH
GIGLETHEPP SLNPRNEMIL KPGMIVTVEP GLYYPEWGGI RWEYMVLVTG DGVEIL