Gene DvMF_3039 details


Gene Information

Locus tag: DvMF_3039
Symbol:
ID: 7174984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris str. 'Miyazaki F'
Kingdom: Bacteria
Replicon accession: NC_011769
Strand:
Start bp: 3836186
End bp: 3837445
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 70%
IMG OID: 643541575
Product: peptidase M24
Protein accession: YP_002437444
Protein GI: 218888123
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
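
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 3836186
end_bp = 3837445

# An inclusive interval spans end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the final (stop) codon is assumed not to be
# counted in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints "1260 419"
```

Both values agree with the Gene Length and Protein Length fields listed in the record.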


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value: 0.0333665
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCACG CCCATGCTTC GCACGCCCCT GCTCCCCGCC CGTTCACAGC AGCCGAAAGA 
CTGCCCGCCG AAGAAGCCCG CCTGCGCCAG TCCCGCACGC TGGACATCCT TGCCGCGCAC
GCCCCCGATG CGGGCGGGCT GATGGCCTTC TCGCGCGTGG CCATCTATTA CCTTACCGGC
ACGGCGGGCA ACGGCGTGCT GTGGCTGCCC CGCGACGGCG AACCGGTGCT GCTGGTGCGC
AAGGGCGAGG AACGCGCCCG GCTGGAAAGC CCGCTGGCCA ACATCGTGTC CTTCCGTTCC
TACGGCGACC TGACCGGCCT GTGCGCCGAC GCGGGCAGCC CGCTGGCCCC GGTGGTGGCG
GCGGAAATGT CCGGCCTGCC GTGGTCGCTG GCCGCCATGC TGCAACAGCG GCTGACGGGG
GTGGCCTTCG TGCCGGGCGA CGCGGTGCTT ACCCGTGCCC GCGCCCGCAA GACCGAATGG
GAGCTGGTCA AGCTGCGCCT CGCCGGTGCC CGTCACCACG AATGCCTGCA CGACGTGCTG
CCCGGCCTGC TGCGCCCCGG CATGACCGAG CGCGAGATTT CGCACCTGTC GTGGCAGGTG
TTCTTCAGCC GGGGGCACGG CGGGCTCATG CGCATGGGCG CGCCGGGCGA GGAATGCTTT
CTGGGGCACA TCGCCGCTGG CGAAAACGGC AACTACCCCA GCCATTTCAA CGGCCCGCTG
GGCCTCAAGG GCGAGCACCC GGCCATCCCG TTCATGGGGT ACGCGGGCTC GGTATGGCGG
CATGGCACCC CGCTGGCGCT GGACATCGGC TTCTGCCTCG AAGGGTACCA CACCGACAAG
ACCCAGCTGT ACTGGGCGGG CAACGCCGCC TCCATCCCCG ACGCGGTGCG CCGCTCCCAC
GACCTGTGCA TCGAGGTGCA GGCCCGCGCG GCGGAACGGC TGCGCCCCGG CGAAATTCCC
TCGGACATCT GGCGATCCAC GCTGGATATC GTGGATCGCG CGGGCCTGTC CGAAGGGTTC
ATGGGGCTGG GCTCCAACAA GGTCACCTTT CTGGGGCACG GCATCGGCCT GACCATCGAC
GAGTACCCGG TCATCGCCCA CCGCTTCGAC GAGCCGCTGG AAGAAGGCAT GACCATCGCC
CTGGAACCCA AGATGGGCAT TGCCGGGGTG GGCATGGTGG GGGTGGAGAA CACCTTCGAG
GTTGCCGCCG GGGGTGGTCG CGCCCTGACC GGCACGGCCT ACGACATGGT CTGCGTGTAG
 
Protein sequence
MPHAHASHAP APRPFTAAER LPAEEARLRQ SRTLDILAAH APDAGGLMAF SRVAIYYLTG 
TAGNGVLWLP RDGEPVLLVR KGEERARLES PLANIVSFRS YGDLTGLCAD AGSPLAPVVA
AEMSGLPWSL AAMLQQRLTG VAFVPGDAVL TRARARKTEW ELVKLRLAGA RHHECLHDVL
PGLLRPGMTE REISHLSWQV FFSRGHGGLM RMGAPGEECF LGHIAAGENG NYPSHFNGPL
GLKGEHPAIP FMGYAGSVWR HGTPLALDIG FCLEGYHTDK TQLYWAGNAA SIPDAVRRSH
DLCIEVQARA AERLRPGEIP SDIWRSTLDI VDRAGLSEGF MGLGSNKVTF LGHGIGLTID
EYPVIAHRFD EPLEEGMTIA LEPKMGIAGV GMVGVENTFE VAAGGGRALT GTAYDMVCV
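
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The sketch below shows one way to do that and to compute a GC fraction of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last rows of the listing are used here; the full listing would need to be pasted in to reproduce a whole-gene figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last rows of the gene sequence, copied from the listing above.
first_row = "ATGCCCCACG CCCATGCTTC GCACGCCCCT GCTCCCCGCC CGTTCACAGC AGCCGAAAGA"
last_row = "GTTGCCGCCG GGGGTGGTCG CGCCCTGACC GGCACGGCCT ACGACATGGT CTGCGTGTAG"

first = clean_sequence(first_row)
last = clean_sequence(last_row)

# Stop codons under translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}

print(len(first))                    # bases in the first row
print(first.startswith("ATG"))       # does the gene begin with ATG?
print(last[-3:] in STOP_CODONS)      # does the gene end with a stop codon?
print(round(gc_fraction(first), 2))  # GC fraction of the first row only
```

Applying `gc_fraction` to the cleaned full sequence, rather than a single row, is what would be compared against the GC content field.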