Gene DvMF_2034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_2034 
Symbol 
ID7173953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp2520611 
End bp2522278 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content68% 
IMG OID643540551 
Productdihydroxy-acid dehydratase 
Protein accessionYP_002436445 
Protein GI218887124 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones82 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAGCA AGAAGATGAC CCATGGATTG GAGAAGGCCC CGCACCGTTC GCTGCTGCAT 
GCCCTGGGGC TGACGCGCGA GGAGATCGAG CGCCCCCTGG TGGGCGTGGT GAACGCGGCC
AACGAGGTGG TGCCCGGCCA CGTGCATTTG CACACCATTG CAGAGGCGGT GAAGGCCGGG
GTGCGCGCGG CGGGCGGCAC GCCCATGGAG TTTCCGGCCA TCGCGGTCTG TGACGGCTTG
GCCATGAACC ACGAGGGCAT GCATTTTTCG CTGCCCTCGC GCGAGATCAT TGCCGATTCC
ATCGAGATCA TGGCCACTGC GCACCCGTTC GACGCGCTGG TGTTCATCCC CAACTGCGAC
AAGTCGGTGC CCGGCATGCT GATGGCCATG CTGCGGATGG ACATTCCGTC CATCATGGTC
AGCGGCGGCC CCATGCTGGC GGGCGGCACC CTGGCGGGCC GCACCGACCT GATCAGCGTG
TTCGAGGCCG TGGGCCGCGT GCAGCGCGGC GACATGACCA TGGCCGAACT GGACGAGATG
ACCGAAACTG CCTGCCCGGG CTGCGGTTCG TGCGCGGGCA TGTTCACGGC CAACACCATG
AACTGCATGG CCGAGACCAT GGGCCTTGCC CTGCCCGGCA ACGGCACCAT CCCGGCGGTG
ACGGCGGCGC GCGTGCGCCT GGCCAAGCAC GCGGGCATGA AGGTGATGGA ACTGCTGGAA
AAGAACATCA CCCCGCGCTC CATCGTCACC CCGCGCGCCG TGGCCAACGC CGTGGCCGTG
GACATGGCGC TCGGCGGCTC CACCAACACG GTGCTGCACC TGCCCGCCGT GTTCGGCGAG
GCCGGGCTGG ACCTGACGCT GGACATCTTC GACGAGGTCA GCCGCAAGAC CCCCAACCTG
TGCAAGCTTT CCCCGGCCGG ACACCACCAC ATCCAGGATC TGCACGCCGC GGGCGGCATC
CCGGCGGTGA TGGCGGAACT GACCAGAAAG GGGCTGGTGG ACACCTCGGT CATGACCGTC
ACCGGCAAGA CCCTGGCCGA AAACCTGGCC GAGCTGAACG CGCGCGTGCT CAATCCGGAC
GTAATACGTT CTGCAGACGC TCCGTATTCG GCGCAGGGCG GCATCGCCAT CCTGAAGGGT
TCGCTGGCCC CGCAGGGCGC GGTGGTCAAG CAGTCCGCCG TGGCGCCGGA AATGATGGTG
CGCGAGGCCG TGGCCCGCGT CTTCGATTCC GAGGGCGAGG CGCACGCCGC CATCATGGGC
GGCAAGATCA ACAAGGGCGA CGCCATCATC ATCCGCTACG AAGGACCGCG CGGTGGCCCC
GGCATGCGCG AGATGCTGTC GCCCACCGCT GCCATCGCGG GCATGGGCCT GGGCGCGGAC
GTGGCCCTGA TCACCGATGG CCGCTTCAGC GGCGGCACGC GCGGCGCGGC CATCGGCCAC
GTTTCGCCGG AAGCCGCCGA TGGCGGCAAC ATCGGGCTGG TCAGGGAAGG CGACCACATC
CTCATCGACA TTCCGGCCCG CAGGCTGGAC CTGCTGGTGG ACGAGGCGGA ACTGGCCGCC
CGGCGCGAGA CCTTCGTGCC GCTGGAAAAG CCGGTCACCT CGCCCCTGCT GCGCCGCTAC
GCCCGTCAGG TGACCAGCGC GGCCACCGGG GCCATGTACC GCAAGTAG
 
Protein sequence
MRSKKMTHGL EKAPHRSLLH ALGLTREEIE RPLVGVVNAA NEVVPGHVHL HTIAEAVKAG 
VRAAGGTPME FPAIAVCDGL AMNHEGMHFS LPSREIIADS IEIMATAHPF DALVFIPNCD
KSVPGMLMAM LRMDIPSIMV SGGPMLAGGT LAGRTDLISV FEAVGRVQRG DMTMAELDEM
TETACPGCGS CAGMFTANTM NCMAETMGLA LPGNGTIPAV TAARVRLAKH AGMKVMELLE
KNITPRSIVT PRAVANAVAV DMALGGSTNT VLHLPAVFGE AGLDLTLDIF DEVSRKTPNL
CKLSPAGHHH IQDLHAAGGI PAVMAELTRK GLVDTSVMTV TGKTLAENLA ELNARVLNPD
VIRSADAPYS AQGGIAILKG SLAPQGAVVK QSAVAPEMMV REAVARVFDS EGEAHAAIMG
GKINKGDAII IRYEGPRGGP GMREMLSPTA AIAGMGLGAD VALITDGRFS GGTRGAAIGH
VSPEAADGGN IGLVREGDHI LIDIPARRLD LLVDEAELAA RRETFVPLEK PVTSPLLRRY
ARQVTSAATG AMYRK