Gene Information |
Locus tag | DvMF_2034 |
Symbol | |
ID | 7173953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | + |
Start bp | 2520611 |
End bp | 2522278 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643540551 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_002436445 |
Protein GI | 218887124 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
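The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based, inclusive coordinates and that the stop codon is counted in the gene length but not in the protein length; the function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length(2520611, 2522278)
print(length)                  # 1668
print(protein_length(length))  # 555
```

Both results match the Gene Length (1668 bp) and Protein Length (555 aa) fields of this record.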
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 82 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAGCA AGAAGATGAC CCATGGATTG GAGAAGGCCC CGCACCGTTC GCTGCTGCAT GCCCTGGGGC TGACGCGCGA GGAGATCGAG CGCCCCCTGG TGGGCGTGGT GAACGCGGCC AACGAGGTGG TGCCCGGCCA CGTGCATTTG CACACCATTG CAGAGGCGGT GAAGGCCGGG GTGCGCGCGG CGGGCGGCAC GCCCATGGAG TTTCCGGCCA TCGCGGTCTG TGACGGCTTG GCCATGAACC ACGAGGGCAT GCATTTTTCG CTGCCCTCGC GCGAGATCAT TGCCGATTCC ATCGAGATCA TGGCCACTGC GCACCCGTTC GACGCGCTGG TGTTCATCCC CAACTGCGAC AAGTCGGTGC CCGGCATGCT GATGGCCATG CTGCGGATGG ACATTCCGTC CATCATGGTC AGCGGCGGCC CCATGCTGGC GGGCGGCACC CTGGCGGGCC GCACCGACCT GATCAGCGTG TTCGAGGCCG TGGGCCGCGT GCAGCGCGGC GACATGACCA TGGCCGAACT GGACGAGATG ACCGAAACTG CCTGCCCGGG CTGCGGTTCG TGCGCGGGCA TGTTCACGGC CAACACCATG AACTGCATGG CCGAGACCAT GGGCCTTGCC CTGCCCGGCA ACGGCACCAT CCCGGCGGTG ACGGCGGCGC GCGTGCGCCT GGCCAAGCAC GCGGGCATGA AGGTGATGGA ACTGCTGGAA AAGAACATCA CCCCGCGCTC CATCGTCACC CCGCGCGCCG TGGCCAACGC CGTGGCCGTG GACATGGCGC TCGGCGGCTC CACCAACACG GTGCTGCACC TGCCCGCCGT GTTCGGCGAG GCCGGGCTGG ACCTGACGCT GGACATCTTC GACGAGGTCA GCCGCAAGAC CCCCAACCTG TGCAAGCTTT CCCCGGCCGG ACACCACCAC ATCCAGGATC TGCACGCCGC GGGCGGCATC CCGGCGGTGA TGGCGGAACT GACCAGAAAG GGGCTGGTGG ACACCTCGGT CATGACCGTC ACCGGCAAGA CCCTGGCCGA AAACCTGGCC GAGCTGAACG CGCGCGTGCT CAATCCGGAC GTAATACGTT CTGCAGACGC TCCGTATTCG GCGCAGGGCG GCATCGCCAT CCTGAAGGGT TCGCTGGCCC CGCAGGGCGC GGTGGTCAAG CAGTCCGCCG TGGCGCCGGA AATGATGGTG CGCGAGGCCG TGGCCCGCGT CTTCGATTCC GAGGGCGAGG CGCACGCCGC CATCATGGGC GGCAAGATCA ACAAGGGCGA CGCCATCATC ATCCGCTACG AAGGACCGCG CGGTGGCCCC GGCATGCGCG AGATGCTGTC GCCCACCGCT GCCATCGCGG GCATGGGCCT GGGCGCGGAC GTGGCCCTGA TCACCGATGG CCGCTTCAGC GGCGGCACGC GCGGCGCGGC CATCGGCCAC GTTTCGCCGG AAGCCGCCGA TGGCGGCAAC ATCGGGCTGG TCAGGGAAGG CGACCACATC CTCATCGACA TTCCGGCCCG CAGGCTGGAC CTGCTGGTGG ACGAGGCGGA ACTGGCCGCC CGGCGCGAGA CCTTCGTGCC GCTGGAAAAG CCGGTCACCT CGCCCCTGCT GCGCCGCTAC GCCCGTCAGG TGACCAGCGC GGCCACCGGG GCCATGTACC GCAAGTAG
Protein sequence | MRSKKMTHGL EKAPHRSLLH ALGLTREEIE RPLVGVVNAA NEVVPGHVHL HTIAEAVKAG VRAAGGTPME FPAIAVCDGL AMNHEGMHFS LPSREIIADS IEIMATAHPF DALVFIPNCD KSVPGMLMAM LRMDIPSIMV SGGPMLAGGT LAGRTDLISV FEAVGRVQRG DMTMAELDEM TETACPGCGS CAGMFTANTM NCMAETMGLA LPGNGTIPAV TAARVRLAKH AGMKVMELLE KNITPRSIVT PRAVANAVAV DMALGGSTNT VLHLPAVFGE AGLDLTLDIF DEVSRKTPNL CKLSPAGHHH IQDLHAAGGI PAVMAELTRK GLVDTSVMTV TGKTLAENLA ELNARVLNPD VIRSADAPYS AQGGIAILKG SLAPQGAVVK QSAVAPEMMV REAVARVFDS EGEAHAAIMG GKINKGDAII IRYEGPRGGP GMREMLSPTA AIAGMGLGAD VALITDGRFS GGTRGAAIGH VSPEAADGGN IGLVREGDHI LIDIPARRLD LLVDEAELAA RRETFVPLEK PVTSPLLRRY ARQVTSAATG AMYRK
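The sequences above are shown in space-separated blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch of how a GC fraction like the one in the GC content field could be computed; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sketch assumes the sequence contains only unambiguous A, C, G, and T bases.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence from this record.
fragment = clean_sequence("ATGCGCAGCA AGAAGATGAC")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.5
```

Applied to the full gene sequence rather than this 20-base fragment, the same function would be expected to give a value close to the 68% reported in the record.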