Gene Dvul_0024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0024 
Symbol 
ID4662749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp35497 
End bp37161 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content67% 
IMG OID639818217 
Productdihydroxy-acid dehydratase 
Protein accessionYP_965475 
Protein GI120601075 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAGCA AGAAGATGAC CCACGGGCTG GAAAAGGCCC CTCACCGCTC TCTCCTCCAC 
GCCCTCGGGC TGACCCGCGA AGAACTGGCC CGCCCGCTGG TCGGCGTGGT CAACGCCGCC
AACGAAGTCG TACCGGGGCA TATCCACCTC GATGACATCG CCGAGGCGGT CAAGGCCGGA
GTCCGCGCCG CCGGGGGCAC CCCGCTCGAG TTTCCCGCCA TCGCGGTGTG CGACGGCCTT
GCCATGAACC ACGAGGGGAT GCGCTTTTCG CTGCCCTCGC GCGAACTCAT CGCCGACTCC
ATCGAAATCA TGGCCACCGC GCACCCCTTC GACGCGCTGG TGTTCATCCC CAACTGTGAC
AAGTCCGTAC CCGGCATGCT CATGGCCATG CTGCGCCTTG ACGTGCCTTC CGTCATGGTC
AGCGGCGGCC CCATGCTTGC CGGTGCCACC CTTGCCGGTC GTGCCGACCT CATCACCGTC
TTCGAAGGAG TGGGGCGCGT CCAGCGCGGC GACATGACCG AGGCCGAACT CGATGAACTG
GTCGAGGGCG CATGCCCCGG CTGCGGTTCG TGCGCGGGCA TGTTCACCGC CAACTCCATG
AACTGTCTTG CCGAGACCAT CGGCCTTGCC CTGCCGGGCA ACGGCACGAC CCCCGCCGTC
ACCGCCGCGC GCATCCGTCT TGCCAAGCAT GCGGGCATGA AGGTGATGGA GATGCTGGAG
CGCAATATCC GCCCGCGCGA CATCGTCACC GAGAAGGCCG TGGCCAACGC GGTTGCCGTG
GACATGGCCC TTGGCTGTTC CACCAACACC GTGCTGCACC TGCCCGCCGT CTTCGCCGAG
GCGGGACTCG ACCTCACCCT CGACATCTTC GACAAGGTCA GCCGCAAGAC GCCCAACCTC
TGCAAACTCT CGCCCGCCGG GCATCATCAT ATTCAGGACC TGCATGCCGC AGGGGGCATT
CCCGCGGTCA TGGCCGAACT CGACAGTATA GGGCTCATCG ACCGCAGTGC CATGACCGTG
ACCGGGCGCA CCGTGGGCGA GAATCTCGAT GCACTGGGGG CCAAGGTGCG TGACGCCGAT
GTCATCCGTT CCGTCGACGC CCCGTATTCG CCGCAGGGCG GCATCGCCAT CCTCAAGGGT
TCACTCGCGC CCGGTGGTGC GGTGGTCAAG CAGTCCGCCG TGGCCCCGGA GATGATGGTG
CGCGAGGCCG TGGCGCGTGT CTTCGACAGC GAAGAGGCCG CCTGTGAGGC TATCATGGGA
GGGCGCATCA AGGCCGGGGA CGCCATAGTC ATCCGCTACG AAGGCCCCAA GGGCGGCCCC
GGCATGCGCG AGATGCTCAC TCCCACCTCG GCCATCGCCG GTATGGGCCT CGGGGCGGAT
GTGGCCCTCA TCACCGACGG GCGCTTCAGC GGCGGCACCC GTGGCGCAGC CATAGGCCAT
GTCTCGCCGG AAGCAGCCGA AGGCGGGCCC ATCGGCCTCG TGCAGGAGGG CGACCGCATC
CGCATCGACA TCCCCGCGCG CGCCCTCGAC CTGCTGGTGG ACGAGGATGA ACTCGCCCGT
CGCAGGGCTG TCTTCGTGCC CGTCGAGAAG GAAATCACTT CCCCCCTGCT GCGCCGCTAT
GCCCGCATGG TGTCGTCAGC TGCCACGGGT GCGCGCCAGC GCTAG
 
Protein sequence
MRSKKMTHGL EKAPHRSLLH ALGLTREELA RPLVGVVNAA NEVVPGHIHL DDIAEAVKAG 
VRAAGGTPLE FPAIAVCDGL AMNHEGMRFS LPSRELIADS IEIMATAHPF DALVFIPNCD
KSVPGMLMAM LRLDVPSVMV SGGPMLAGAT LAGRADLITV FEGVGRVQRG DMTEAELDEL
VEGACPGCGS CAGMFTANSM NCLAETIGLA LPGNGTTPAV TAARIRLAKH AGMKVMEMLE
RNIRPRDIVT EKAVANAVAV DMALGCSTNT VLHLPAVFAE AGLDLTLDIF DKVSRKTPNL
CKLSPAGHHH IQDLHAAGGI PAVMAELDSI GLIDRSAMTV TGRTVGENLD ALGAKVRDAD
VIRSVDAPYS PQGGIAILKG SLAPGGAVVK QSAVAPEMMV REAVARVFDS EEAACEAIMG
GRIKAGDAIV IRYEGPKGGP GMREMLTPTS AIAGMGLGAD VALITDGRFS GGTRGAAIGH
VSPEAAEGGP IGLVQEGDRI RIDIPARALD LLVDEDELAR RRAVFVPVEK EITSPLLRRY
ARMVSSAATG ARQR