Gene Rmet_2804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_2804 
Symbol 
ID4039631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007973 
Strand
Start bp3051158 
End bp3052831 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content65% 
IMG OID637978203 
Productdihydroxy-acid dehydratase 
Protein accessionYP_584946 
Protein GI94311736 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.416197 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.582651 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTACA ACAAACGTTC CCAGCACATC ACGCAAGGCG TGGCGCGCTC CCCCAACCGT 
TCGATGTACT ACGCGCTCGG CTACAAGAAG GAAGACTTCG ACAAGCCGAT GGTGGGTATC
GCCAACGGCC ACTCGACGAT CACCCCCTGC AACGCCGGCC TGCAGCGCCT GGCGGACGCG
GCCGTGGATG CGATCAAGGC CTCGGACGCC AACCCGCAGA TCTTCGGCAC GCCGACGATC
TCCGACGGCA TGTCGATGGG CACCGAGGGC ATGAAGTACT CGCTGATCTC GCGTGAAGTC
ATTGCCGACT GCATCGAAAC CGCTGCCCAG GGCCAGTGGA TGGACGGCGT GGTCGTGATC
GGCGGCTGCG ACAAGAACAT GCCCGGCGGC ATGATCGCAC TGGCGCGCAC CAACGTGCCG
GGCATCTACG TCTACGGCGG CACCATCCGC CCGGGTAACT GGAAGGGCAA GGACCTGACC
ATCGTGTCGT CGTTCGAGGC CGTGGGCGAA TTCACCGCCG GCCGCATGAG CGAGGAAGAC
TTCGAGGGCG TGGAGAAGAA CGCCTGCCCG AGCACCGGCT CGTGCGGTGG CATGTACACC
GCCAACACAA TGAGCTCGTC GTTCGAGGCG CTGGGCATGT CGCTGCTGAA TTCGTCGACG
ATGGCCAACC CGGACCAGGA AAAGGTGGAC AGCGCCGCCG AATCGGCCCG CGTGCTGGTG
GAAGCCATCA AGAAGGACAT CAAGCCGCGC GACATCATCA CGCGCAAATC GATCGAGAAC
GCCGTGACGC TGATCATGGC CACGGGCGGT TCCACCAACG CGGTACTGCA TTACCTGGCC
ATCGCCCACT CGGCCGAAGT GGAATGGACC ATCGACGATT TCGAGCGCAT CCGCCGCCGC
GTGCCGGTGA TCTGCAACCT GAAGCCGTCG GGCGCCTACG TGGCCACCGA CCTGCACCGC
GCCGGTGGCA TTCCGCAAGT GATGAAGATC CTGCTGAACG CCGGTCTGCT GCATGGCGAC
TGCCTGACGA TCACGGGCCG CACGCTGGCC GAGGAGCTCG AGCACGTTCC CAATGAGCCG
CGTACCGACC AGGACGTGAT CCTGCCGATC TCGCAGGCCC TGTACAAGCA AGGCCACCTG
GCCATCCTGA AGGGCAACCT CGCCGAGGAA GGCGCAGTGG CCAAAATCAC CGGCCTGAAG
AACCCGGTCA TCACCGGCCC GGCCCGCGTG TTCGAGGACG AACAGAGCGC CATGGAAGCG
ATCCTGGCGG ACAAGATCAA CGCTGGCGAC ATCCTGGTGC TGCGCTACCT TGGCCCGAAG
GGCGGCCCGG GCATGCCTGA AATGCTGGCG CCGACTTCGG CGATCATCGG CAAGGGTCTC
GGCGAATCGG TTGGTTTCAT CACCGACGGC CGCTTCTCGG GGGGCACCTG GGGCATGGTG
GTTGGCCACG TTGCGCCGGA AGCCCATGTG GGCGGTACGA TTGCACTGGT TCAGGAAGGC
GACTCGATCA CGATCGACGC GCACCAGCTG CTGCTGCAAC TGAATGTGGC CGACGACGAG
CTGGCCCGCC GCCGCGCTGC CTGGAAACAG CCGGCGCCGC GCTACACGCG CGGCGTACTG
GCGAAGTTCG CGAGGCTGGC CTCCACGGCC AGCAAGGGCG CCGTGACGGA CTGA
 
Protein sequence
MAYNKRSQHI TQGVARSPNR SMYYALGYKK EDFDKPMVGI ANGHSTITPC NAGLQRLADA 
AVDAIKASDA NPQIFGTPTI SDGMSMGTEG MKYSLISREV IADCIETAAQ GQWMDGVVVI
GGCDKNMPGG MIALARTNVP GIYVYGGTIR PGNWKGKDLT IVSSFEAVGE FTAGRMSEED
FEGVEKNACP STGSCGGMYT ANTMSSSFEA LGMSLLNSST MANPDQEKVD SAAESARVLV
EAIKKDIKPR DIITRKSIEN AVTLIMATGG STNAVLHYLA IAHSAEVEWT IDDFERIRRR
VPVICNLKPS GAYVATDLHR AGGIPQVMKI LLNAGLLHGD CLTITGRTLA EELEHVPNEP
RTDQDVILPI SQALYKQGHL AILKGNLAEE GAVAKITGLK NPVITGPARV FEDEQSAMEA
ILADKINAGD ILVLRYLGPK GGPGMPEMLA PTSAIIGKGL GESVGFITDG RFSGGTWGMV
VGHVAPEAHV GGTIALVQEG DSITIDAHQL LLQLNVADDE LARRRAAWKQ PAPRYTRGVL
AKFARLASTA SKGAVTD