Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_0834 |
Symbol | |
ID | 4037625 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007973 |
Strand | + |
Start bp | 911378 |
End bp | 912733 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637976210 |
Product | D-glucarate dehydratase |
Protein accession | YP_582989 |
Protein GI | 94309779 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR03247] glucarate dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.608839 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0271686 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGCCG CTTCCCTTCA AGGCGCCGCT GGTGCCACGC CTGTCGTCAC TGAACTGACC GTTGTCCCCG TCGCCGGCCA CGACAGCATG CTGATGAACC TGTCGGGGGC CCACGGCCCG TACTTCACCC GCAACATCGT GATTCTGCGC GACAGCGCAG GCAATACCGG CGTTGGCGAA GTGCCGGGCG GCGAGGGCAT TCGCAAGACG CTCGAGGACG CGCGCCCGCT CGTGGTGGGT CAACCGATCG GCCAGTATCA GGCGATGCTG AACAAGGTTC GAGCCACGTT CGCGAGCCGC GATGCCGGGG GACGTGGTCT GCAGACCTTC GACCTGCGCA TCACGATCCA TGCCGTGACC GCGCTTGAAG CTGCGCTGCT CGATCTGCTC GGCAAGCACC TTCAGGTGCC CGTGGCGGCC CTGCTCGGCG AAGGCCAGCA GCGCGACGCG GTGGAGATGC TGGGCTACCT GTTCTACGTC GGTGATCGCC AGCGTACGAC GCTTGACTAT CGCACCGAGC CCGATGCGGA CAACGAATGG TTCCGCCTGC GCAATGAAGT GGCGATGGAT GCGAAAGGCG TGGTGCGACT GGCCGAGGCC GCGTACGAGC GCTACGGCTT CAACGATTTC AAGCTCAAGG GTGGCGTGCT GAGCGGTGAC GAGGAAATGG AGGCGATCCT CGCGCTGCAT GAGCGTTTTC CGAAAGCACG CGTCACGCTC GACCCGAACG GCGGCTGGCT GCTGGCCGAC GCGATCCGCC TCTGCCGCGA CAAGCATGGC GTGCTGGCCT ATGCGGAGGA TCCCTGCGGC GCCGAGGACG GCTATTCGGG CCGCGAGATC ATGGCCGAGT TCCGCACTGC CACCGGACTA CCTACGGCCA CCAACATGAT CGCCACCGAC TGGCGTCAGA TGGGTCATGC GGTGCGCCTG CATTCGGTCG ACATTCCGCT GGCCGATCCG CACTTCTGGA CCATGCAGGG CTCGGTCCGC GTGGCGCAGA TGTGCGCGGA GTGGGGGCTG ACCTGGGGCT CGCACTCGAA CAATCACTTC GATATTTCGC TGGCGATGTT CACGCATGTG GCCGCCGCCG CGCCGGGCCG CGTGACGGCA ATCGACACGC ACTGGATCTG GCAGGACGGT GAGCACCTGA CACGCAACCC GCTCAGGATC GAAGGTGGCC TGGTGCAGGT ACCGAAGACA CCTGGTCTGG GCGTGGAGCT GGATATGGAT GCGCTCGCGC GCGCCAACCG CCTCTATCAG GAAAAGGGAC TCGGCGCGCG CGACGATGCG ATCGCGATGC AGTTCCTGAT CCCTGGCTGG AAGTTCAACA ACAAGATGCC CTGCATGGTG CGCTGA
|
Protein sequence | MNAASLQGAA GATPVVTELT VVPVAGHDSM LMNLSGAHGP YFTRNIVILR DSAGNTGVGE VPGGEGIRKT LEDARPLVVG QPIGQYQAML NKVRATFASR DAGGRGLQTF DLRITIHAVT ALEAALLDLL GKHLQVPVAA LLGEGQQRDA VEMLGYLFYV GDRQRTTLDY RTEPDADNEW FRLRNEVAMD AKGVVRLAEA AYERYGFNDF KLKGGVLSGD EEMEAILALH ERFPKARVTL DPNGGWLLAD AIRLCRDKHG VLAYAEDPCG AEDGYSGREI MAEFRTATGL PTATNMIATD WRQMGHAVRL HSVDIPLADP HFWTMQGSVR VAQMCAEWGL TWGSHSNNHF DISLAMFTHV AAAAPGRVTA IDTHWIWQDG EHLTRNPLRI EGGLVQVPKT PGLGVELDMD ALARANRLYQ EKGLGARDDA IAMQFLIPGW KFNNKMPCMV R
|
| |