Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_0989 |
Symbol | |
ID | 4037786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007973 |
Strand | - |
Start bp | 1073842 |
End bp | 1074921 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637976370 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_583144 |
Protein GI | 94309934 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.161245 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAGA ACACCGACGA CCTGCGTATT CGAGAACTCA AGGAACTGCT GCCGCCCGCG CACCTGATCC GCGAGTTTGC TTGCTCGGAG GCTGCGTCCG ACGTGATCTA TGGCGCCCGC CAGGCCATGC ATCGCATTCT GCACGGCATG GACGACCGCC TGATCGTCAT CATCGGCCCG TGCTCGATTC ACGACACGCG CGCGGCCCTG GAATACGCCA AGCTGCTCAA GGTGCAGCGC GACCGCTTCG CGGGCGAGCT CGAGATCGTG ATGCGCGTCT ACTTCGAGAA GCCGCGCACG ACGGTGGGCT GGAAGGGCCT GATCAACGAT CCGCACATGG ATGGCAGCTT CAAGATCAAC GACGGCCTGC GCACCGCCCG CGAACTGCTG CTGAATATCA GCGAAATGGG CGTGCCGACG GGGACGGAAT ATCTGGACAT GATCAGCCCC CAGTACATCG CCGATCTGGT GAGCTGGGGC GCGATCGGCG CGCGCACCAC CGAGTCGCAG GTGCATCGCG AACTCGCTTC CGGACTGTCG TGCCCGGTCG GCTTCAAGAA CGGCACCGAC GGCAACGTGA AGATCGCCGT CGACGCGATC AAGGCCGCCT CGCAGCCCCA CCATTTCCTG TCGGTAACCA AGGGCGGCCA CTCGGCCATC GTGTCGACCT CGGGTAACGA GGACTGCCAC ATCATCCTGC GCGGCGGCAA GACACCGAAC TACGACGCGG CCAGCGTGCA GGAAGCCTGC GACGCGATCT CGAAGTCCGG CCTGGCCGCA CGCCTGATGA TCGACGCCTC GCACGCCAAC AGCAGCAAGA AGCACGAGAA CCAGATCCCG GTCTGCGAGG ACATTGGCAA GCAGATCGCC GGCGGCGAGC AGCGCATCGT CGGCGTCATG GTGGAATCGC ACCTGGTAGC CGGCCGACAG GACCATGTGC AGGGCACTCC GGTGGAGAAC CTGACCTACG GCCAGTCGGT GACCGACGCC TGCATCGCCT GGGATGACTC CGTGGCCGTA CTGGAGACGC TGGCCAACGC CGTGAAGCAG CGCCGTCTGG TGACCGGCAG CGGCAACTGA
|
Protein sequence | MLKNTDDLRI RELKELLPPA HLIREFACSE AASDVIYGAR QAMHRILHGM DDRLIVIIGP CSIHDTRAAL EYAKLLKVQR DRFAGELEIV MRVYFEKPRT TVGWKGLIND PHMDGSFKIN DGLRTARELL LNISEMGVPT GTEYLDMISP QYIADLVSWG AIGARTTESQ VHRELASGLS CPVGFKNGTD GNVKIAVDAI KAASQPHHFL SVTKGGHSAI VSTSGNEDCH IILRGGKTPN YDAASVQEAC DAISKSGLAA RLMIDASHAN SSKKHENQIP VCEDIGKQIA GGEQRIVGVM VESHLVAGRQ DHVQGTPVEN LTYGQSVTDA CIAWDDSVAV LETLANAVKQ RRLVTGSGN
|
| |