Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_5209 |
Symbol | mhpE |
ID | 4042070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | + |
Start bp | 1903487 |
End bp | 1904524 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637980627 |
Product | 4-hydroxy-2-ketovalerate aldolase |
Protein accession | YP_587337 |
Protein GI | 94314128 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR03217] 4-hydroxy-2-oxovalerate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.28421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.217725 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACA AGAAACTTTA TATCTCCGAT GTGACCCTGC GTGATGGTAG CCACGCGATC CGTCACCAGT ACTCGGTTCC GCAGGTGCGT GCCATCGCAC GCGCGCTCGA TGCGGCAGGC GTGGACTCGA TCGAGGTGGC GCACGGCGAC GGCCTGGCTG GCTCCAGCTT CAACTACGGA TTCGGCGCCC ACACCGACGT GGAATGGATC GCGGCAGTGG CGGAGTCCGT GCAGCGCGCA GCGGTGGCCA CGCTGCTGCT GCCGGGCATC GGTACCGTGC ACGACTTGCG CGAAGCCTAC GCCGCCGGCG CACGCGTGGT CAGGGTGGCC ACGCACTGCA CCGAGGCCGA CACGGCGCGG CAGCATATCG AGACCGCGCG GTCGATGGGG ATGAATGTCG CCGGCTTCCT GATGATGAGC CACATGATCC CGCCCGACAG GCTGGCCGGC CAGGCAAAGC TGATGGAAAG TTACGGCGCG CATTGTGTCT ACGTGGTGGA CTCCGGCGGC GCCTTGACCA TGGACGGCGT GCGCGCGCGC TTTCGTGCGT TCAAAGACGT GCTCGATCCA AAGACTGAGA CCGGCATGCA CGCGCACCAC AACCTCAGCC TGGGCGTTGC CAACAGCATC GTCGCGGTTG AGGAAGGTTG CGACCGCATC GATGCGAGCC TGGCCGGCAT GGGCGCGGGC GCGGGCAATG CGCCGCTGGA GGTCTTTATT GCCGCTGCCG AACGCATGGG ATGGCACCAC GGCTGCGATC TCTACCAGTT GATGGACGCC GCCGACGACA TCGTGCGCCC GCTGCAGGAT CGCCCCGTGC GCGTGGACCG CGAAACGCTG GCGCTCGGCT ACGCCGGCGT GTATTCGAGC TTCCTGCGCC ACGCGGAGAG CGCGGCCGGC AAGTACGGCC TGAAGACCGT CGATATCCTG GTCGAGCTGG GACGCCGCCG CATGGTGGGC GGGCAGGAAG ACATGATCGT CGATGTGGCG CTCGACCTGC AGCGGTCGGG CGGCGCGAAG AGCAGGGAGG CAGCATGA
|
Protein sequence | MTDKKLYISD VTLRDGSHAI RHQYSVPQVR AIARALDAAG VDSIEVAHGD GLAGSSFNYG FGAHTDVEWI AAVAESVQRA AVATLLLPGI GTVHDLREAY AAGARVVRVA THCTEADTAR QHIETARSMG MNVAGFLMMS HMIPPDRLAG QAKLMESYGA HCVYVVDSGG ALTMDGVRAR FRAFKDVLDP KTETGMHAHH NLSLGVANSI VAVEEGCDRI DASLAGMGAG AGNAPLEVFI AAAERMGWHH GCDLYQLMDA ADDIVRPLQD RPVRVDRETL ALGYAGVYSS FLRHAESAAG KYGLKTVDIL VELGRRRMVG GQEDMIVDVA LDLQRSGGAK SREAA
|
| |