Gene Rmet_5209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_5209 
SymbolmhpE 
ID4042070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007974 
Strand
Start bp1903487 
End bp1904524 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content67% 
IMG OID637980627 
Product4-hydroxy-2-ketovalerate aldolase 
Protein accessionYP_587337 
Protein GI94314128 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR03217] 4-hydroxy-2-oxovalerate aldolase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.28421 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.217725 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACA AGAAACTTTA TATCTCCGAT GTGACCCTGC GTGATGGTAG CCACGCGATC 
CGTCACCAGT ACTCGGTTCC GCAGGTGCGT GCCATCGCAC GCGCGCTCGA TGCGGCAGGC
GTGGACTCGA TCGAGGTGGC GCACGGCGAC GGCCTGGCTG GCTCCAGCTT CAACTACGGA
TTCGGCGCCC ACACCGACGT GGAATGGATC GCGGCAGTGG CGGAGTCCGT GCAGCGCGCA
GCGGTGGCCA CGCTGCTGCT GCCGGGCATC GGTACCGTGC ACGACTTGCG CGAAGCCTAC
GCCGCCGGCG CACGCGTGGT CAGGGTGGCC ACGCACTGCA CCGAGGCCGA CACGGCGCGG
CAGCATATCG AGACCGCGCG GTCGATGGGG ATGAATGTCG CCGGCTTCCT GATGATGAGC
CACATGATCC CGCCCGACAG GCTGGCCGGC CAGGCAAAGC TGATGGAAAG TTACGGCGCG
CATTGTGTCT ACGTGGTGGA CTCCGGCGGC GCCTTGACCA TGGACGGCGT GCGCGCGCGC
TTTCGTGCGT TCAAAGACGT GCTCGATCCA AAGACTGAGA CCGGCATGCA CGCGCACCAC
AACCTCAGCC TGGGCGTTGC CAACAGCATC GTCGCGGTTG AGGAAGGTTG CGACCGCATC
GATGCGAGCC TGGCCGGCAT GGGCGCGGGC GCGGGCAATG CGCCGCTGGA GGTCTTTATT
GCCGCTGCCG AACGCATGGG ATGGCACCAC GGCTGCGATC TCTACCAGTT GATGGACGCC
GCCGACGACA TCGTGCGCCC GCTGCAGGAT CGCCCCGTGC GCGTGGACCG CGAAACGCTG
GCGCTCGGCT ACGCCGGCGT GTATTCGAGC TTCCTGCGCC ACGCGGAGAG CGCGGCCGGC
AAGTACGGCC TGAAGACCGT CGATATCCTG GTCGAGCTGG GACGCCGCCG CATGGTGGGC
GGGCAGGAAG ACATGATCGT CGATGTGGCG CTCGACCTGC AGCGGTCGGG CGGCGCGAAG
AGCAGGGAGG CAGCATGA
 
Protein sequence
MTDKKLYISD VTLRDGSHAI RHQYSVPQVR AIARALDAAG VDSIEVAHGD GLAGSSFNYG 
FGAHTDVEWI AAVAESVQRA AVATLLLPGI GTVHDLREAY AAGARVVRVA THCTEADTAR
QHIETARSMG MNVAGFLMMS HMIPPDRLAG QAKLMESYGA HCVYVVDSGG ALTMDGVRAR
FRAFKDVLDP KTETGMHAHH NLSLGVANSI VAVEEGCDRI DASLAGMGAG AGNAPLEVFI
AAAERMGWHH GCDLYQLMDA ADDIVRPLQD RPVRVDRETL ALGYAGVYSS FLRHAESAAG
KYGLKTVDIL VELGRRRMVG GQEDMIVDVA LDLQRSGGAK SREAA