Gene Rmet_0989 details

Gene Information

Locus tag: Rmet_0989
Symbol:
ID: 4037786
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007973
Strand:
Start bp: 1073842
End bp: 1074921
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 65%
IMG OID: 637976370
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_583144
Protein GI: 94309934
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
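
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based, inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop=True):
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


length = gene_length(1073842, 1074921)
print(length)                  # 1080
print(protein_length(length))  # 359
```

With the values from this record, 1074921 - 1073842 + 1 gives 1080 bp, and 1080 / 3 - 1 gives 359 aa.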


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.161245
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAAGA ACACCGACGA CCTGCGTATT CGAGAACTCA AGGAACTGCT GCCGCCCGCG 
CACCTGATCC GCGAGTTTGC TTGCTCGGAG GCTGCGTCCG ACGTGATCTA TGGCGCCCGC
CAGGCCATGC ATCGCATTCT GCACGGCATG GACGACCGCC TGATCGTCAT CATCGGCCCG
TGCTCGATTC ACGACACGCG CGCGGCCCTG GAATACGCCA AGCTGCTCAA GGTGCAGCGC
GACCGCTTCG CGGGCGAGCT CGAGATCGTG ATGCGCGTCT ACTTCGAGAA GCCGCGCACG
ACGGTGGGCT GGAAGGGCCT GATCAACGAT CCGCACATGG ATGGCAGCTT CAAGATCAAC
GACGGCCTGC GCACCGCCCG CGAACTGCTG CTGAATATCA GCGAAATGGG CGTGCCGACG
GGGACGGAAT ATCTGGACAT GATCAGCCCC CAGTACATCG CCGATCTGGT GAGCTGGGGC
GCGATCGGCG CGCGCACCAC CGAGTCGCAG GTGCATCGCG AACTCGCTTC CGGACTGTCG
TGCCCGGTCG GCTTCAAGAA CGGCACCGAC GGCAACGTGA AGATCGCCGT CGACGCGATC
AAGGCCGCCT CGCAGCCCCA CCATTTCCTG TCGGTAACCA AGGGCGGCCA CTCGGCCATC
GTGTCGACCT CGGGTAACGA GGACTGCCAC ATCATCCTGC GCGGCGGCAA GACACCGAAC
TACGACGCGG CCAGCGTGCA GGAAGCCTGC GACGCGATCT CGAAGTCCGG CCTGGCCGCA
CGCCTGATGA TCGACGCCTC GCACGCCAAC AGCAGCAAGA AGCACGAGAA CCAGATCCCG
GTCTGCGAGG ACATTGGCAA GCAGATCGCC GGCGGCGAGC AGCGCATCGT CGGCGTCATG
GTGGAATCGC ACCTGGTAGC CGGCCGACAG GACCATGTGC AGGGCACTCC GGTGGAGAAC
CTGACCTACG GCCAGTCGGT GACCGACGCC TGCATCGCCT GGGATGACTC CGTGGCCGTA
CTGGAGACGC TGGCCAACGC CGTGAAGCAG CGCCGTCTGG TGACCGGCAG CGGCAACTGA
 
Protein sequence
MLKNTDDLRI RELKELLPPA HLIREFACSE AASDVIYGAR QAMHRILHGM DDRLIVIIGP 
CSIHDTRAAL EYAKLLKVQR DRFAGELEIV MRVYFEKPRT TVGWKGLIND PHMDGSFKIN
DGLRTARELL LNISEMGVPT GTEYLDMISP QYIADLVSWG AIGARTTESQ VHRELASGLS
CPVGFKNGTD GNVKIAVDAI KAASQPHHFL SVTKGGHSAI VSTSGNEDCH IILRGGKTPN
YDAASVQEAC DAISKSGLAA RLMIDASHAN SSKKHENQIP VCEDIGKQIA GGEQRIVGVM
VESHLVAGRQ DHVQGTPVEN LTYGQSVTDA CIAWDDSVAV LETLANAVKQ RRLVTGSGN
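
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the gene sequence into protein. It is a minimal illustration, not the method the database used: the helper names are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs only in the start codons it permits, and this gene starts with ATG).

```python
# Standard codon table, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
first_30 = "ATGCTGAAGA ACACCGACGA CCTGCGTATT"
print(translate(first_30))  # MLKNTDDLRI
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared with the protein sequence and the GC content field listed in this record.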