Gene Rmet_1681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_1681 
Symbol 
ID4038484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007973 
Strand
Start bp1817805 
End bp1818947 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content64% 
IMG OID637977063 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_583831 
Protein GI94310621 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.145582 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGATC CACAAAAGTC CCCGCCCAGC GTTGCCGAAG GCTGGCAGGC GCCGCCGGAC 
CGCACCAGTG TGACGGACGA TGCGCGCGTG GAGGACATCA TCCCGCTGCC CCCGCCCGAG
CACTTGATCC GTTTCTTCCC CATCCGTGGC ACCCCCGTTG AATCGCTGGT CACCCAGACG
CGCCAGCGCA TCTCCCGCAT TCTGCATGGC AGTGATGACC GGCTGCTCGT GATCATGGGC
CCCTGCTCGA TCCACGACCC GCAGGCCGCG CTCGACTACG CACGGCGTCT GGCGGCCGAA
CGCGAGCGCT ACGCGGACTC GCTGGAAATC GTGATGCGGG TGTATTTCGA GAAGCCCCGC
ACCACGGTCG GCTGGAAGGG CCTGATCAAC GACCCCTACC TGGACGAGAG CTATCGCATC
GACGAAGGCC TGCGCATCGC GCGCAGCCTG CTGGTGGATA TCAACCGCCT CGGGTTGCCG
GCGGCCGGCG AGTTCCTGGA CGTCATTTCG CCCCAGTACA TCGGCGATCT GATCTGCTGG
GGCGCGATTG GCGCCCGCAC GACCGAGAGC CAGGTACACC GGGAACTGGC TTCGGGCGTC
TCCGCGCCCA TTGGCTTCAA GAATGGCACC GACGGAAACA TCAAGATCGC GATCGACGCG
ATCCAGGCCG CATCGCGCCC GCACCACTTC CTTGGCGTGC ACAAGAACGG CCAGGTCGCG
ACGGTCCATA CCAAGGGCAA CCCGGACTGC CACGTCATTC TTCGCGGCGG CAAGGCGCCC
AACTACGACG CGGAGGTTGT CGCCGCAGCG TGCAAGGAAC TGGAAGCGGC GCGGCTGCGC
AATTCGTTGA TGGTCGATTG CAGCCATGCC AACAGCAACA AGCAGCACCA ACGCCAGATC
GACGTCGCAC GCGACGTGGC GCAGCAGATC AGCGGCGGCA GCCAGTCGAT CTTCGGCCTG
ATGGTCGAGA GCCATCTGGT ACCCGGCGCG CAGAAGTTCA CCCCAGGAGA ACACAACCCA
TCGGGTCTCA CTTATGGTCA GAGCATCACG GACGCCTGCA TCGGGTGGGA GGACTCCGTG
ACGGTGCTGG AACTGCTGAG CGAGGCGGTA AATGTACGGC GTGGGGTAAA CAGGAAGGCT
TAA
 
Protein sequence
MNDPQKSPPS VAEGWQAPPD RTSVTDDARV EDIIPLPPPE HLIRFFPIRG TPVESLVTQT 
RQRISRILHG SDDRLLVIMG PCSIHDPQAA LDYARRLAAE RERYADSLEI VMRVYFEKPR
TTVGWKGLIN DPYLDESYRI DEGLRIARSL LVDINRLGLP AAGEFLDVIS PQYIGDLICW
GAIGARTTES QVHRELASGV SAPIGFKNGT DGNIKIAIDA IQAASRPHHF LGVHKNGQVA
TVHTKGNPDC HVILRGGKAP NYDAEVVAAA CKELEAARLR NSLMVDCSHA NSNKQHQRQI
DVARDVAQQI SGGSQSIFGL MVESHLVPGA QKFTPGEHNP SGLTYGQSIT DACIGWEDSV
TVLELLSEAV NVRRGVNRKA