Gene Rmet_5212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_5212 
Symbol 
ID4042073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007974 
Strand
Start bp1905771 
End bp1906772 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content63% 
IMG OID637980630 
Productaminocarboxymuconate-semialdehyde decarboxylase 
Protein accessionYP_587340 
Protein GI94314131 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.473622 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.325543 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAA TCGATATGCA TGCCCACTTC TTTCCGCGCA TCACGCGGGA AGAAGCCGCC 
GCGCTGGACG CGGACAACGC GCCATGGCTC GCCGTCGACA GCGATGGCGA ATCGGGCCAC
ATCATGGCCG GCGACAGACG TTTCCGGCCC GTCTATCGCG CGCTATGGGA CCCCGCGCTG
CGCATCGAGG AAATGGACCG CAACGGACTG GACATGCAGA TCGTCTGCGC CACGCCGATC
ATGTTCGGTT ATGGATACGA CGCCAGCGCC GCAGCCACGT GGGCGCGCCG GATGAACGAC
CTCGCGCTCG AGCATTGCGC CTATCGCCCG CAGCGCCTGA AGGCGCTGGC ACAGGTGCCG
CTGCAGGACC TGGACCTGGC GTGCATCGAA GCCTCACGCG CCAGGGAGTC CGGCCACGTT
GGCGTGCAAA TCGGCAATCA CCTGGGTCCG CACGACCTGG ATGACGAGCG CCTGGTCAGG
TTCCTGGTGC ATTGCGCGAA CAACGATATT CCGGTGCTGG TGCATCCATG GGACATGATG
ACCGACGGGC GCATGAAAAA ATGGATGCTG CCGTGGCTGG TATCGATGCC GGCGGAAACG
CAACTCGGCA TCCTCTCGTT GATCCTGTCC GGCGCGTTTG AGCGGATTCC GGAAACGCTG
AAGCTCTGCT TCGCCCACGG CGGTGGTGGT TTTGCCTTCC TGCTGGGTCG CGCGGAGAAC
GCCTGGCATT GCCGGGACAT CGTGCGGCAG GACTGTCCCC AGCCGCCCTC CCACTATCTG
AAGCGGTTCT CCGTGGACAG CGCGGTATTC GACGATCGTT CGCTACGTCT GCTGGTGGAA
GTCATGGGTG CCGACCACGT GATGCTGGGC TCGGACTACC CGTTTCCGCT CGGTGAACAG
GAAATCGGCA AGCTGGTCGC CAACAGCCCC AACCTCGATG AAACGGACCG GGCGCGGATT
CTGGCAGGCA ATGCCATGCG CTTCTTCGGT CTGACAGGCT GA
 
Protein sequence
MKKIDMHAHF FPRITREEAA ALDADNAPWL AVDSDGESGH IMAGDRRFRP VYRALWDPAL 
RIEEMDRNGL DMQIVCATPI MFGYGYDASA AATWARRMND LALEHCAYRP QRLKALAQVP
LQDLDLACIE ASRARESGHV GVQIGNHLGP HDLDDERLVR FLVHCANNDI PVLVHPWDMM
TDGRMKKWML PWLVSMPAET QLGILSLILS GAFERIPETL KLCFAHGGGG FAFLLGRAEN
AWHCRDIVRQ DCPQPPSHYL KRFSVDSAVF DDRSLRLLVE VMGADHVMLG SDYPFPLGEQ
EIGKLVANSP NLDETDRARI LAGNAMRFFG LTG