Gene Rmet_5199 details


Gene Information

Locus tag: Rmet_5199
Symbol:
ID: 4042060
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007974
Strand:
Start bp: 1894502
End bp: 1895440
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 65%
IMG OID: 637980617
Product: fumarylacetoacetate hydrolase
Protein accession: YP_587327
Protein GI: 94314118
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID:
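
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function name `check_gene_lengths` is hypothetical and not part of the source database.

```python
def check_gene_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length and protein length agree.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    span = end_bp - start_bp + 1                 # inclusive coordinate span
    codons, remainder = divmod(gene_length_bp, 3)
    return (
        span == gene_length_bp                   # coordinates match the stated length
        and remainder == 0                       # CDS is a whole number of codons
        and codons - 1 == protein_length_aa      # all codons except the stop are translated
    )


# Values taken from the record above.
is_consistent = check_gene_lengths(1894502, 1895440, 939, 312)
print(is_consistent)  # prints True
```

Here 1895440 - 1894502 + 1 = 939 bp, and 939 / 3 = 313 codons, of which 312 encode amino acids.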


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00147171
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0797568
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAATT GGTATAGCCT CGCCACTTTT GAACTTGCCG GTCGGCAGCA GCCCGGCCTG 
GTCGTTGAGA ACACGCTCTA TGCGCTGGCC GATGTGGGCC AGGCGTGCGG CATCGAGGCA
TCGCGCCTGC CGCAGGACCT GAATGCGGCA CTGGCGGACT GGTCGCGCCA TGCGCCGCTG
CTGGCCGACG CCGCCGCACG TATTCCCGCA TTGCGCGCGG CTGGCAAGCT CGCCGCGGTG
GACGCCGCCG CCACCTATGC GGCGCCCTAT CGGCCACGCC GGATCTTTGG CACCGCCTCA
AACTTCTACG AGCACGCGGA CGAGATGGGC ACCAAGCTGG CTGCCCGCAG CGAAAGCCAG
CCCTACATAT TCATGAAGGC TGAAACCAGC GTGGTGGCCA CCGGCACCAC GGTGCTGATG
CCGCCGGAAA CCAAGAAACT CGACTGGGAA GTGGAGCTGG GCGTAGTGAT TGGCCAGGCA
TGCCGCCATG TCAGCGTGGA GGACGCGCTG TCGGTGATCG CCGGCTATAC CGTGTTCAAC
GACATCAGCG CGCGTGACCT GAACCGCCGC ACCGACTATC CGTTCACGCA CGACTGGTTC
CGCGGCAAGA GCTTCGATAC CTTCGGCCCG ATGGGCCCGT GGCTTGTGCC CGCGACCTGC
ATTCCCAATC CGCAGAACCT GCGCATGACG CTGCATGTCA ACGGCGAGGT CATGCAGAAC
GGCAACACCT CGCAAATGAT CTTCTCGGTG GCCGAGCAGA TCGCCTACCT GTCGCGTATT
CTGACGCTGC AACCTGGCGA CCTTATCGCC ACGGGCACGC CGGACGGTGT GGGCATGGGG
CGTGGACTCT TCCTGAAGCC TGGCGACAGC ATGACGGCCT GGGTCGAGCA GATCGGCACG
ATCGAGAACC GCGTCGCGCT GGAACCGAAC GCACGCTAG
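
The GC content field in the record can be recomputed from a sequence laid out like the one above (blocks of ten bases separated by spaces and line breaks). The following is a minimal sketch; the function name `gc_content` is hypothetical, and only the first line of the gene sequence is used here for brevity.

```python
def gc_content(sequence):
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, line breaks) is ignored and case is normalized,
    so the blocked layout used on this page can be passed in directly.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above; paste the full sequence to
# compare against the GC content value reported in the record.
first_line = "ATGGTGAATT GGTATAGCCT CGCCACTTTT GAACTTGCCG GTCGGCAGCA GCCCGGCCTG"
print(f"{gc_content(first_line):.1f}%")
```

A single line is only a local estimate; the reported figure applies to the whole 939 bp gene.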
 
Protein sequence
MVNWYSLATF ELAGRQQPGL VVENTLYALA DVGQACGIEA SRLPQDLNAA LADWSRHAPL 
LADAAARIPA LRAAGKLAAV DAAATYAAPY RPRRIFGTAS NFYEHADEMG TKLAARSESQ
PYIFMKAETS VVATGTTVLM PPETKKLDWE VELGVVIGQA CRHVSVEDAL SVIAGYTVFN
DISARDLNRR TDYPFTHDWF RGKSFDTFGP MGPWLVPATC IPNPQNLRMT LHVNGEVMQN
GNTSQMIFSV AEQIAYLSRI LTLQPGDLIA TGTPDGVGMG RGLFLKPGDS MTAWVEQIGT
IENRVALEPN AR
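
The protein sequence can be derived from the gene sequence by codon-wise translation. The following is a minimal sketch using the standard codon assignments, which translation table 11 shares for internal codons; table 11 differs in the start codons it permits, and this sketch does not handle alternative starts. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored so the blocked layout on this page can be used.
    Assumes the sequence starts in frame and contains only A, C, G, T.
    """
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break  # stop codon is not part of the protein
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate_cds("ATGGTGAATT GGTATAGCCT CGCCACTTTT"))  # prints MVNWYSLATF
```

The output matches the first block of the protein sequence, and the final three codons of the gene (GCA CGC TAG) translate to the terminal residues AR followed by a stop.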