Gene Rmet_4375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_4375 
SymbolfahA 
ID4041233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007974 
Strand
Start bp973929 
End bp975191 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content64% 
IMG OID637979796 
Productfumarylacetoacetase 
Protein accessionYP_586509 
Protein GI94313300 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID[TIGR01266] fumarylacetoacetase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00028234 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.510471 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCCC CCAAGACTAG CTGGGTCAAT TCCGCGAACG ACGGCCAGAC CCATTTCCCG 
CTGCAGAACC TGCCGTATGG CATCTTTTCG ACCACGGGCG GCGCGGCACG CGTGGGCGTG
GCGATCGGCA ACCAGATCGT CGATCTGGCT GCGCTGGACG ATGCCGGCCT GATGCCGACG
GCGGCCAAGG GCGCGTTTGC AGCGTCGAGC CTCAACCGCT TTATCGCGCT GGGCAAGCCG
GTCTGGACCG ACGTGCGCGC GCGCCTGACC GCGCTGCTGT CCGCCGATGA CCAGCGGCTC
TCCGGCAACG TGGCGCTGCG CGACAAGGCA CTCGTGCCGA TGTCGGCCGC CACGCTGCAT
TTGCCGGTGG ATATTCCCGG CTATACGGAC TTCTATTCGT CGCGCGAACA CGCCACCAAC
GTTGGCCGCA TGTTCCGCGA TCCCGAAAAC GCGCTGCTGC CAAACTGGCT CGAAATCCCG
ATCGGCTATA ACGGCCGTGC GAGTTCGGTG GTCGTCAGCG GCACCGCGCT GCATCGTCCC
AATGGCCAGA TCAAGCTGCC GAACGAGGCA CGCCCGATCT TCAGCCCCTG CCGCAAGCTT
GATTACGAGC TCGAGATGGC CTTTATCGTC GGCAAGCCGT CGAACCTCGG CGAGCCGGTG
AGCACAGGCG ATGCCCCGGC CCATATGTTC GGTCTGGTGA TCCTCAACGA TTGGAGCGCC
CGCGATATCC AGCAGTGGGA GTACGTGCCG CTCGGTCCGT TCAACAGCAA GTCCTTCGGC
ACCTCAATTT CGCCGTGGGT GGTGACGATG GACGCGCTGG AGCCGTTCCG CCGCGAAAAC
CCGGCGCAAT CGCCGGAGCC GTTGCCGTAT CTGCAGCAGC AAGGGCAAAA CGCCTACGAC
ATCGACCTGG AAGTGGCGCT GCAACCAGCC GGCGCCACGG CGGCCAGCAC GGTGTGCCGC
ACCAACTTCA AGGCGATGTA CTGGACCATG GCGCAGCAAC TGGCCCACCA CACGGTATCG
GGCTGCAATG TGCGCATTGG CGACCTGATG GGCTCCGGCA CGATCAGCGG CACTACGTCG
GATTCGTGCG GCAGCCTGCT GGAAACCACG CGTAATGGTG CGGAACCCGT CACCTTGGCT
GATGGCGCGA AGCGCGGTTT CCTCGAGGAT GGCGATACCG TGACCATGAC GGGCTGGTGT
CAGGGCGAAG GGTATCGCGT CGGCTTCGGC GAGGTAACAG GTAAAATCCT GCCGGCGCGC
TAA
 
Protein sequence
MTAPKTSWVN SANDGQTHFP LQNLPYGIFS TTGGAARVGV AIGNQIVDLA ALDDAGLMPT 
AAKGAFAASS LNRFIALGKP VWTDVRARLT ALLSADDQRL SGNVALRDKA LVPMSAATLH
LPVDIPGYTD FYSSREHATN VGRMFRDPEN ALLPNWLEIP IGYNGRASSV VVSGTALHRP
NGQIKLPNEA RPIFSPCRKL DYELEMAFIV GKPSNLGEPV STGDAPAHMF GLVILNDWSA
RDIQQWEYVP LGPFNSKSFG TSISPWVVTM DALEPFRREN PAQSPEPLPY LQQQGQNAYD
IDLEVALQPA GATAASTVCR TNFKAMYWTM AQQLAHHTVS GCNVRIGDLM GSGTISGTTS
DSCGSLLETT RNGAEPVTLA DGAKRGFLED GDTVTMTGWC QGEGYRVGFG EVTGKILPAR