Gene Sare_4686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4686 
Symbol 
ID5704313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5308187 
End bp5309383 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content71% 
IMG OID641274084 
Productfumarylacetoacetase 
Protein accessionYP_001539430 
Protein GI159040177 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID[TIGR01266] fumarylacetoacetase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.454046 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.30172 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTGGG TGACCGGCCT CGAGGGCTCG CCGTACGGGG TGCACAACCT GCCGTACGGG 
GTGTTCCGCA CCGCAGCCCG TGAGCCGCGG GTGGGCGTAC GCGTCGGTGC CCACGTGCTC
GACCTCGGCG GTGCGGAGCA GGCTGGTCTG GTGTTTGCTG CCGGCACGTT GGGCCGCCCC
CGGCTGAACG ACTTCATGGC GCTCGGTCGA CCGCAGTGGA CGGCGGTGCG GCAGCGGGTC
ACCGAGCTGC TCACCGACAG CGCCCACCGG GCAGCGGTCG AGCCGCTGCT CCTGCCGCTG
CTGGACGTCG AGCTGCTGCT TCCCTTCGAC GTGGCCGACT ACGTCGACTT CTACTCTTCC
GAGCAGCACG CCACCAACGT CGGCAAGATC TTCCGCCCCG GTCAGCCACC GCTGTTGCCC
AACTGGAAGC ACCTGCCGAT CGGCTACCAC GGCCGGGCCG GCACCGTCGT GGTCTCCGGC
ACCCCGATCG TCCGGCCGTG CGGCCAGCGG GCCACCCCGC AGGGCCCGGT CACCGGTGCC
TCCGTCCGCC TCGACATCGA GGCGGAGGTC GGCTTCGTGG TGGGAGTGCC CAGCCCGCTG
GGCCACCGCG CCGCAGCCGG CGACTTCGCC GACCACGTGT TCGGCGTGGT ACTGGTCAAC
GACTGGTCCG CCCGGGACAT CCAAGCGTGG GAGTACCAAC CTCTCGGCCC GTTCCTCGGT
AAGTCCTTTG CCACGTCGAT CGCCGCCTGG GTGACGCCGC TGGAGGCTCT TGGCGCGGCG
TTCGTACCAG CGCCGGACCA GGATCCGCCC GTCGCCGACT ATCTGCGCGA CGAGCCCCAC
CTCGGATTGG ACCTGCGTCT GTCGGTGGAG TGGAACGGTG AGCGGGTGAG CGAGCCACCG
TTCGCCGCGA TGTACTGGAC GCCGGCACAG CAACTAGCCC ACCTGACCAT CAACGGAGCC
GCGTTGCGCA CCGGTGACCT GTACGCCTCC GGCACCGTGT CCGGGGACGA GCGGCACCAG
GTCGGCTCGT TCCTCGAGCT GACCTGGGGC GGTACGGAGC CCGTGCGAGT TGGCGGCGAG
GAACGGATGT TCCTGGCGGA CGGCGACACC GTCACCATCT CTGGTACCGC GCCCGGACCG
GACGGTACGA CCGTCGGCCT CGGCGAGGTC ACCGGCACCA TCATCAGCCC CCGCTGA
 
Protein sequence
MTWVTGLEGS PYGVHNLPYG VFRTAAREPR VGVRVGAHVL DLGGAEQAGL VFAAGTLGRP 
RLNDFMALGR PQWTAVRQRV TELLTDSAHR AAVEPLLLPL LDVELLLPFD VADYVDFYSS
EQHATNVGKI FRPGQPPLLP NWKHLPIGYH GRAGTVVVSG TPIVRPCGQR ATPQGPVTGA
SVRLDIEAEV GFVVGVPSPL GHRAAAGDFA DHVFGVVLVN DWSARDIQAW EYQPLGPFLG
KSFATSIAAW VTPLEALGAA FVPAPDQDPP VADYLRDEPH LGLDLRLSVE WNGERVSEPP
FAAMYWTPAQ QLAHLTINGA ALRTGDLYAS GTVSGDERHQ VGSFLELTWG GTEPVRVGGE
ERMFLADGDT VTISGTAPGP DGTTVGLGEV TGTIISPR