Gene RoseRS_4208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4208 
Symbol 
ID5211193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5268185 
End bp5269186 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content61% 
IMG OID640597797 
Productfumarylacetoacetate (FAA) hydrolase 
Protein accessionYP_001278501 
Protein GI148658296 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.97195 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCTGG TCACCTTTCA GTTCGTTGAT CGTCAGCCGC GGGTTGGCGT GCTGCTCGGC 
AGTGCCGTTA TCGATCTGGC GGCAGCGGCG CCGCTCGTCT TTGAAGATCC GCCCCAGCCG
CCCTGGAGTC TGCTCGATGT GCTGGGTGGC GTGCCGGACG GTATGGGGCT CGATGGCGCG
GCGGAGATCG TAGCGGCGGT GATCGATCAG ATCGGCGGTG TGGATGATGA GCATCTCGTT
CTACAGTCCG GTTCTCTGAT GATCGGCGGC GTCGAGATGC TTATCCCGCT CGACGAGGCG
CGCCTGTTGC CGCCATTGCC GCGCCCTTCC AGCCTGCGCT GCTTTGAATC CAGCGAACAG
CACATGGCGG CGCTTGCCCG TCTGCACGGC GGCGGGATCC CCTATTTCTG GTATGAACGA
CCACTCTTTG CCTTTGGCAA TCACGCAGCC ATGTATGGTT CCGAAGCATA TGTGCCACTC
CCACGCACCG TCGCTTTCGA TTATGAACTC GAGGTGGCGT GCGTTATCGG TCGCAGCGGA
CGCGACATTT CCCCTGAAGA GGCGGGCGAT TACATTGCCG GGTATCTGCT GCTCAACGAC
TGGACGGCGC GTGACGTCCA GCGCGAGGAA CTGATCGCCG GGTTCGCCTT CAGCAAGAGT
AAAGATGCAG CGACCTCCAT CGGTCCCTGG CTTGCCACGC CGGACGAACT CCAGGAATAT
GCCCTCGACG ATGGTCAGTT CAATCTGACC CTGATCGCTC GCGTCAACGG CGTCGAACAG
TCGCGTGGCA ACCTTCGCGA TCTGACGTAC TCTTTTGCCC AAATGATTGC GATTGCTTCC
GAAGGGTGCA CCCTGTTCCC CGGCGACATG ATCGCCTGCG GCGCGATTGG CGGTTCACTG
CTCGAGGCGA CCAATGGACA GGGACCCTGG CTCGAACCTG ATGATCTGGT CGAGCTTGAG
GCTGGCGGGC TGGGTATTCT CCGCAACCGC ATTGTCGCCT GA
 
Protein sequence
MRLVTFQFVD RQPRVGVLLG SAVIDLAAAA PLVFEDPPQP PWSLLDVLGG VPDGMGLDGA 
AEIVAAVIDQ IGGVDDEHLV LQSGSLMIGG VEMLIPLDEA RLLPPLPRPS SLRCFESSEQ
HMAALARLHG GGIPYFWYER PLFAFGNHAA MYGSEAYVPL PRTVAFDYEL EVACVIGRSG
RDISPEEAGD YIAGYLLLND WTARDVQREE LIAGFAFSKS KDAATSIGPW LATPDELQEY
ALDDGQFNLT LIARVNGVEQ SRGNLRDLTY SFAQMIAIAS EGCTLFPGDM IACGAIGGSL
LEATNGQGPW LEPDDLVELE AGGLGILRNR IVA