Gene Rxyl_2241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_2241 
Symbol 
ID4115190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp2250906 
End bp2251955 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content69% 
IMG OID638037026 
Productfumarylacetoacetate (FAA) hydrolase 
Protein accessionYP_644989 
Protein GI108805052 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGATCAG AGGATTCCCC GAGGGGCTGG TTCGCGCTCG GGACCTTCGA GCACGGAGGG 
CTCGCCTTCC CGGGCCTCGT GCTCGAGGGC GGGCGGGTGG TCGACCTCTC CAGGACGGAG
GTCCCCGGTG CTAGCGTGGC GAGCTTCCGC TCGGTTCGCC AGATCCTCGA GGGCTGGGAG
GCGAACCGTG CGGCGCTGGC CGCCTTCGCG AGGCATGGAG CGGCAGACGC CCACGACCTC
GCCGATCTGC GGGTCCTGCC GCCCGTCGAG CCGGTCCAGA TCCTGCAGAG CGGGGCCAAC
TACCACAGGC ACGTCGTGGA TCTCATCGTC GCCGAGGCGC GGGCCGGGAA CCCCCGGATG
ACACCCGAGG AGGAGGCGGA GGTGCGCCGG GCCGGCGAGA GGCTCATGGA CGAGCGCGCA
GAGCGCGGAG AGCCCTACCT CTTTCTCGGT TCCCCGACCG CCCTGTGTGG CCCCTACGAC
GATGTGGTGC TCCCCGCGGA GGGCGATCAG CACGACTGGG AGCTCGAGTT CGCGGCTGTC
ATCGGCAGGA GCGGGCGTCA CGTGCCCCCC GAGCGTGCTC TCGACCTCGT CGCCGGGTAT
ACGATCGCGA ACGACATCAC CACCCGCGAT CTCGTCTACC GTCCGGACCT CAAGGCTATC
GGCACCGACT GGCTGCGCTC CAAGAACGCG CCTACTTTCC TTCCGACCGG TCCCTACATC
GTCCCCAAGG AGTTCGTCGG CGACACCAGC GGCCTGCGCA TCACGCTCAG GCTCAACGGC
GAGACCATGC AGGACGAGTC TGCCTCGGAC ATGATCTTCG ACGTAGCGCG CCTCGTCTCC
TACGCATCAT CCCGGGTCTT GCTCCGGCCG GGAGACCTGA TCCTCACCGG CTCTCCCGCG
GGCAACGGCT CTTACTGGGG ACGTTTTCTC GGGGAGGGTG ACGTCATGGA GGGGACCGTC
ACCGGCCTCG GGTATCAGCG GAACCGCTGC GTCAGGGAAC GGCTACCGGA GGCGGCGCCC
GGTCGAGTTC CCGATGGGTC GGCGACATGA
 
Protein sequence
MRSEDSPRGW FALGTFEHGG LAFPGLVLEG GRVVDLSRTE VPGASVASFR SVRQILEGWE 
ANRAALAAFA RHGAADAHDL ADLRVLPPVE PVQILQSGAN YHRHVVDLIV AEARAGNPRM
TPEEEAEVRR AGERLMDERA ERGEPYLFLG SPTALCGPYD DVVLPAEGDQ HDWELEFAAV
IGRSGRHVPP ERALDLVAGY TIANDITTRD LVYRPDLKAI GTDWLRSKNA PTFLPTGPYI
VPKEFVGDTS GLRITLRLNG ETMQDESASD MIFDVARLVS YASSRVLLRP GDLILTGSPA
GNGSYWGRFL GEGDVMEGTV TGLGYQRNRC VRERLPEAAP GRVPDGSAT