Gene Rxyl_2349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_2349 
Symbol 
ID4115615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp2360787 
End bp2362100 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content70% 
IMG OID638037129 
Productsulfatase 
Protein accessionYP_645089 
Protein GI108805152 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00689032 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGGGGCG CCCCCGACGC GCCGCCCAAC ATCCTCTACC TCCACTCCCA CGACACCGGG 
CGCTGGGTGC AGCCCTACGG GCACGCCGTA CCCACCCCCA ACATTCAGAA GCTCGCCGAG
GAGGGGGTGC TCTTCAGGCA GGCCTTCTGC GCCGCCCCCA CCTGCTCCGG GAGCCGGGCC
TGCCTGCTCA CCGGGCAGTA CGCGCACTCC AACGGGATGG TCGGGCTCGC GCACCGGGGG
TTCTCGCTCA AGGACTACCG GCACCACATC GTCCATACCC TGCGCCGCCT CGGTTACTGG
TCGGCGCTCA TCGGGGAGCA GCACATCTCC AAAAAGCCTG AGGTGATCGG CTACGACGAG
GTCTTCAAGA TCCCCACCAA CCACACCGAC GACGTGGTGC CCGTGACCCT CGAGCTCCTC
TCCCGGGACC ACGGGCGGCC CTTCTTTCTC TCCGTGGGCT TCTTCGAGAC CCACCGGGAG
TTCTTCCGTC CCAGCTCGCC CAGAAAGGCC AACTACGTCC TCCCGCCGCC CAACCTCCCG
GACGCTCCGG AGACGCGGCT GGACATGGCC GCCTTCTGCG AGAGCGCCCG CTCGCTCGAC
CGGGGGGTGG GGGCGGTGCT GGACGCCCTC GACGCTGCCG GGCTCGCCGA GAACACGCTG
GTGATCTGCA CCACGGACCA CGGGATCGCC TTCCCGGGCT GCAAGGCCAC CCTCTACGAC
CGGGGGCTGG GGGTCATGCT GATCCTGCGC GGCCCCGGCG GCTTCTCCGG CGGCCGGGCG
TCGGACGCGC TGGTCTCCCA CATAGACATC TTCCCGACGG TCTGCGACCT CGTCGGGATA
GAACGCCCCC CGTGGCTGCA GGGCGAGTCG CTGCTGCCGC TGGTGCGGGG GGAGGTGGAG
GAGGTCCGGG AGGCCATCTT CGCCGAGAAG ACCTACCACG TTGCTTACGA GCCCCAGCGC
TGCGTCCGGA CCCGGCGCTG GAAGTACATC CGGCGCTTTG ACGACCGCTC GACCCCGGTG
CTGGCGAACA CCGACGACGG CCCGAGCAAG GAGCTTTTGC TGCGCCACGG CTGGGCGGAG
CGCCCAGTCC CCGAGGAGCA GCTCTACGAC CTCCTCTACG ACCCGAACGA GGCGTGCAAC
CTGGCCGGCG ACCCCGCCCA CGCCCCCGTG CTGCGGGAGA TGCGGGCGAG GCTCGAGCGT
TGGATGCGCT CCACGCAGGA CCCCATCCTG CGGGGACCCG TACCGCCCCC GCCCGGCGCG
GAGCTCAACC TCCAGGACCA GCTCTCCCCC AAGGACCCCA CCGTGCGGGT CTGA
 
Protein sequence
MRGAPDAPPN ILYLHSHDTG RWVQPYGHAV PTPNIQKLAE EGVLFRQAFC AAPTCSGSRA 
CLLTGQYAHS NGMVGLAHRG FSLKDYRHHI VHTLRRLGYW SALIGEQHIS KKPEVIGYDE
VFKIPTNHTD DVVPVTLELL SRDHGRPFFL SVGFFETHRE FFRPSSPRKA NYVLPPPNLP
DAPETRLDMA AFCESARSLD RGVGAVLDAL DAAGLAENTL VICTTDHGIA FPGCKATLYD
RGLGVMLILR GPGGFSGGRA SDALVSHIDI FPTVCDLVGI ERPPWLQGES LLPLVRGEVE
EVREAIFAEK TYHVAYEPQR CVRTRRWKYI RRFDDRSTPV LANTDDGPSK ELLLRHGWAE
RPVPEEQLYD LLYDPNEACN LAGDPAHAPV LREMRARLER WMRSTQDPIL RGPVPPPPGA
ELNLQDQLSP KDPTVRV