Gene Rxyl_3110 details


Gene Information

Locus tag: Rxyl_3110
Symbol:
ID: 4114909
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rubrobacter xylanophilus DSM 9941
Kingdom: Bacteria
Replicon accession: NC_008148
Strand:
Start bp: 3119336
End bp: 3120814
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 70%
IMG OID: 638037877
Product: sulfatase
Protein accession: YP_645829
Protein GI: 108805892
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
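
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3119336, 3120814)
print(length, protein_length(length))  # 1479 492
```

Under these assumptions the computed values agree with the reported gene length (1479 bp) and protein length (492 aa).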


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCTTTC TGCCGCCGGC CGCATTGCGG GTGCGGCGCC TCCCTGCGAA GCTGGTGCTC 
GCCGGGCTCC TCGTCGCCGC GGTCTGCGTC CTCTCGTCGG GGGACCGGCA AGTGGTTCTG
GCCCGGAAAC CCGAGCGCCC CAACCTCATC CTCATCCTCA CCGACGACCA GACGCCGGGT
GATGTCGGGT ACATGCCTGG GGTGAGAGCG CTGCTCCGGG ACCGGGGAAC CACCTTCCGC
AACGCCTTCG TCACCGACTC CGTCTGCTGC CCCTCGCGGG CGACGATCCT GCGCGGCCAG
TACGCCCACA ACCACGAGAT AGCCGGCGCC AAACCGCCCG CGGGCGGTTT CGAGAAGTTC
CGGCGGCTCG GGCTCGAGAG GTCCACCGTG GCCACCTGGC TCAAGGCCCG GGGCTACGCG
ACGGGCTTCG TAGGGAAGTA CCTCAACGGC TACCTCAGGA CCACCCACGT CCCTCCGGGC
TGGGACCGGT GGTACGGCTT CAACGGCGGC GGGTACCACG ACTTCACCCT GAACGAGAAC
GGGCGCAACG TCTCCTACCG GGGCCCCTCG AGCTACCAGA CCGACGTCCT CGGCCGGAAG
GCCCTCGGCT TCGTCCGGTG GGCGGCCCGG AGGGACAGAC CCTTCTTCCT GCACCTCTCC
CCGTGGGCGC CACACGGTCC GGCGGAGCCC GCCCCCCGGC ACGCCCGGCT GTTCGCCCGG
ACGCCGCTGC CCCGCCCGCC CTCCTTCGAC GAGCGGGACG TCTCGGACAA GCCCCGCTGG
GTGCGGGACA ACCCCCGCCT GGGCCGGGAG GAGGTGCGGG AGATGGGACG GCTCTACCGC
AACAGGCTCC GCACCCTGAG GGCGGTCGAC GAGTTGGTGG GCCGCCTTGT GGCCGCCCTC
CGCGAGAGCG GGCAGCTCGA GAACACCTAC ATCTTCTTCA CCTCGGACAA CGGCTTCCAC
ATGGGCCACC ACCGGCTGCC GGAGGGGAAG TGGACCGCCT ACGAGGAGGA CATCAGGGTT
CCGCTCCTGG TGCGGGGGCC TGGGGTGCCC GAGGGACGGG TGCTCCCGCA CCTGGTGCTG
AACAACGACC TTGCGCCGAC CTTTGGCCGG CTTGGGGGGG CGAGGGTTCC GGGGTATGTG
GACGGGCGCT CTCTTGTTTT GCTGCTGCGG CGGGACCCTC CCTCCCGGCG TAGCTGGCGC
TCGGCCTTTC TTGTGGAGGC GAAGCGGGAT GGCGCCAACC GGCGTCCCGC CTACCGGGCG
CTTCGCTCCG TCGGACACCT GTACGTGGAG TACGAGAGCG GGGAGAGGGA GCTCTACGAC
CTGCGCCGCG ACCCCCACCA GCTCCGGAAC CTCGCACCGC GTCTGGATGG GGAGAGCGCC
CGGAAGCTCC GCTCGCGGCT TGCTAAATTG AGCGGGTGCG CGGAAGAGGA GTGCAGAACC
CTGGAGAACC GGAAGCCCGT GTGGCCGGAG GTCCGGTGA
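
A GC content figure like the one reported for this gene can be recomputed from a sequence laid out in space-separated blocks of ten bases. This is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; `gc_content` is an illustrative helper, not the method the database itself uses.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First block of the gene sequence: 3 G + 2 C out of 10 bases.
print(gc_content("GTGAGCTTTC"))  # 0.5
```

Passing the full gene sequence to the same function gives a value that can be compared with the reported GC content.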
 
Protein sequence
MSFLPPAALR VRRLPAKLVL AGLLVAAVCV LSSGDRQVVL ARKPERPNLI LILTDDQTPG 
DVGYMPGVRA LLRDRGTTFR NAFVTDSVCC PSRATILRGQ YAHNHEIAGA KPPAGGFEKF
RRLGLERSTV ATWLKARGYA TGFVGKYLNG YLRTTHVPPG WDRWYGFNGG GYHDFTLNEN
GRNVSYRGPS SYQTDVLGRK ALGFVRWAAR RDRPFFLHLS PWAPHGPAEP APRHARLFAR
TPLPRPPSFD ERDVSDKPRW VRDNPRLGRE EVREMGRLYR NRLRTLRAVD ELVGRLVAAL
RESGQLENTY IFFTSDNGFH MGHHRLPEGK WTAYEEDIRV PLLVRGPGVP EGRVLPHLVL
NNDLAPTFGR LGGARVPGYV DGRSLVLLLR RDPPSRRSWR SAFLVEAKRD GANRRPAYRA
LRSVGHLYVE YESGERELYD LRRDPHQLRN LAPRLDGESA RKLRSRLAKL SGCAEEECRT
LENRKPVWPE VR
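
The protein sequence relates to the gene sequence through translation table 11, which uses the standard codon assignments and allows alternative start codons such as GTG. The sketch below is a hedged illustration of that relationship using only the Python standard library; `translate_cds` and the constants are illustrative names, and it assumes the input is a complete CDS in the coding orientation, whose first codon is read as methionine.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; the start codon becomes M and translation stops at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    residues = ["M"]  # an alternative start codon such as GTG still initiates with Met
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGCTTTC TGCCGCCGGC CGCATTGCGG"))  # MSFLPPAALR
```

The output matches the first ten residues of the protein sequence, including the GTG start codon being read as M rather than V.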