Gene Rxyl_0661 details

Gene Information

Locus tag: Rxyl_0661
Symbol:
ID: 4114647
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rubrobacter xylanophilus DSM 9941
Kingdom: Bacteria
Replicon accession: NC_008148
Strand:
Start bp: 694601
End bp: 696541
Gene Length: 1941 bp
Protein Length: 646 aa
Translation table: 11
GC content: 71%
IMG OID: 638035446
Product: sulfatase
Protein accession: YP_643443
Protein GI: 108803506
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
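
The gene and protein lengths listed above follow from the start and end coordinates. The short sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(694601, 696541)
residues = protein_length_aa(length)
print(length, residues)
```

With the values from this record, the computed lengths agree with the listed 1941 bp and 646 aa.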


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGCAGA GCCTGCAGAG AGCCCTGCTG AGCCTCCTGG ACCGCCGGGA CTGGTTCTAC 
CTGCTCTGCC TGCTACTGCC CCTCGCGGCC TACACCCTGC TGCTGCGGCT GCTCGGCCTC
CGGCTGCAGG GCGAGGCCGG GGGGCTGCTC GGGACGCTGG CGCTCTTGCG CTCCGATTTG
CTCTTTCTCG CCGGGTACGC CCTGCTGTGG GTGGGGCTCT TCGCCGCCTT CCGCCGGGGG
CCGGGCCGCT GGGCGGCGCT CGGGCTGCTC CACGCCAGCG CGGTGCTCGT GGTGGCCATC
TCCACCAGCG CCTACCAGTA CCTCAGGTCC ACCGGGGCCA CCCTCGACTA CAGCGTCGTC
GCCTACTACC TGACCTCCTT CGGGGAGGCC ACGGGGGCCA TCTCCAGCGA GGCCCCGCTG
TACATGTGGC TCATCCTCGC CGAGGCGCTC CTGTACGCGG CCTTCGGGCC CTGGGCCTTC
ACGCGGGCCT TTCTCGGGCC GCGGCGGGAG GGGGCCGGGG AGGGGGGAGC GCCCCCGCGG
GGACGGGGCG TGAGCCGCCG GCGCTTTATC GCCTCGGGGG TGGGGGCGGG GGCCGGAATC
CTCCTGCTGC GCGAGTCGCT GCTCCCCGAG GCCGCGCGGG GGCAGGGCAC CTCCGTCTCC
CGCTCGCCCG TCTCCAACCT CATCGCCACC CGCATAGAGG AGTCCCGGAT GGACGCCGCG
GCCGAGAGCG TCCGGGTCAC CAACACCCTG CGGGGCATCC GCCTCGAGCC CACCTTCCGG
ACCAGGAGGC GGCACGTCGC CCTCATCCAC CTGGAGTCCA CCCGCGAGCG CTCCGTAACC
CCCTACAACC GGGACATCGC CACCATGCCC CTGCTCGCCG AGCTCGCCCG GGACAACAGT
TTGCTCGTCG AGTGGGCCTA CACCACCACC CCGCACACCT CCAAGGCCAT AACCTCCGTG
AACACCGGGC TCTACCCCCA CCCGGACACC GAGATCGTGG AGGCCCGTCC CGGGGCCATC
CCGGCGCCGG GGATCGCCGC GCTGCTGGCC GGGCAGGGCT ACCGCACCGC CTGGTTCCAG
TCGGCCACCG AGAAGTTCGA GAACCGGGCG CAGCTGGTGA AGAACTTCGG CTACGGGCAC
TTCCAGGCCT TCGAGGACAT GAGCACCGAG GGCTTCCAGC GCTCCAACTA CCTCGGGTAC
GAGGACGACA TCATGCTCGG TCCCAGCCGC CGCTGGCTGG AGGAGAACGC CTCCTCTCCC
ACCCTCGTCA TGTACCTCGG GGTCACCCCG CACCACCAGT ACCTGGTCCC CGACCGCTAC
GGGCGCCGCC GGTTCTCGGG GGAGGAGATG CTCAACCGCT ACCTCAACAA CGTCCGCTAC
GACGACTTCT GGGTGCGCAA CATCCTCCGG CAGTACAGGG AGCTCGGGCT CTACGAGGAC
ACCATCTTCG TGATCTACGG CGACCACGGG GAGGCCTTCG GCGAGCACGG GCTCAAGGGG
CACGACCCCA TACCCTACGA GGAGGTGCTG CGGGTCCCCC TGATCATCCA CGACCCCCAG
GGCTTCGACG GCGGGGCGAG GATCGAGGGC CCGGTCCAGC TCATAGACTT CCCGCCGACC
ATCGTGGACC TGCTCGGCTT CAGGGTCGCC GGCGGCGAGT ACCTGGGGCG CTCGCTGCTG
CGGCCGCCGG AGGAGCGCAC CCTCCTCTTC AGCTGCCGGC CGGACATCAC GGCGATGGCC
AGCATCCGGG GCTACGAGAA GTACATCTAC CACTACGACA AGCGGCCCGA GGAGTTCTAC
GACCTCTCCC GCGACCCCAC CGAGCAGAAC AACCTCGCCT CCCGGGTCGG CCGGCGGGAG
CTGCGCCGGC GGCGCGAGGA GCTCCTGGAG TGGCACGCCC GGACGGCCGC GATCTTCGAG
GAGCGCCAGC GGCGGGCGTA G
 
Protein sequence
MPQSLQRALL SLLDRRDWFY LLCLLLPLAA YTLLLRLLGL RLQGEAGGLL GTLALLRSDL 
LFLAGYALLW VGLFAAFRRG PGRWAALGLL HASAVLVVAI STSAYQYLRS TGATLDYSVV
AYYLTSFGEA TGAISSEAPL YMWLILAEAL LYAAFGPWAF TRAFLGPRRE GAGEGGAPPR
GRGVSRRRFI ASGVGAGAGI LLLRESLLPE AARGQGTSVS RSPVSNLIAT RIEESRMDAA
AESVRVTNTL RGIRLEPTFR TRRRHVALIH LESTRERSVT PYNRDIATMP LLAELARDNS
LLVEWAYTTT PHTSKAITSV NTGLYPHPDT EIVEARPGAI PAPGIAALLA GQGYRTAWFQ
SATEKFENRA QLVKNFGYGH FQAFEDMSTE GFQRSNYLGY EDDIMLGPSR RWLEENASSP
TLVMYLGVTP HHQYLVPDRY GRRRFSGEEM LNRYLNNVRY DDFWVRNILR QYRELGLYED
TIFVIYGDHG EAFGEHGLKG HDPIPYEEVL RVPLIIHDPQ GFDGGARIEG PVQLIDFPPT
IVDLLGFRVA GGEYLGRSLL RPPEERTLLF SCRPDITAMA SIRGYEKYIY HYDKRPEEFY
DLSRDPTEQN NLASRVGRRE LRRRREELLE WHARTAAIFE ERQRRA
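
The gene and protein sequences above are related by translation table 11, and the GC content is a simple base count. The following is a minimal sketch of both calculations, assuming the sequence is pasted in as text with the 10-character blocks separated by whitespace, as shown on this page. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Table 11 assigns the same amino acids to codons as the standard code; it differs in the set of permitted start codons, which this sketch does not handle (this gene starts with ATG, so that makes no difference here).

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# These assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between blocks and lines; upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGCCGCAGA GCCTGCAGAG AGCCCTGCTG"))
```

Applied to the first three blocks of the gene sequence, `translate` gives MPQSLQRALL, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` should give a value consistent with the GC content listed in the record.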