Gene Rxyl_2114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_2114 
Symbol 
ID4114710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp2142537 
End bp2143991 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content73% 
IMG OID638036900 
Productsulfatase 
Protein accessionYP_644870 
Protein GI108804933 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.711307 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTCCCT CTCGCCGGAA GAGGCGCGCC GCCCTCCTGC TCGGGGCGCT CCTGGTCCTC 
TCCGCTTTGC TTTCCTGCGC CTGCGGGGAG CGGCGCCGGG GGCAGGGGCG GGCGGTCGGC
GGGCCGAACA TCGTGCTTGT CGTGGCCGAC GACCTCGACG TCCGGACCGC GGAGCGCCTG
CCGCGCCTGC GCCGGCTCCT CGCCGACCGC GGGACGAGCT TCGAGAACGC CTTCGTGACG
GACGCCCTGT GCTGCCCCTC GCGGGCGACG ATCCTGCGCG GCCAGTACGC CCACAACCAC
GGGATCCGGG GCAACGAGCC CCCGCGCGGC GGCTTCGAGA GGTTCCGGCG GCTCGAGGGC
TCGACGGTGG CCACCTGGCT GAAGGCCGCC GGTTACCGGA CGGCGTACTT CGGCAAGTAC
ATGAACGGCT ACGGCAGGAG CGAGACCCGC GTGCCCCCGG GGTGGGACGA GTGGCACGCG
GTGGCCGGGA ACTACCTGAG CAGCTGGTAC AACGATAACG GCCGCGTCCG CTACTACAGC
CCCGCCCTCT ACAACGACAC CGACCTCATC GCCGAAAAGG CCACCTCTTA CCTGAGGAGG
ACCGCCGGGA GGGGGGCGCC GTTCTTCGTG GTGCTGGCGC CGCGGGCCCC GCACCAGCCC
GCCGTCCCGC CGCCCCGCTA CGCGGACGCC TTCCCGGAGG CCCCCCTCCC CCGCGGCCCC
TCCTTCGACG AGCGGGACGT CTCGGACAAG CCCCGCTGGG TGCGGGACAA CCCCCGCCTG
GGCCGAAAGA AGCTGGAGTT TCTGGGGTCG CTCTACCGGC GGCGGCTGCG CTCGATGCTC
GCGGTGGAGG ATCTGGTGGA GCGCCTGCTG CGCACCCTCC GCGAGAGCGG GCAGCTCGAG
AACACCTACA TCTTCTTCAC CTCGGACAAC GGCTTCCACA TGGGCCACCA CCGGCTGCCG
GAGGGGAAGT GGACCGCCTA CGAGGAGGAC ATCAGGGTTC CGCTCCTGGT GCGGGGGCCT
GGGGTGCCCG AGGGACGGGT GCTCCCGCAC CTGGTGCTGA ACAACGACCT TGCGCCGACC
TTTGGCCGGC TTGGGGGGGC GAGGGTTCCG GGGTATGTGG ACGGGCGCTC TCTTGTTTTG
CTGCTGCGGC GGGACCCTCC CTCCCGGCAT AGCTGGCGCT CGGCCTTTCT TGTGGAGGCG
GCCTCGCACG GGGAGTCGGG GAGGCCGGGG CTCGTGGCGG TGAGGACGCG CGGGCACCTG
TACGTGGAGT ACGAGAGCGG GGAGAGGGAG CTCTACGACC TGCGCCGCGA CCCCCACCAG
CTCCGGAACC TCTACCGGCG CGCCCCCCGG GGGCTCGTGC GGGACCTGAA GGGGCGGCTC
GAGGCGCTCG CGGACTGCTC GGGGGAGGGA TGCCGGGCGG CCGAGGACGG CCCGGGACGG
GACGGGGGGC GCTAA
 
Protein sequence
MAPSRRKRRA ALLLGALLVL SALLSCACGE RRRGQGRAVG GPNIVLVVAD DLDVRTAERL 
PRLRRLLADR GTSFENAFVT DALCCPSRAT ILRGQYAHNH GIRGNEPPRG GFERFRRLEG
STVATWLKAA GYRTAYFGKY MNGYGRSETR VPPGWDEWHA VAGNYLSSWY NDNGRVRYYS
PALYNDTDLI AEKATSYLRR TAGRGAPFFV VLAPRAPHQP AVPPPRYADA FPEAPLPRGP
SFDERDVSDK PRWVRDNPRL GRKKLEFLGS LYRRRLRSML AVEDLVERLL RTLRESGQLE
NTYIFFTSDN GFHMGHHRLP EGKWTAYEED IRVPLLVRGP GVPEGRVLPH LVLNNDLAPT
FGRLGGARVP GYVDGRSLVL LLRRDPPSRH SWRSAFLVEA ASHGESGRPG LVAVRTRGHL
YVEYESGERE LYDLRRDPHQ LRNLYRRAPR GLVRDLKGRL EALADCSGEG CRAAEDGPGR
DGGR