Gene Information |
Locus tag | Rxyl_0661 |
Symbol | |
ID | 4114647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | + |
Start bp | 694601 |
End bp | 696541 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638035446 |
Product | sulfatase |
Protein accession | YP_643443 |
Protein GI | 108803506 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
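The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 694601
END_BP = 696541


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends in one untranslated stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # 1941 646
```

Under these assumptions the result agrees with the listed Gene Length (1941 bp) and Protein Length (646 aa).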
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCAGA GCCTGCAGAG AGCCCTGCTG AGCCTCCTGG ACCGCCGGGA CTGGTTCTAC CTGCTCTGCC TGCTACTGCC CCTCGCGGCC TACACCCTGC TGCTGCGGCT GCTCGGCCTC CGGCTGCAGG GCGAGGCCGG GGGGCTGCTC GGGACGCTGG CGCTCTTGCG CTCCGATTTG CTCTTTCTCG CCGGGTACGC CCTGCTGTGG GTGGGGCTCT TCGCCGCCTT CCGCCGGGGG CCGGGCCGCT GGGCGGCGCT CGGGCTGCTC CACGCCAGCG CGGTGCTCGT GGTGGCCATC TCCACCAGCG CCTACCAGTA CCTCAGGTCC ACCGGGGCCA CCCTCGACTA CAGCGTCGTC GCCTACTACC TGACCTCCTT CGGGGAGGCC ACGGGGGCCA TCTCCAGCGA GGCCCCGCTG TACATGTGGC TCATCCTCGC CGAGGCGCTC CTGTACGCGG CCTTCGGGCC CTGGGCCTTC ACGCGGGCCT TTCTCGGGCC GCGGCGGGAG GGGGCCGGGG AGGGGGGAGC GCCCCCGCGG GGACGGGGCG TGAGCCGCCG GCGCTTTATC GCCTCGGGGG TGGGGGCGGG GGCCGGAATC CTCCTGCTGC GCGAGTCGCT GCTCCCCGAG GCCGCGCGGG GGCAGGGCAC CTCCGTCTCC CGCTCGCCCG TCTCCAACCT CATCGCCACC CGCATAGAGG AGTCCCGGAT GGACGCCGCG GCCGAGAGCG TCCGGGTCAC CAACACCCTG CGGGGCATCC GCCTCGAGCC CACCTTCCGG ACCAGGAGGC GGCACGTCGC CCTCATCCAC CTGGAGTCCA CCCGCGAGCG CTCCGTAACC CCCTACAACC GGGACATCGC CACCATGCCC CTGCTCGCCG AGCTCGCCCG GGACAACAGT TTGCTCGTCG AGTGGGCCTA CACCACCACC CCGCACACCT CCAAGGCCAT AACCTCCGTG AACACCGGGC TCTACCCCCA CCCGGACACC GAGATCGTGG AGGCCCGTCC CGGGGCCATC CCGGCGCCGG GGATCGCCGC GCTGCTGGCC GGGCAGGGCT ACCGCACCGC CTGGTTCCAG TCGGCCACCG AGAAGTTCGA GAACCGGGCG CAGCTGGTGA AGAACTTCGG CTACGGGCAC TTCCAGGCCT TCGAGGACAT GAGCACCGAG GGCTTCCAGC GCTCCAACTA CCTCGGGTAC GAGGACGACA TCATGCTCGG TCCCAGCCGC CGCTGGCTGG AGGAGAACGC CTCCTCTCCC ACCCTCGTCA TGTACCTCGG GGTCACCCCG CACCACCAGT ACCTGGTCCC CGACCGCTAC GGGCGCCGCC GGTTCTCGGG GGAGGAGATG CTCAACCGCT ACCTCAACAA CGTCCGCTAC GACGACTTCT GGGTGCGCAA CATCCTCCGG CAGTACAGGG AGCTCGGGCT CTACGAGGAC ACCATCTTCG TGATCTACGG CGACCACGGG GAGGCCTTCG GCGAGCACGG GCTCAAGGGG CACGACCCCA TACCCTACGA GGAGGTGCTG CGGGTCCCCC TGATCATCCA CGACCCCCAG GGCTTCGACG GCGGGGCGAG GATCGAGGGC CCGGTCCAGC TCATAGACTT CCCGCCGACC ATCGTGGACC TGCTCGGCTT CAGGGTCGCC GGCGGCGAGT ACCTGGGGCG CTCGCTGCTG CGGCCGCCGG AGGAGCGCAC CCTCCTCTTC AGCTGCCGGC CGGACATCAC GGCGATGGCC AGCATCCGGG GCTACGAGAA GTACATCTAC CACTACGACA AGCGGCCCGA GGAGTTCTAC 
GACCTCTCCC GCGACCCCAC CGAGCAGAAC AACCTCGCCT CCCGGGTCGG CCGGCGGGAG CTGCGCCGGC GGCGCGAGGA GCTCCTGGAG TGGCACGCCC GGACGGCCGC GATCTTCGAG GAGCGCCAGC GGCGGGCGTA G
|
Protein sequence | MPQSLQRALL SLLDRRDWFY LLCLLLPLAA YTLLLRLLGL RLQGEAGGLL GTLALLRSDL LFLAGYALLW VGLFAAFRRG PGRWAALGLL HASAVLVVAI STSAYQYLRS TGATLDYSVV AYYLTSFGEA TGAISSEAPL YMWLILAEAL LYAAFGPWAF TRAFLGPRRE GAGEGGAPPR GRGVSRRRFI ASGVGAGAGI LLLRESLLPE AARGQGTSVS RSPVSNLIAT RIEESRMDAA AESVRVTNTL RGIRLEPTFR TRRRHVALIH LESTRERSVT PYNRDIATMP LLAELARDNS LLVEWAYTTT PHTSKAITSV NTGLYPHPDT EIVEARPGAI PAPGIAALLA GQGYRTAWFQ SATEKFENRA QLVKNFGYGH FQAFEDMSTE GFQRSNYLGY EDDIMLGPSR RWLEENASSP TLVMYLGVTP HHQYLVPDRY GRRRFSGEEM LNRYLNNVRY DDFWVRNILR QYRELGLYED TIFVIYGDHG EAFGEHGLKG HDPIPYEEVL RVPLIIHDPQ GFDGGARIEG PVQLIDFPPT IVDLLGFRVA GGEYLGRSLL RPPEERTLLF SCRPDITAMA SIRGYEKYIY HYDKRPEEFY DLSRDPTEQN NLASRVGRRE LRRRREELLE WHARTAAIFE ERQRRA
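The sequence fields above are printed in space-separated blocks of ten. As a rough sketch of how the GC content and protein translation could be re-derived from the gene sequence, the snippet below strips the spacing, computes the G+C fraction, and translates codons until the first stop. It assumes the codon assignments of translation table 11, which match the standard genetic code (the tables differ only in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases of the gene sequence above, with the original block spacing.
    print(translate("ATGCCGCAGA GCCTGCAGAG A"))  # MPQSLQR
```

Passing the full Gene sequence field to `translate` and `gc_fraction` would allow a comparison with the listed Protein sequence and GC content; the result depends on the sequence being copied completely, including the portion that continues on the following line.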