Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rxyl_0770 |
Symbol | |
ID | 4116906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | - |
Start bp | 806321 |
End bp | 807781 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638035554 |
Product | sulfatase |
Protein accession | YP_643550 |
Protein GI | 108803613 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.119074 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGCAG AGCGCCCGAA CATCGTCGTC ATCGTCTCGG ACACGTTTCG GCGGGACCAT CTGGGGGCTT ATGGGAACCC CTGGATCCGG ACGCCCAACC TGGACGCCCT CGCGCGCTCC TCGGTCGTCT TTGAGCGGCA CGTGATCTCC TCCTTCCCCA CCATGCCCGC CCGGGCGGAC ATCCTCACGG GGACCTTCTC CTACACCTTC ATGGGCTGGG AGCCCCTGCC CAGGGACCTG CCCACCCTCC CCGGCCTGCT CTCCGAGGCC GGCTACCTCA CCATGGGGAT CGTCGACACC CCCTTCTTCG TCAGGAACGG CTACGGCTAC GACCGGGGCT TCGACGACTT CATCTGGGTC AGGGGACAGG GCGACGACAC CCGGCCGCAG GAGCGCTCCG ACTGCCGCTC CACCTGGCGC TACGAGGCCG ACAGGATGGT GGCGCGCACC ATGACCGAGG CCGAGCGCTG GCTCGAGCGG CACCACCGGG AGCGCTTCTT CCTCTACGTG GACACCTGGG ACCCCCACGA GCCGTGGGAC GCCCCCGACT ACTACACCCG GCTCTACCGC CCGGACTACG ACGGGCGCAA GATCTACCCG GCCTACGGCC GGTGGGAGGA GGTCGGGCTC TCCGAGGAGG ACGTGCGGGT GGCGCACGCC ACCTACTGCG GCGAGGTCAC CATGGTGGAC CTGTGGGTCG GCCGGCTGCT CGCCAAGCTC GACGTCCTCG GGCTCCGGGA GAACACCGCC GTCTTCTTCC TCTCCGACCA CGGCTTCTAC TTCGGCGAGC ACGGGTACTT CGGGAAGGCC GAGTGGGTCC ACGACCCGGA CGCGGTGGTC TCCGAGGACT CCGTGCTCCC CGACTGGCTC CCCGAGTCCT GGCTGCTCAC CGTGGGGTGG TCCCCCCTGT ACTCCGAGCT GACCCGGGTG CCGCTCATCG CCCGCGTCCC CGGCGTCCCC CCGGGCCGCC GGGACACCAT GACCACCCAC CCCGACCTCG CCCCCACCCT GCTGGAGCTC GCGGGGGTGG AGAGGCCGCA GCGCGTGCAG GGCGAGTCCT TCCTCGGCGT GTTGCGCGGC GAGCGGGAGG AGCACCGGCG CTTCGTCATC AGCTCCTGGC CCCTGTACTT CGCCGAGGGC GAGCTCACCA CCGCCGTGGA CTCCAGGCCC CGGCGCATAG CCAGCTACAT GCCGCTCACC GTCACCACCC GCGAGCGCTC GCTCATCCTC GGGGGGCCGG AGGACGAACC CGAGCTCTAC GACCTCGGAC GGGACCCGCA GGAGAAGCAC AACGTCTGGC CGGAAGCGCC GGGGGAGGGC GTGCGCCTCG CCGGGGAGGC CGTCTCCTTC CTGGAGCGGC TCGGAACCCC GGAACGCCAC CTCGAGCCCC GGCGGGCGGC GCTGGAGAGG CTCCGCGGGA GCATCCCCGC CAGCCGCGAC CTGGCAGGGG AGGCGAGCTA G
|
Protein sequence | MAAERPNIVV IVSDTFRRDH LGAYGNPWIR TPNLDALARS SVVFERHVIS SFPTMPARAD ILTGTFSYTF MGWEPLPRDL PTLPGLLSEA GYLTMGIVDT PFFVRNGYGY DRGFDDFIWV RGQGDDTRPQ ERSDCRSTWR YEADRMVART MTEAERWLER HHRERFFLYV DTWDPHEPWD APDYYTRLYR PDYDGRKIYP AYGRWEEVGL SEEDVRVAHA TYCGEVTMVD LWVGRLLAKL DVLGLRENTA VFFLSDHGFY FGEHGYFGKA EWVHDPDAVV SEDSVLPDWL PESWLLTVGW SPLYSELTRV PLIARVPGVP PGRRDTMTTH PDLAPTLLEL AGVERPQRVQ GESFLGVLRG EREEHRRFVI SSWPLYFAEG ELTTAVDSRP RRIASYMPLT VTTRERSLIL GGPEDEPELY DLGRDPQEKH NVWPEAPGEG VRLAGEAVSF LERLGTPERH LEPRRAALER LRGSIPASRD LAGEAS
|
| |