Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rxyl_3110 |
Symbol | |
ID | 4114909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | - |
Start bp | 3119336 |
End bp | 3120814 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638037877 |
Product | sulfatase |
Protein accession | YP_645829 |
Protein GI | 108805892 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCTTTC TGCCGCCGGC CGCATTGCGG GTGCGGCGCC TCCCTGCGAA GCTGGTGCTC GCCGGGCTCC TCGTCGCCGC GGTCTGCGTC CTCTCGTCGG GGGACCGGCA AGTGGTTCTG GCCCGGAAAC CCGAGCGCCC CAACCTCATC CTCATCCTCA CCGACGACCA GACGCCGGGT GATGTCGGGT ACATGCCTGG GGTGAGAGCG CTGCTCCGGG ACCGGGGAAC CACCTTCCGC AACGCCTTCG TCACCGACTC CGTCTGCTGC CCCTCGCGGG CGACGATCCT GCGCGGCCAG TACGCCCACA ACCACGAGAT AGCCGGCGCC AAACCGCCCG CGGGCGGTTT CGAGAAGTTC CGGCGGCTCG GGCTCGAGAG GTCCACCGTG GCCACCTGGC TCAAGGCCCG GGGCTACGCG ACGGGCTTCG TAGGGAAGTA CCTCAACGGC TACCTCAGGA CCACCCACGT CCCTCCGGGC TGGGACCGGT GGTACGGCTT CAACGGCGGC GGGTACCACG ACTTCACCCT GAACGAGAAC GGGCGCAACG TCTCCTACCG GGGCCCCTCG AGCTACCAGA CCGACGTCCT CGGCCGGAAG GCCCTCGGCT TCGTCCGGTG GGCGGCCCGG AGGGACAGAC CCTTCTTCCT GCACCTCTCC CCGTGGGCGC CACACGGTCC GGCGGAGCCC GCCCCCCGGC ACGCCCGGCT GTTCGCCCGG ACGCCGCTGC CCCGCCCGCC CTCCTTCGAC GAGCGGGACG TCTCGGACAA GCCCCGCTGG GTGCGGGACA ACCCCCGCCT GGGCCGGGAG GAGGTGCGGG AGATGGGACG GCTCTACCGC AACAGGCTCC GCACCCTGAG GGCGGTCGAC GAGTTGGTGG GCCGCCTTGT GGCCGCCCTC CGCGAGAGCG GGCAGCTCGA GAACACCTAC ATCTTCTTCA CCTCGGACAA CGGCTTCCAC ATGGGCCACC ACCGGCTGCC GGAGGGGAAG TGGACCGCCT ACGAGGAGGA CATCAGGGTT CCGCTCCTGG TGCGGGGGCC TGGGGTGCCC GAGGGACGGG TGCTCCCGCA CCTGGTGCTG AACAACGACC TTGCGCCGAC CTTTGGCCGG CTTGGGGGGG CGAGGGTTCC GGGGTATGTG GACGGGCGCT CTCTTGTTTT GCTGCTGCGG CGGGACCCTC CCTCCCGGCG TAGCTGGCGC TCGGCCTTTC TTGTGGAGGC GAAGCGGGAT GGCGCCAACC GGCGTCCCGC CTACCGGGCG CTTCGCTCCG TCGGACACCT GTACGTGGAG TACGAGAGCG GGGAGAGGGA GCTCTACGAC CTGCGCCGCG ACCCCCACCA GCTCCGGAAC CTCGCACCGC GTCTGGATGG GGAGAGCGCC CGGAAGCTCC GCTCGCGGCT TGCTAAATTG AGCGGGTGCG CGGAAGAGGA GTGCAGAACC CTGGAGAACC GGAAGCCCGT GTGGCCGGAG GTCCGGTGA
|
Protein sequence | MSFLPPAALR VRRLPAKLVL AGLLVAAVCV LSSGDRQVVL ARKPERPNLI LILTDDQTPG DVGYMPGVRA LLRDRGTTFR NAFVTDSVCC PSRATILRGQ YAHNHEIAGA KPPAGGFEKF RRLGLERSTV ATWLKARGYA TGFVGKYLNG YLRTTHVPPG WDRWYGFNGG GYHDFTLNEN GRNVSYRGPS SYQTDVLGRK ALGFVRWAAR RDRPFFLHLS PWAPHGPAEP APRHARLFAR TPLPRPPSFD ERDVSDKPRW VRDNPRLGRE EVREMGRLYR NRLRTLRAVD ELVGRLVAAL RESGQLENTY IFFTSDNGFH MGHHRLPEGK WTAYEEDIRV PLLVRGPGVP EGRVLPHLVL NNDLAPTFGR LGGARVPGYV DGRSLVLLLR RDPPSRRSWR SAFLVEAKRD GANRRPAYRA LRSVGHLYVE YESGERELYD LRRDPHQLRN LAPRLDGESA RKLRSRLAKL SGCAEEECRT LENRKPVWPE VR
|
| |