Gene Information |
Locus tag | Rxyl_1235 |
Symbol | |
ID | 4116238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | - |
Start bp | 1254691 |
End bp | 1256031 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638036024 |
Product | sulfatase |
Protein accession | YP_644012 |
Protein GI | 108804075 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
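
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the last codon of the gene is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: both are 1-based and inclusive.
START_BP = 1254691
END_BP = 1256031


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature spanning start..end inclusive."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS.

    Assumption: the final codon is a stop codon and is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(f"Gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length (1341 bp) and Protein Length (446 aa) fields of the record.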
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAACGTCA TCCTGATCAT CCTGGACTCG CTGCGCAGAG ACCACGTGGG CGACTACGGG AACCGCTGGA TCCTCACCCC CAACCTGGAC GCCCTCTCGG CCGGGAGCCT GCGCTTCGAC AGGGCCTACC CGGAGTCCCT GCCCTCCATC CCGGCCCGCC GGGCGATACA CACCGGCCTG AGGACGTGGC CCTTCAGAGG CTGGCACAGG GCCAGCCGGC AGGACGTGGG GCTCTACGGC TGGCAGCCGG TGCCCGAGGA CCAGACGACG CTCGCCGAGA TCCTGCACGG CGCCGGGTAC GCGACGCTGT TCGTCACCGA CACGCTGCAC CAGTTCCGCC CCGGCTACAA CTTCCACCGC GGCTTCGACG TCTTCCACTT CGTCCGGGGG CAGGAGCGGG ACTTCTACCG CCCGGCCTCC CTGTGCCCCA AAGGGGCGCT AGAGGGGGTC CTGCTCGGCG GCCCCCAGCG GGAGCACGCC ACCCGCATCA TGCGCCAGTA CCTGGCCAAC ACCCGCGGCC GCCGGCGCGA GGAGGACTGG TTCGCCCCCC GGGTCTTCCT GAAGGCCATG GAGTTCCTGA GGCCCGCCCG GGAGGCACAG CCCTTCTTCC TGCTCGTCGA CGCCTATGAC CCGCACGAGC CCTGGGACCC GCCCGACCAC TACGTCCGGC TCTACGACGA GGGCTACCGG GGCCCCGAGC CGCAGATAGC CAGCAGCGGG GACAGCGGCT GGCTCGAGGA GCGGCAGCTC GAGAGGATGC GCGCGCTCTA CGCCGCCGAG GTGACCATGG TGGACCGCTG GCTCGGCAAC TTCCTGGACC GGGCGGAGGA GCTGGGGCTG CTGGAGAGCA CCCTCGTCCT GCTCCTCGCG GACCACGGGC ACGCCTTCGG GGAGCACGGC ATAGCCGGGA AGGTCCCCTC GGCCCTCTAC CCGGAGCTCC TGGACATCCC CTTTCTTTTG CGCCACCCGG AAGGCCGGGG CGCCGGAAGG AGCGCGGCGT ACCCCGCCTC CACCCACGAC GTGGCCCCCA CCATCCTCGG CGCTCTGGGG CTGGAGCCAC CCTCGCCCGT GGACGGCGCG GACCTCACCC CCCTGCTCGA GGGGAGGGAG CCGGACCGGG AGCGGCCGCA TCTCACCGCC GGCTACCACG ACCACGCCTG GGCCAGAGAC GAGGACTACG CCCTGATCGT CCGCAGCGAC GGCTCCGAGC CCCGCCTGTT CGACCTGCGG GAGGACCCGG AGCAGCGGCG CGACGTGGCC TCCGAACGCC CCGAGGTCGC AAGGAGGATG TTCGAGGAGT ACATCCTCGC GGACGCGAGC GGCCCGCTCC CCGAGTACTA G
Protein sequence | MNVILIILDS LRRDHVGDYG NRWILTPNLD ALSAGSLRFD RAYPESLPSI PARRAIHTGL RTWPFRGWHR ASRQDVGLYG WQPVPEDQTT LAEILHGAGY ATLFVTDTLH QFRPGYNFHR GFDVFHFVRG QERDFYRPAS LCPKGALEGV LLGGPQREHA TRIMRQYLAN TRGRRREEDW FAPRVFLKAM EFLRPAREAQ PFFLLVDAYD PHEPWDPPDH YVRLYDEGYR GPEPQIASSG DSGWLEERQL ERMRALYAAE VTMVDRWLGN FLDRAEELGL LESTLVLLLA DHGHAFGEHG IAGKVPSALY PELLDIPFLL RHPEGRGAGR SAAYPASTHD VAPTILGALG LEPPSPVDGA DLTPLLEGRE PDRERPHLTA GYHDHAWARD EDYALIVRSD GSEPRLFDLR EDPEQRRDVA SERPEVARRM FEEYILADAS GPLPEY
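
The sequences above are printed in space-separated blocks of ten characters, so they need whitespace removed before any calculation. As a rough sketch of how a GC content figure like the one in the record could be recomputed, the hypothetical helper `gc_percent` below strips the spacing and counts G and C bases. It assumes the input contains only unambiguous nucleotide letters; the short example string is just the first two blocks of the gene sequence, not the full gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string.

    Whitespace (such as the ten-base block spacing used in the record)
    is ignored. Assumption: only A, C, G and T appear in the input.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence, used only as a small example.
first_blocks = "ATGAACGTCA TCCTGATCAT"
print(f"GC content of the first 20 bases: {gc_percent(first_blocks):.1f}%")
```

Passing the complete Gene sequence field to `gc_percent` gives a value that can be compared against the GC content field of the record.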