Gene Information |
Locus tag | Rxyl_2114 |
Symbol | |
ID | 4114710 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | - |
Start bp | 2142537 |
End bp | 2143991 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 638036900 |
Product | sulfatase |
Protein accession | YP_644870 |
Protein GI | 108804933 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
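The coordinate and length fields above can be cross-checked with a short sketch. It assumes, since the record does not say so, that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2142537
end_bp = 2143991
gene_length_bp = 1455
protein_length_aa = 484


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Report whether the derived values agree with the listed fields.
print(computed_length == gene_length_bp, computed_protein == protein_length_aa)
```

Under these assumptions the derived values match the listed Gene Length and Protein Length.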
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.711307 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCTCCCT CTCGCCGGAA GAGGCGCGCC GCCCTCCTGC TCGGGGCGCT CCTGGTCCTC TCCGCTTTGC TTTCCTGCGC CTGCGGGGAG CGGCGCCGGG GGCAGGGGCG GGCGGTCGGC GGGCCGAACA TCGTGCTTGT CGTGGCCGAC GACCTCGACG TCCGGACCGC GGAGCGCCTG CCGCGCCTGC GCCGGCTCCT CGCCGACCGC GGGACGAGCT TCGAGAACGC CTTCGTGACG GACGCCCTGT GCTGCCCCTC GCGGGCGACG ATCCTGCGCG GCCAGTACGC CCACAACCAC GGGATCCGGG GCAACGAGCC CCCGCGCGGC GGCTTCGAGA GGTTCCGGCG GCTCGAGGGC TCGACGGTGG CCACCTGGCT GAAGGCCGCC GGTTACCGGA CGGCGTACTT CGGCAAGTAC ATGAACGGCT ACGGCAGGAG CGAGACCCGC GTGCCCCCGG GGTGGGACGA GTGGCACGCG GTGGCCGGGA ACTACCTGAG CAGCTGGTAC AACGATAACG GCCGCGTCCG CTACTACAGC CCCGCCCTCT ACAACGACAC CGACCTCATC GCCGAAAAGG CCACCTCTTA CCTGAGGAGG ACCGCCGGGA GGGGGGCGCC GTTCTTCGTG GTGCTGGCGC CGCGGGCCCC GCACCAGCCC GCCGTCCCGC CGCCCCGCTA CGCGGACGCC TTCCCGGAGG CCCCCCTCCC CCGCGGCCCC TCCTTCGACG AGCGGGACGT CTCGGACAAG CCCCGCTGGG TGCGGGACAA CCCCCGCCTG GGCCGAAAGA AGCTGGAGTT TCTGGGGTCG CTCTACCGGC GGCGGCTGCG CTCGATGCTC GCGGTGGAGG ATCTGGTGGA GCGCCTGCTG CGCACCCTCC GCGAGAGCGG GCAGCTCGAG AACACCTACA TCTTCTTCAC CTCGGACAAC GGCTTCCACA TGGGCCACCA CCGGCTGCCG GAGGGGAAGT GGACCGCCTA CGAGGAGGAC ATCAGGGTTC CGCTCCTGGT GCGGGGGCCT GGGGTGCCCG AGGGACGGGT GCTCCCGCAC CTGGTGCTGA ACAACGACCT TGCGCCGACC TTTGGCCGGC TTGGGGGGGC GAGGGTTCCG GGGTATGTGG ACGGGCGCTC TCTTGTTTTG CTGCTGCGGC GGGACCCTCC CTCCCGGCAT AGCTGGCGCT CGGCCTTTCT TGTGGAGGCG GCCTCGCACG GGGAGTCGGG GAGGCCGGGG CTCGTGGCGG TGAGGACGCG CGGGCACCTG TACGTGGAGT ACGAGAGCGG GGAGAGGGAG CTCTACGACC TGCGCCGCGA CCCCCACCAG CTCCGGAACC TCTACCGGCG CGCCCCCCGG GGGCTCGTGC GGGACCTGAA GGGGCGGCTC GAGGCGCTCG CGGACTGCTC GGGGGAGGGA TGCCGGGCGG CCGAGGACGG CCCGGGACGG GACGGGGGGC GCTAA
|
Protein sequence | MAPSRRKRRA ALLLGALLVL SALLSCACGE RRRGQGRAVG GPNIVLVVAD DLDVRTAERL PRLRRLLADR GTSFENAFVT DALCCPSRAT ILRGQYAHNH GIRGNEPPRG GFERFRRLEG STVATWLKAA GYRTAYFGKY MNGYGRSETR VPPGWDEWHA VAGNYLSSWY NDNGRVRYYS PALYNDTDLI AEKATSYLRR TAGRGAPFFV VLAPRAPHQP AVPPPRYADA FPEAPLPRGP SFDERDVSDK PRWVRDNPRL GRKKLEFLGS LYRRRLRSML AVEDLVERLL RTLRESGQLE NTYIFFTSDN GFHMGHHRLP EGKWTAYEED IRVPLLVRGP GVPEGRVLPH LVLNNDLAPT FGRLGGARVP GYVDGRSLVL LLRRDPPSRH SWRSAFLVEA ASHGESGRPG LVAVRTRGHL YVEYESGERE LYDLRRDPHQ LRNLYRRAPR GLVRDLKGRL EALADCSGEG CRAAEDGPGR DGGR
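The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch for removing that grouping and computing a GC fraction follows. The helper names are hypothetical, and only the first two blocks of the gene sequence are used, so the result is not the whole-gene figure listed under GC content.

```python
def strip_grouping(seq_text):
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the Gene sequence field, copied from the record.
fragment = strip_grouping("GTGGCTCCCT CTCGCCGGAA")
fragment_gc = gc_fraction(fragment)
starts_with_gtg = fragment.startswith("GTG")

print(len(fragment), fragment_gc, starts_with_gtg)
```

Applied to the full Gene sequence field, the same two helpers give a whole-gene GC fraction that can be compared with the GC content field. The leading GTG is one of the alternative start codons permitted by translation table 11, which is consistent with the protein sequence beginning with M.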