Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_0496 |
Symbol | |
ID | 5082855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 490675 |
End bp | 492180 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640482050 |
Product | sulfatase |
Protein accession | YP_001166707 |
Protein GI | 146276548 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.261478 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.387641 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAAGC CCAACATCCT CATCCTGATG GTCGATCAGC TCAACGGGAC GCTCTTCCCC GACGGGCCGG CCGACTGGCT GCACGCGCCG AACCTGAAGC GGCTGGCCGA GCGATCGACG CGGTTCGCGA ACAGCTACAC GGCCAGCCCG CTCTGCGCGC CCGCCCGCGC CTCCTTCATG TCCGGCCAGC TGCCCTCGCG GACCCGCGTC TATGACAATG CGGCCGAATT CGCCTCGGAC ATCCCGACCT TTGCCCACCA TCTGCGGCGC GCGGGCTACC AGACGACCCT TTCGGGCAAG ATGCATTTCG TGGGCCCCGA CCAGCTCCAC GGGTTCGAGG AACGGTTGAC CACCGACATC TACCCGGCCG ATTTCGGCTG GACGCCGGAC TATCGCAAGC CGGGCGAGCG GATCGACTGG TGGTATCACA ACCTGGGCTC CGTGACCGGC GCCGGTGTGG CCGAGATCAC CAACCAGCTC GAATATGACG ATGACGTGGC GCATCAGGCG ATCCAGAAGC TCTACGACCT GTCGCGCGGC GCCGATCCGC GGCCCTGGTG CCTGACGGTC AGCTTCACCC ATCCCCACGA TCCGTTCGTG GCGCGGCGGA AATACTGGGA TCTCTACGAG GATCATCCGA TGCTCGAGCC GCCCGAGCCG ATCCCCTACG CCCAGCAGGA CAGCCACTCG CAGCGCCTGA TGGACGCCTG CGATTTCGGC GCGTTCGAGA TCACGCGCGA CCATGTGCGC CGGGCGCGGC AGGGCTATTT CGCCAACATC TCCTACATCG ACGACAAGAT CGGCGAGATC CTCGCGGTGC TCGACGCCTC GCGGCAGGAG GCGATCGTGG TCTTCGTCTC GGACCACGGA GAGATGCTGG GCGAGCGCGG CCTGTGGTTC AAGATGAGCT TCCACGAGGG CTCGGCCCGG GTGCCGCTGA TGATGGCGGC TCCGGGTCTG CCGGCGGGCC GGATCGACGC GCCGGTTTCG ACCATCGACG TGACGCCCAC GCTCTGCGCG CTCGCGGGGA TCGACATGGG CGAGATCGCG CCCTGGACCG ACGGGGTCAG TCTCGTGCCG CTGGCGCAGG GCTCCGGGCG TCCCGAGCCG GTGCTGATGG AATATGCCGC CGAAGGATCG GTGACGCCTC TCGTCGCGAT CCGGGACGGG CGGTGGAAGC ATGTCCGGTG CCTCGCCGAT CCCGAGCAGC TGTTCGACCT TGAGGCCGAC CCTGCCGAAC GGACGAACCT TGCGTCCGAT CCCGCCCATG CCGGGACGCT GGCCCGGCTC CGGGCGCTGT CCGAGGCGCG GTGGGATCTC GCCGCCTTCG ACGCGGCGGT GCGGGAGAGT CAGGCCCGGC GGTGGGTGGT CTATGAGGCT CTGCGGCAGG GCGGCTACTA TCCGTGGGAC TATCAGCCGC TGCAGAAGGC CTCCGAGCGT TACATGCGCA ACCACATGGA TCTGAACATC CTGGAGGAGA GCAAGCGCTT CCCCCGCGGC GAGTGA
|
Protein sequence | MTKPNILILM VDQLNGTLFP DGPADWLHAP NLKRLAERST RFANSYTASP LCAPARASFM SGQLPSRTRV YDNAAEFASD IPTFAHHLRR AGYQTTLSGK MHFVGPDQLH GFEERLTTDI YPADFGWTPD YRKPGERIDW WYHNLGSVTG AGVAEITNQL EYDDDVAHQA IQKLYDLSRG ADPRPWCLTV SFTHPHDPFV ARRKYWDLYE DHPMLEPPEP IPYAQQDSHS QRLMDACDFG AFEITRDHVR RARQGYFANI SYIDDKIGEI LAVLDASRQE AIVVFVSDHG EMLGERGLWF KMSFHEGSAR VPLMMAAPGL PAGRIDAPVS TIDVTPTLCA LAGIDMGEIA PWTDGVSLVP LAQGSGRPEP VLMEYAAEGS VTPLVAIRDG RWKHVRCLAD PEQLFDLEAD PAERTNLASD PAHAGTLARL RALSEARWDL AAFDAAVRES QARRWVVYEA LRQGGYYPWD YQPLQKASER YMRNHMDLNI LEESKRFPRG E
|
| |