Gene Information |
Locus tag | Rsph17029_2247 |
Symbol | |
ID | 4897353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 2379336 |
End bp | 2380841 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640112841 |
Product | sulfatase |
Protein accession | YP_001044122 |
Protein GI | 126463008 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
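
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete and ends in a single stop codon (the record lists the gene as unspliced, and the gene sequence ends in TGA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2379336
END_BP = 2380841


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(length_bp: int) -> int:
    """Codon count minus one terminal stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1506, matches "Gene Length"
print(expected_protein_length(length_bp))  # 501, matches "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length (1506 bp) and Protein Length (501 aa) fields reported in the record.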
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.466267 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAGC CCAACATCCT CATCCTGATG GTCGATCAGC TGAACGGAAC GCTGTTTCCC GACGGGCCGG CCGACTGGCT GCACGCGCCG AACCTGAAGC GGCTCGCGGA ACGCTCTCTC CGGTTTTCGA ACAGCTATAC GGCCAGCCCC CTCTGCGCCC CGGCGCGGGC CTCCTTCATG TCGGGTCAGC TGCCCTCGCG CACCCGGGTC TATGACAATG CGGCCGAGTT CGCCTCGAAC ATTCCCACCT TCGCACATCA TCTGCGGCGT GCCGGCTATC AGACGACGCT CTCGGGCAAG ATGCATTTCG TGGGACCCGA CCAGCTGCAC GGGTTCGAGG AGCGGCTGAC GACCGACATC TATCCCGCCG ACTTCGGCTG GACGCCGGAC TATCGCAAGC CGGGCGAGCG GATCGACTGG TGGTATCACA ACCTGGGCTC CGTCACCGGC GCGGGCGTGG CCGAGATCAC CAACCAGCTC GAATATGACG ACGACGTGGC GCATCAGGCG ATCCAGAAGC TCTACGACCT GTCGCGCGGC GCCGATCCGC GGCCCTGGTG CCTGACGGTC AGCTTCACCC ATCCCCACGA TCCCTTTGTG GCGCGCCGGA AATACTGGGA CCTCTACGAG GACCATCCGA TGCTCGAGCC GCCCGCCTCC ATTCCCTACG AAGCTCAGGA CAGCCACTCG CGCCGCCTGA TGGATGCCTG CGATTTCAAG GCGTTCGACA TCACGCCCGA GCAGGTGCGC CGCGCGCGGC AGGGCTACTT CGCCAATATC TCCTATGTGG ACGACAAGAT CGGCGAGATC CTCGCCGTGC TCGAAGCCTC GCGTCAGGAG GCCATCGTCG TCTTCGTCTC GGATCACGGC GAGATGCTGG GCGACCGCGG TCTCTGGTTC AAGATGAGCT TCTTCGAGGG CTCGGCCCGG GTGCCGCTGA TGATCGCGGC TCCCGGCCTG CCCGCCGGGC GGATCGCGGC GCCGGTCTCG ACCATCGACG TGACGCCCAC GCTCTGCGCG CTGGCCGGCA TCGACATGGG CGAGATCGCG CCCTGGACCG ATGGCGTCAG TCTGGTCCCG CTGGCCGAAG GCGAACTGCG TCCCGAGCCC GTGTTCATGG AATATGCGGC CGAGGGTTCG ATCACGCCGC TGGTGGCGAT CCGCGAGGGG CGCTGGAAAT ATGTCCGCTG CCTCGCAGAT CCCGAACAGC TGTTCGATAT CGAGGCGGAT CCCGGCGAGC GGACGGATCT GGCGGCCGAT CCGGCCCATG CCGACACGCT GGCGCGGCTG CGGTCCCTCT CGCAGGCGCG CTGGGATCTC GCGGCCTTCG ACGCCGCCGT GCGCGAAAGC CAGGCCCGGC GCTGGATCGT CTACGAGGCG CTGCGGCAGG GCGGCTATTA CCCCTGGGAT TACCAGCCGC TGCAGAAGGC TTCCGAGCGC TACATGCGCA ACCACATGGA TCTGAATATT CTCGAAGAGA GCAAGCGCTT CCCGCGCGGC GAATGA
|
Protein sequence | MTQPNILILM VDQLNGTLFP DGPADWLHAP NLKRLAERSL RFSNSYTASP LCAPARASFM SGQLPSRTRV YDNAAEFASN IPTFAHHLRR AGYQTTLSGK MHFVGPDQLH GFEERLTTDI YPADFGWTPD YRKPGERIDW WYHNLGSVTG AGVAEITNQL EYDDDVAHQA IQKLYDLSRG ADPRPWCLTV SFTHPHDPFV ARRKYWDLYE DHPMLEPPAS IPYEAQDSHS RRLMDACDFK AFDITPEQVR RARQGYFANI SYVDDKIGEI LAVLEASRQE AIVVFVSDHG EMLGDRGLWF KMSFFEGSAR VPLMIAAPGL PAGRIAAPVS TIDVTPTLCA LAGIDMGEIA PWTDGVSLVP LAEGELRPEP VFMEYAAEGS ITPLVAIREG RWKYVRCLAD PEQLFDIEAD PGERTDLAAD PAHADTLARL RSLSQARWDL AAFDAAVRES QARRWIVYEA LRQGGYYPWD YQPLQKASER YMRNHMDLNI LEESKRFPRG E