Gene Information |
Locus tag | RSP_0594 |
Symbol | |
ID | 3718008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | + |
Start bp | 2334500 |
End bp | 2336005 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640071805 |
Product | putative choline sulfatase |
Protein accession | YP_353669 |
Protein GI | 77464165 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
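The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative only and are not part of any database tool.

```python
# Values copied from the Gene Information table.
START_BP = 2334500
END_BP = 2336005


def gene_length_bp(start, end):
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(length_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 1506 501
```

Both results agree with the Gene Length (1506 bp) and Protein Length (501 aa) rows.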
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCAGC CCAACATCCT CATCCTGATG GTCGATCAGC TGAACGGAAC GCTGTTTCCC GACGGGCCGG CCGACTGGCT GCACGCGCCG AACCTGAAGC GGCTCGCGGA CCGCTCTCTC CGGTTTTCGA ACAGCTATAC GGCCAGCCCC CTCTGCGCCC CGGCGCGGGC CTCCTTCATG TCGGGTCAGC TGCCCTCGCG CACCCGGGTC TATGACAATG CGGCCGAGTT CGCCTCGGAC ATTCCCACCT TCGCACATCA TCTGCGGCGC GCCGGCTATC AGACCACGCT CTCGGGCAAG ATGCATTTCG TGGGGCCCGA CCAGCTGCAC GGGTTCGAGG CGCGGCTGAC GACCGACATC TATCCCGCCG ACTTCGGCTG GACGCCGGAC TATCGCAAGC CGGGCGAGCG GATCGACTGG TGGTATCACA ACTTGGGCTC CGTCACCGGC GCGGGCGTGG CCGAGATCAC CAACCAGCTC GAATATGACG ACGACGTGGC GCATCAGGCG ATCCAGAAGC TCTACGACCT GTCGCGCGGC GCCGATCCGC GGCCCTGGTG CCTCACGGTC AGCTTCACCC ATCCGCACGA TCCCTTCGTG GCGCGCCGGA AATACTGGGA CCTCTACGAG GACCATCCAA TGCTCGAGCC GCCCGCCTCC ATTCCCTACG AAGCCCAGGA CAGCCACTCG CGCCGCCTGA TGGATGCCTG CGATTTCAAG GCGTTCGACA TCACGCCCGA GCAGGTGCGC CGCGCGCGGC AGGGCTACTT CGCCAATATC TCCTATGTGG ACGACAAGAT CGGCGAGATC CTCGCCGTAC TCGAGGCCTC GCGTCAGGAG GCCATCGTCG TCTTCGTCTC GGATCACGGC GAGATGCTGG GCGACCGCGG CCTCTGGTTC AAGATGAGCT TCTTCGAGGG CTCGGCCAGG GTGCCGCTGA TGATCGCGGC TCCCGGCCTG CCCGCCGGGC GGATCGCGGC GCCGGTTTCG ACCATCGACG TGACGCCCAC GCTCTGCGCG CTGGCCGGCA TCGACATGGG CGAGATCGCG CCCTGGACCG ATGGCATCAG TCTGGTCCCG CTGGCCGAAG GCGAACTGCG TCCCGAGCCC GTGTTCATGG AATATGCGGC CGAGGGTTCG ATCACGCCGC TGGTGGCGAT CCGCGAGGGG CGCTGGAAAT ATGTCCGCTG TCTCGCAGAT CCCGAACAGC TGTTCGATAT CGAGGCGGAT CCCGGCGAGC GGACGGATCT GGCGGCCGAT CCGGCCCATG CCGACACGCT GGCGCGGCTG CGGTCCCTCT CGCAGGCGCG CTGGGATCTC GCGGCCTTCG ACGCCGCCGT GCGCGAAAGC CAGGCCCGGC GCTGGATCGT CTACGAGGCG CTGCGGCAGG GCGGCTATTA CCCGTGGGAT TACCAGCCGC TGCAGAAGGC TTCCGAGCGC TACATGCGCA ACCACATGGA TCTGAATATT CTCGAAGAGA GCAAGCGCTT CCCGCGGGGC GAATGA
Protein sequence | MTQPNILILM VDQLNGTLFP DGPADWLHAP NLKRLADRSL RFSNSYTASP LCAPARASFM SGQLPSRTRV YDNAAEFASD IPTFAHHLRR AGYQTTLSGK MHFVGPDQLH GFEARLTTDI YPADFGWTPD YRKPGERIDW WYHNLGSVTG AGVAEITNQL EYDDDVAHQA IQKLYDLSRG ADPRPWCLTV SFTHPHDPFV ARRKYWDLYE DHPMLEPPAS IPYEAQDSHS RRLMDACDFK AFDITPEQVR RARQGYFANI SYVDDKIGEI LAVLEASRQE AIVVFVSDHG EMLGDRGLWF KMSFFEGSAR VPLMIAAPGL PAGRIAAPVS TIDVTPTLCA LAGIDMGEIA PWTDGISLVP LAEGELRPEP VFMEYAAEGS ITPLVAIREG RWKYVRCLAD PEQLFDIEAD PGERTDLAAD PAHADTLARL RSLSQARWDL AAFDAAVRES QARRWIVYEA LRQGGYYPWD YQPLQKASER YMRNHMDLNI LEESKRFPRG E
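The gene and protein sequences above can be checked against each other, and against the GC content row, with a short script. This is a minimal sketch in plain Python, not the method used by the source database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (it differs in the set of permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGACCCAGC CCAACATCCT CATCCTGATG"
print(translate(prefix))  # MTQPNILILM
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content row.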