Gene Rsph17025_0496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_0496 
Symbol 
ID5082855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp490675 
End bp492180 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content68% 
IMG OID640482050 
Productsulfatase 
Protein accessionYP_001166707 
Protein GI146276548 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.261478 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.387641 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAGC CCAACATCCT CATCCTGATG GTCGATCAGC TCAACGGGAC GCTCTTCCCC 
GACGGGCCGG CCGACTGGCT GCACGCGCCG AACCTGAAGC GGCTGGCCGA GCGATCGACG
CGGTTCGCGA ACAGCTACAC GGCCAGCCCG CTCTGCGCGC CCGCCCGCGC CTCCTTCATG
TCCGGCCAGC TGCCCTCGCG GACCCGCGTC TATGACAATG CGGCCGAATT CGCCTCGGAC
ATCCCGACCT TTGCCCACCA TCTGCGGCGC GCGGGCTACC AGACGACCCT TTCGGGCAAG
ATGCATTTCG TGGGCCCCGA CCAGCTCCAC GGGTTCGAGG AACGGTTGAC CACCGACATC
TACCCGGCCG ATTTCGGCTG GACGCCGGAC TATCGCAAGC CGGGCGAGCG GATCGACTGG
TGGTATCACA ACCTGGGCTC CGTGACCGGC GCCGGTGTGG CCGAGATCAC CAACCAGCTC
GAATATGACG ATGACGTGGC GCATCAGGCG ATCCAGAAGC TCTACGACCT GTCGCGCGGC
GCCGATCCGC GGCCCTGGTG CCTGACGGTC AGCTTCACCC ATCCCCACGA TCCGTTCGTG
GCGCGGCGGA AATACTGGGA TCTCTACGAG GATCATCCGA TGCTCGAGCC GCCCGAGCCG
ATCCCCTACG CCCAGCAGGA CAGCCACTCG CAGCGCCTGA TGGACGCCTG CGATTTCGGC
GCGTTCGAGA TCACGCGCGA CCATGTGCGC CGGGCGCGGC AGGGCTATTT CGCCAACATC
TCCTACATCG ACGACAAGAT CGGCGAGATC CTCGCGGTGC TCGACGCCTC GCGGCAGGAG
GCGATCGTGG TCTTCGTCTC GGACCACGGA GAGATGCTGG GCGAGCGCGG CCTGTGGTTC
AAGATGAGCT TCCACGAGGG CTCGGCCCGG GTGCCGCTGA TGATGGCGGC TCCGGGTCTG
CCGGCGGGCC GGATCGACGC GCCGGTTTCG ACCATCGACG TGACGCCCAC GCTCTGCGCG
CTCGCGGGGA TCGACATGGG CGAGATCGCG CCCTGGACCG ACGGGGTCAG TCTCGTGCCG
CTGGCGCAGG GCTCCGGGCG TCCCGAGCCG GTGCTGATGG AATATGCCGC CGAAGGATCG
GTGACGCCTC TCGTCGCGAT CCGGGACGGG CGGTGGAAGC ATGTCCGGTG CCTCGCCGAT
CCCGAGCAGC TGTTCGACCT TGAGGCCGAC CCTGCCGAAC GGACGAACCT TGCGTCCGAT
CCCGCCCATG CCGGGACGCT GGCCCGGCTC CGGGCGCTGT CCGAGGCGCG GTGGGATCTC
GCCGCCTTCG ACGCGGCGGT GCGGGAGAGT CAGGCCCGGC GGTGGGTGGT CTATGAGGCT
CTGCGGCAGG GCGGCTACTA TCCGTGGGAC TATCAGCCGC TGCAGAAGGC CTCCGAGCGT
TACATGCGCA ACCACATGGA TCTGAACATC CTGGAGGAGA GCAAGCGCTT CCCCCGCGGC
GAGTGA
 
Protein sequence
MTKPNILILM VDQLNGTLFP DGPADWLHAP NLKRLAERST RFANSYTASP LCAPARASFM 
SGQLPSRTRV YDNAAEFASD IPTFAHHLRR AGYQTTLSGK MHFVGPDQLH GFEERLTTDI
YPADFGWTPD YRKPGERIDW WYHNLGSVTG AGVAEITNQL EYDDDVAHQA IQKLYDLSRG
ADPRPWCLTV SFTHPHDPFV ARRKYWDLYE DHPMLEPPEP IPYAQQDSHS QRLMDACDFG
AFEITRDHVR RARQGYFANI SYIDDKIGEI LAVLDASRQE AIVVFVSDHG EMLGERGLWF
KMSFHEGSAR VPLMMAAPGL PAGRIDAPVS TIDVTPTLCA LAGIDMGEIA PWTDGVSLVP
LAQGSGRPEP VLMEYAAEGS VTPLVAIRDG RWKHVRCLAD PEQLFDLEAD PAERTNLASD
PAHAGTLARL RALSEARWDL AAFDAAVRES QARRWVVYEA LRQGGYYPWD YQPLQKASER
YMRNHMDLNI LEESKRFPRG E