Gene Rsph17029_2247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2247 
Symbol 
ID4897353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2379336 
End bp2380841 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content66% 
IMG OID640112841 
Productsulfatase 
Protein accessionYP_001044122 
Protein GI126463008 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.466267 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGC CCAACATCCT CATCCTGATG GTCGATCAGC TGAACGGAAC GCTGTTTCCC 
GACGGGCCGG CCGACTGGCT GCACGCGCCG AACCTGAAGC GGCTCGCGGA ACGCTCTCTC
CGGTTTTCGA ACAGCTATAC GGCCAGCCCC CTCTGCGCCC CGGCGCGGGC CTCCTTCATG
TCGGGTCAGC TGCCCTCGCG CACCCGGGTC TATGACAATG CGGCCGAGTT CGCCTCGAAC
ATTCCCACCT TCGCACATCA TCTGCGGCGT GCCGGCTATC AGACGACGCT CTCGGGCAAG
ATGCATTTCG TGGGACCCGA CCAGCTGCAC GGGTTCGAGG AGCGGCTGAC GACCGACATC
TATCCCGCCG ACTTCGGCTG GACGCCGGAC TATCGCAAGC CGGGCGAGCG GATCGACTGG
TGGTATCACA ACCTGGGCTC CGTCACCGGC GCGGGCGTGG CCGAGATCAC CAACCAGCTC
GAATATGACG ACGACGTGGC GCATCAGGCG ATCCAGAAGC TCTACGACCT GTCGCGCGGC
GCCGATCCGC GGCCCTGGTG CCTGACGGTC AGCTTCACCC ATCCCCACGA TCCCTTTGTG
GCGCGCCGGA AATACTGGGA CCTCTACGAG GACCATCCGA TGCTCGAGCC GCCCGCCTCC
ATTCCCTACG AAGCTCAGGA CAGCCACTCG CGCCGCCTGA TGGATGCCTG CGATTTCAAG
GCGTTCGACA TCACGCCCGA GCAGGTGCGC CGCGCGCGGC AGGGCTACTT CGCCAATATC
TCCTATGTGG ACGACAAGAT CGGCGAGATC CTCGCCGTGC TCGAAGCCTC GCGTCAGGAG
GCCATCGTCG TCTTCGTCTC GGATCACGGC GAGATGCTGG GCGACCGCGG TCTCTGGTTC
AAGATGAGCT TCTTCGAGGG CTCGGCCCGG GTGCCGCTGA TGATCGCGGC TCCCGGCCTG
CCCGCCGGGC GGATCGCGGC GCCGGTCTCG ACCATCGACG TGACGCCCAC GCTCTGCGCG
CTGGCCGGCA TCGACATGGG CGAGATCGCG CCCTGGACCG ATGGCGTCAG TCTGGTCCCG
CTGGCCGAAG GCGAACTGCG TCCCGAGCCC GTGTTCATGG AATATGCGGC CGAGGGTTCG
ATCACGCCGC TGGTGGCGAT CCGCGAGGGG CGCTGGAAAT ATGTCCGCTG CCTCGCAGAT
CCCGAACAGC TGTTCGATAT CGAGGCGGAT CCCGGCGAGC GGACGGATCT GGCGGCCGAT
CCGGCCCATG CCGACACGCT GGCGCGGCTG CGGTCCCTCT CGCAGGCGCG CTGGGATCTC
GCGGCCTTCG ACGCCGCCGT GCGCGAAAGC CAGGCCCGGC GCTGGATCGT CTACGAGGCG
CTGCGGCAGG GCGGCTATTA CCCCTGGGAT TACCAGCCGC TGCAGAAGGC TTCCGAGCGC
TACATGCGCA ACCACATGGA TCTGAATATT CTCGAAGAGA GCAAGCGCTT CCCGCGCGGC
GAATGA
 
Protein sequence
MTQPNILILM VDQLNGTLFP DGPADWLHAP NLKRLAERSL RFSNSYTASP LCAPARASFM 
SGQLPSRTRV YDNAAEFASN IPTFAHHLRR AGYQTTLSGK MHFVGPDQLH GFEERLTTDI
YPADFGWTPD YRKPGERIDW WYHNLGSVTG AGVAEITNQL EYDDDVAHQA IQKLYDLSRG
ADPRPWCLTV SFTHPHDPFV ARRKYWDLYE DHPMLEPPAS IPYEAQDSHS RRLMDACDFK
AFDITPEQVR RARQGYFANI SYVDDKIGEI LAVLEASRQE AIVVFVSDHG EMLGDRGLWF
KMSFFEGSAR VPLMIAAPGL PAGRIAAPVS TIDVTPTLCA LAGIDMGEIA PWTDGVSLVP
LAEGELRPEP VFMEYAAEGS ITPLVAIREG RWKYVRCLAD PEQLFDIEAD PGERTDLAAD
PAHADTLARL RSLSQARWDL AAFDAAVRES QARRWIVYEA LRQGGYYPWD YQPLQKASER
YMRNHMDLNI LEESKRFPRG E