Gene RSP_0594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_0594 
Symbol 
ID3718008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp2334500 
End bp2336005 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content66% 
IMG OID640071805 
Productputative choline sulfatase 
Protein accessionYP_353669 
Protein GI77464165 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCAGC CCAACATCCT CATCCTGATG GTCGATCAGC TGAACGGAAC GCTGTTTCCC 
GACGGGCCGG CCGACTGGCT GCACGCGCCG AACCTGAAGC GGCTCGCGGA CCGCTCTCTC
CGGTTTTCGA ACAGCTATAC GGCCAGCCCC CTCTGCGCCC CGGCGCGGGC CTCCTTCATG
TCGGGTCAGC TGCCCTCGCG CACCCGGGTC TATGACAATG CGGCCGAGTT CGCCTCGGAC
ATTCCCACCT TCGCACATCA TCTGCGGCGC GCCGGCTATC AGACCACGCT CTCGGGCAAG
ATGCATTTCG TGGGGCCCGA CCAGCTGCAC GGGTTCGAGG CGCGGCTGAC GACCGACATC
TATCCCGCCG ACTTCGGCTG GACGCCGGAC TATCGCAAGC CGGGCGAGCG GATCGACTGG
TGGTATCACA ACTTGGGCTC CGTCACCGGC GCGGGCGTGG CCGAGATCAC CAACCAGCTC
GAATATGACG ACGACGTGGC GCATCAGGCG ATCCAGAAGC TCTACGACCT GTCGCGCGGC
GCCGATCCGC GGCCCTGGTG CCTCACGGTC AGCTTCACCC ATCCGCACGA TCCCTTCGTG
GCGCGCCGGA AATACTGGGA CCTCTACGAG GACCATCCAA TGCTCGAGCC GCCCGCCTCC
ATTCCCTACG AAGCCCAGGA CAGCCACTCG CGCCGCCTGA TGGATGCCTG CGATTTCAAG
GCGTTCGACA TCACGCCCGA GCAGGTGCGC CGCGCGCGGC AGGGCTACTT CGCCAATATC
TCCTATGTGG ACGACAAGAT CGGCGAGATC CTCGCCGTAC TCGAGGCCTC GCGTCAGGAG
GCCATCGTCG TCTTCGTCTC GGATCACGGC GAGATGCTGG GCGACCGCGG CCTCTGGTTC
AAGATGAGCT TCTTCGAGGG CTCGGCCAGG GTGCCGCTGA TGATCGCGGC TCCCGGCCTG
CCCGCCGGGC GGATCGCGGC GCCGGTTTCG ACCATCGACG TGACGCCCAC GCTCTGCGCG
CTGGCCGGCA TCGACATGGG CGAGATCGCG CCCTGGACCG ATGGCATCAG TCTGGTCCCG
CTGGCCGAAG GCGAACTGCG TCCCGAGCCC GTGTTCATGG AATATGCGGC CGAGGGTTCG
ATCACGCCGC TGGTGGCGAT CCGCGAGGGG CGCTGGAAAT ATGTCCGCTG TCTCGCAGAT
CCCGAACAGC TGTTCGATAT CGAGGCGGAT CCCGGCGAGC GGACGGATCT GGCGGCCGAT
CCGGCCCATG CCGACACGCT GGCGCGGCTG CGGTCCCTCT CGCAGGCGCG CTGGGATCTC
GCGGCCTTCG ACGCCGCCGT GCGCGAAAGC CAGGCCCGGC GCTGGATCGT CTACGAGGCG
CTGCGGCAGG GCGGCTATTA CCCGTGGGAT TACCAGCCGC TGCAGAAGGC TTCCGAGCGC
TACATGCGCA ACCACATGGA TCTGAATATT CTCGAAGAGA GCAAGCGCTT CCCGCGGGGC
GAATGA
 
Protein sequence
MTQPNILILM VDQLNGTLFP DGPADWLHAP NLKRLADRSL RFSNSYTASP LCAPARASFM 
SGQLPSRTRV YDNAAEFASD IPTFAHHLRR AGYQTTLSGK MHFVGPDQLH GFEARLTTDI
YPADFGWTPD YRKPGERIDW WYHNLGSVTG AGVAEITNQL EYDDDVAHQA IQKLYDLSRG
ADPRPWCLTV SFTHPHDPFV ARRKYWDLYE DHPMLEPPAS IPYEAQDSHS RRLMDACDFK
AFDITPEQVR RARQGYFANI SYVDDKIGEI LAVLEASRQE AIVVFVSDHG EMLGDRGLWF
KMSFFEGSAR VPLMIAAPGL PAGRIAAPVS TIDVTPTLCA LAGIDMGEIA PWTDGISLVP
LAEGELRPEP VFMEYAAEGS ITPLVAIREG RWKYVRCLAD PEQLFDIEAD PGERTDLAAD
PAHADTLARL RSLSQARWDL AAFDAAVRES QARRWIVYEA LRQGGYYPWD YQPLQKASER
YMRNHMDLNI LEESKRFPRG E