Gene Rsph17029_3812 details


Gene Information

Locus tag: Rsph17029_3812
Symbol:
ID: 4899133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 944619
End bp: 946268
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 69%
IMG OID: 640114416
Product: sulfatase
Protein accession: YP_001045664
Protein GI: 126464551
COG category: [R] General function prediction only
COG ID: [COG2194] Predicted membrane-associated, metal-dependent hydrolase
TIGRFAM ID:
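
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Number of amino acids for a CDS whose length includes one stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(944619, 946268)
print(length)                           # 1650
print(expected_protein_length(length))  # 549
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields.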


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0194754
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.186688
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCCCAT TTCCGCTCAG CGTCGGACGG GCTCCGGCAA CGGCTCCCGC ACGGGGTCGC 
GCGCTTCCCT CCGTCTTCCA GATCAATCTT GCGGTGACGA GCTTCCTGCT GATCGCGGAC
AATGCGACCT TCTGGGCCCG TGCGCTCGGG ATCTTCGGGC CGGGGGCGGA GCTTCTGCTG
TTCGGGACGG CGATCTGGGC ACTCACCTTC TTCATCGTCG CCCTGTTCGG AACGGGGCCG
CTGCGCCGGC CGATGCTGGC ACTGCTGCTG CTGCTGGGCG CCGCGACCGG TTACTTCCAG
GATCGGCTGG GCGTGACGAT GGACCGGGAC ATGATCGAGA ACGTGATGAC GACGACGCTC
TCCGAGGGGC GCCATCTCGT CACGCCGGGC TTCCTGATCC ATCTGGCGAT CTTCGGCCTC
TTGCCCGCGC TGGTCGTGCT GCGCCTGCCC GTGCGCGAGG GCGGCTGGCG GGCGGCGGCG
CGCAGCGGGC TCTCGGCGGT CATGGCGCTC GCCCTCGGGG CGGGTCTCGT GATGGCCGAC
TTCAAGACCC TCTCGGCGGT CCTGCGCGAG CACAAGGAGC TGGTGTCGGC CTGGCAGCCC
GCGATGCCGC TTGGCTCGGC GCTCCGCTAT GCCAAGCTGC GCGTGCGCAC CCATGATCTG
ACGGTGGCCG CTCTCGGCAC CGATGCGCAG AAGGGGCCGC TCCTCGCGGC GGCGCCGAAG
CCGGTTCTGA CCGTCCTCGT GGTGGGCGAG ACGGCGCGGG CGCAGAACTG GGGGCTGAAC
GGCTACGAGC GCGACACGAC GCCCGAGCTG CGCGCCCGCG GCGTGGTGAA TTTTTCCGAC
GTCGAGAGCT GCGGCACGGC GACGGCCGTG TCGATGCCCT GCATGTTCTC GAACCTCACG
CGCAAGAGCT ACAGCCACGA GAAGGGCCTC GCGCAGGAGA ACCTGCTCGA CGTGCTGGCC
CATGCGGGCG TGGCGGTGGA GTGGTGGGAC AACAACACGG GCGACAAGGA CATCGCGGCG
CGCCTGCCGT CGGCGCGGGT GCCCGAGACG GTGGACGCCT GCGGCGAGGG CGAATGCACC
GATGCGGCCT TCCTGCCGCT CCTCGACCGG ACGCTCGCCG GCATGAAGGA CGATACGGTG
TTGGTGCTGC ACCAGATCGG CAGCCACGGG CCCGCCTATC ACCTGCGCTA TCCCAAGGCC
TTCGAGCGGT TCTCGCCCGC CTGCCAGAGC GCGGAATTCT CGCGCTGCAC GGACGAGGAG
ATCCGCAATG CCTATGACAA CAGCCTGGCC TTCACCGATC ATATCCTCGC CGCGATGATC
GACCGGCTGG CCGCGCAGGA CCGCGTGATC CCGGCGCTGG TCTATGTCTC GGACCACGGC
GAGTCGCTGG GCGAGAACGG GCTCTATCTG CATGGCGCGC CGCGCTTCAT GGCGCCCGAC
ACGCAGACCC ATGTGCCGAT GGTGATGTGG CTCTCCGAGG CGTTCCGGTC GGCCATGCAC
CTCGATGTGG GCTGCCTGCA GGCACAGGCG GCCGAGCCGG CGAGCCATGA CAACCTGTTC
CATTCGGTGC TCGGGCTGAT GGACATCCGC ACCGAGGTGC GCGACACGAG CCTCGACCGT
GTCTCGTCCT GCCGCGCTTC CGCTTCCTGA
 
Protein sequence
MFPFPLSVGR APATAPARGR ALPSVFQINL AVTSFLLIAD NATFWARALG IFGPGAELLL 
FGTAIWALTF FIVALFGTGP LRRPMLALLL LLGAATGYFQ DRLGVTMDRD MIENVMTTTL
SEGRHLVTPG FLIHLAIFGL LPALVVLRLP VREGGWRAAA RSGLSAVMAL ALGAGLVMAD
FKTLSAVLRE HKELVSAWQP AMPLGSALRY AKLRVRTHDL TVAALGTDAQ KGPLLAAAPK
PVLTVLVVGE TARAQNWGLN GYERDTTPEL RARGVVNFSD VESCGTATAV SMPCMFSNLT
RKSYSHEKGL AQENLLDVLA HAGVAVEWWD NNTGDKDIAA RLPSARVPET VDACGEGECT
DAAFLPLLDR TLAGMKDDTV LVLHQIGSHG PAYHLRYPKA FERFSPACQS AEFSRCTDEE
IRNAYDNSLA FTDHILAAMI DRLAAQDRVI PALVYVSDHG ESLGENGLYL HGAPRFMAPD
TQTHVPMVMW LSEAFRSAMH LDVGCLQAQA AEPASHDNLF HSVLGLMDIR TEVRDTSLDR
VSSCRASAS
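
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch, not a reference implementation: it strips the whitespace, computes the GC fraction, and translates the DNA using the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments are identical
# for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence shown above.
print(translate("ATGTTCCCAT TTCCGCTCAG"))             # MFPFPL
print(translate("GTCTCGTCCT GCCGCGCTTC CGCTTCCTGA"))  # VSSCRASAS
```

Passing the complete gene sequence to `translate` should reproduce the protein sequence if the record is internally consistent, and `gc_fraction` can be compared against the reported GC content of 69%.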