Gene Rsph17029_3668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3668 
Symbol 
ID4898658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp768911 
End bp770437 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content75% 
IMG OID640114276 
Productsulfatase 
Protein accessionYP_001045530 
Protein GI126464417 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.902858 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0745402 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCGCG CGGCGCTGAG CCTCCTCCTC CTTGGCCTTG TGCTGGCGCT GCCCGGCCAC 
CCGTGGGGCG CGTGGCGCTT TCCGCTCGAG CTTCCGGCGC TGGTGGCGCT TCTGGCGCTC
CGGCCGGGAC GCGCGCGGGG GCTGCGGGCG CTGCTGGTTG CGGCTCTCGG GGCGGTCACG
GTGCTGAAGG CGCTCGATCT CGCGCTGCAG CTGGCGTTCG GGCGGCCGTT CGATCCGGTG
GCCGACCTGC CGCTTCTCGC CTCGGCCCTC GATCTGGGCG AGGGGACGCT GGGGGCGGGA
GGGGCACTGC TGGCCGCGGT GCTCATCGGG GCGGTGCTGG CCGGTCTGGC GGCGGGCCTC
TGGTGGGCGA GCGGGGTCTG GGCCGGACGC GGACGCCCGC TTGCTTGCGG GACGGCGGCG
GCGCTGGTCC TTGCGCTGGT GGCGGCGGAT CTCGCTTGGC CCGACCGGGT GCCGGGCGCG
GCCTTCACCA CGCGGCTCGT CTGGGACCAT GCCGTGACGG CCCGGCAGAC GCGCGCCGAT
CTGGCGGCCT TCCGCGCGGC GGCCCGCACC GATCCGTGGG CCGGACGCAC GGATGCCTTC
GCGCGCCTGG GGCCGGCGGA GCTTCGCATC CTCTTCGTCG AAAGCTACGG GCGGGCGAGC
TTCGACAATC CGCTCTATGC CTCCCATGCC GCCCTCCTGC GCGCGGCAGA GAGCGGGATC
GCGGCGCAGG GCCTCGCCAT GCGGTCGGGC TGGCTCGGCT CGCCGGTTGC GGGCGGGCAG
AGCTGGCTTG CCCATGCCAC GCTGGCCTCT GGTCTGCGGA TCGACGGCGC CATCCGCTAC
CGCGCCCTGA TCGCGAGCCC GCGCAAGACC CTGTTCGAGC TGGCGCGAGC CGCGGGGCGG
GAGACGCTGG CCGTCATGCC GGGCATCACC CGCGCCTGGC CCGAGGGCGT GAGGCTCGGC
TTCTCGCACA TTCTGGATGC CGAGGGGCTG GGCTACCGGG GGCGTCCCTT CAACTGGGTC
ACCATGCCCG ACCAGTTCAC CCTGACGGCC TTCGACCGGC TCGGGCCCGA GGCGTCGCGG
GTGGCGCAGA TCGTGCTTCT CTCGAGCCAC GCGCCCTGGG TACCCGAGCC GCGGCTGGTG
CCGTGGGAGG CGGTGGACGA CGGCCGGATC TTCGACGATC AGGCTGCGGC GGGCGATCCG
CCGGAGGTGG TCTGGCGCGA TCCCGACCGG GTGCGGGCCG CCTATCGCCA GTCGCTCGCC
TATGCGCTGC GGACGGCAAC GGCCCATGCG GCGCGGTCGG GCGCGGGTGC CTTGACGCTG
ATCCTCGGGG ATCATCCGCC CGCCCCCTTC GTCTCGGGCA TCGCGGGGCG GGACGTGCCG
GCCCATCTGA TCGGCCCGCC CGAGCTTCTG GCCGCTTTCG ACGGCTGGGG CTGGACCGCG
GGCCTGATCC CGGCGCCGGA CCTGCCCGTC CTGCCGATGG AGGGCTTCCG CGACCGGTTC
CTGACCGCCC TCTCCGGCCC GCCATGA
 
Protein sequence
MARAALSLLL LGLVLALPGH PWGAWRFPLE LPALVALLAL RPGRARGLRA LLVAALGAVT 
VLKALDLALQ LAFGRPFDPV ADLPLLASAL DLGEGTLGAG GALLAAVLIG AVLAGLAAGL
WWASGVWAGR GRPLACGTAA ALVLALVAAD LAWPDRVPGA AFTTRLVWDH AVTARQTRAD
LAAFRAAART DPWAGRTDAF ARLGPAELRI LFVESYGRAS FDNPLYASHA ALLRAAESGI
AAQGLAMRSG WLGSPVAGGQ SWLAHATLAS GLRIDGAIRY RALIASPRKT LFELARAAGR
ETLAVMPGIT RAWPEGVRLG FSHILDAEGL GYRGRPFNWV TMPDQFTLTA FDRLGPEASR
VAQIVLLSSH APWVPEPRLV PWEAVDDGRI FDDQAAAGDP PEVVWRDPDR VRAAYRQSLA
YALRTATAHA ARSGAGALTL ILGDHPPAPF VSGIAGRDVP AHLIGPPELL AAFDGWGWTA
GLIPAPDLPV LPMEGFRDRF LTALSGPP