Gene RoseRS_3340 details

Gene Information

Locus tag: RoseRS_3340
Symbol:
ID: 5210317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4188887
End bp: 4190509
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 61%
IMG OID: 640596938
Product: sulfatase
Protein accession: YP_001277651
Protein GI: 148657446
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
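
The coordinates and lengths in this record are mutually consistent, which a little arithmetic confirms. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(4188887, 4190509)
print(length)                     # 1623
print(protein_length_aa(length))  # 540
```

Both values match the Gene Length and Protein Length fields above.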


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.171629
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCACG TTCTTCCATA TGCAGGCGCT CGACGTGGTA TACTAAAGCA ACCCGAACCA 
GCAGTTCCTG GAGGAGGAGC ATTTGTGAGT CGGCGCCCCG ATATTGTGTT GCTCGTGCTG
GATACCCAGC GTTGCGATAG ACTTTCGTGC TACGGCTATT CTCGACCAAC CTCGCCCTGC
CTCGATGAGC TTGCGGCTGA AGCGACCCTT TTCCGTCGCG TCTTTGCCAC TGCGCAGTGG
ACGATCCCAT CACACGCTTC GATGTTTACC GGTCTCTATC CATCGGAGCA TGCGACCAAC
CAATCGTCGG CGGCGCTCCC CTCCGGCATT CCGACGCTGG CGGAACGCCT GCGTGAAGGC
GGATATATGA CGGCGGCGTT CTGCAACAAC CCGCTGGTGG GCGTCGTCAA CAACGGGTTG
CGGCGCGGTT TTGAGAGTTT TCTGAATTAC AGCGGTCTGC TGACCTCGCG CCCGAATCAG
GCAGGCGCGC ATCCGGGACT GATCAGTCGT TATCGTCAGT GGTTCAAGGG TCGTCTGGCG
GCAACGCTCA ATCGCATTCA GAACTCATTC GCGCGTTCTG AATTCATGCT GGAATTCGCG
TTTACGCCAT TGATGGTGCC AATCTGGCAG ACGGCGCTCA GTTTCAAGGG GAATACGCCC
AAATCGCTCA GCGATGCTGC GCGTTTGCTG ATCGAACGGC GCGGCGTTGA GCCGAATCAG
CCCATTTTTG CCTTCATCAA CCTGATGGGC GTTCATACGC CGTACCATCC GGATCGGCGG
ATGCTCGAAC GGTTCGCGCC GGACGTGATC CGTGACCGGG AAGCAGCGCG CTATGTGCGT
CGCTTCAACG GCGATGTGTT CGGCTGGCTT GCGCCATTCT CCAGTATTGA CGAACGGTAT
CACCACGTGC TCAGCGATGT GTACGACGCC GAGGTCGCCA CGCAGGATGC GCATCTTGGC
GTGTTCCTGC GCCGGATGCG CGAGAGCGGC GCGCTCGACC GCACCCTGCT GCTGGTATGC
GCCGACCATG GTGATCACCT GGGCGAAAAA GGTCTCGTCG GGCACACGGT ATCGGTCTAC
AACGAACTGA TCCATGTGCC GCTGATGGTG CGCGATCCGG ATGGTGACTT TCCACGAGGT
GCGGTGGTCG ATCATCCGGT GTCGTTGCGA CGGGTTTTCC ACACCCTGCT GAGCGCCGCC
AGGCTCGCCA GCGGCGTCGA GCGTGATCGC TCGCTGGCAC AATCGCCAGC TGCCGATCCC
GATGGCGGCA CGGTCTTCAG TGAGGCAGAA CCGCTGCAAA ACGTTCTGGG GATCATGCTG
CGACGCCAGC CCGATCGTGC GCGTGCGCGC CGCTTCGATC AACCGCGCCG CGCGGTGATC
AACGGTTCGC ACAAACTGAT CCAGACCGGG GAAGACCAGG TTGAGTTGTA CGATCTCGAT
GCTGATCCCC GCGAAACAGT TGACCTGGCG GCAATGTTGC CGGAACGAGT CGAAGAACTT
CAGGCGCGTC TCAGCGCCTT TGTGCGCCGC GCCGATGCAA CAGCGCCGCT CATCCGACGC
GCTGAGGGCG TGGATGATCC GACCGTGCAG CGGCGTTTGC GAGAACTGGG GTATCTTGAA
TAA
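
The gene sequence is printed in space-separated groups of ten bases, so it has to be joined before any analysis. A minimal sketch follows, assuming the listing contains only the characters A, C, G, T and whitespace. The helper names are hypothetical. Applied to the full sequence, `gc_fraction` can be compared against the 61% GC content reported in this record.

```python
def clean_sequence(text: str) -> str:
    """Join a grouped sequence listing into one upper-case string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing, used here as sample input.
first_line = "ATGCTCCACG TTCTTCCATA TGCAGGCGCT CGACGTGGTA TACTAAAGCA ACCCGAACCA "
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2%}")
```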
 
Protein sequence
MLHVLPYAGA RRGILKQPEP AVPGGGAFVS RRPDIVLLVL DTQRCDRLSC YGYSRPTSPC 
LDELAAEATL FRRVFATAQW TIPSHASMFT GLYPSEHATN QSSAALPSGI PTLAERLREG
GYMTAAFCNN PLVGVVNNGL RRGFESFLNY SGLLTSRPNQ AGAHPGLISR YRQWFKGRLA
ATLNRIQNSF ARSEFMLEFA FTPLMVPIWQ TALSFKGNTP KSLSDAARLL IERRGVEPNQ
PIFAFINLMG VHTPYHPDRR MLERFAPDVI RDREAARYVR RFNGDVFGWL APFSSIDERY
HHVLSDVYDA EVATQDAHLG VFLRRMRESG ALDRTLLLVC ADHGDHLGEK GLVGHTVSVY
NELIHVPLMV RDPDGDFPRG AVVDHPVSLR RVFHTLLSAA RLASGVERDR SLAQSPAADP
DGGTVFSEAE PLQNVLGIML RRQPDRARAR RFDQPRRAVI NGSHKLIQTG EDQVELYDLD
ADPRETVDLA AMLPERVEEL QARLSAFVRR ADATAPLIRR AEGVDDPTVQ RRLRELGYLE
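
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code, with the two tables differing only in permitted start codons. The `translate` helper is illustrative and handles only unambiguous A, C, G, T input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCTCCACG TTCTTCCATA TGCAGGCGCT"))  # MLHVLPYAGA
```

The output matches the first ten residues of the protein sequence listed above.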