Gene Sare_3132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3132 
Symbol 
ID5706342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3561001 
End bp3563397 
Gene Length2397 bp 
Protein Length798 aa 
Translation table11 
GC content62% 
IMG OID641272564 
Productsulfatase 
Protein accessionYP_001537931 
Protein GI159038678 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000329231 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGCAAAG CATTCAAGGG TGTCATCGGC CTGGGCGTCA AGGACTCGAC GCCGGACTGG 
GATCCCTACG CGCAACCACA GGCCCCGAAG GGGGCACCGA ACGTGCTGTT CCTTGTGTGG
GACGACACCG GCTTTGGTTC CTGGGACTTC TTCGGCGGGC CCATCGAGAT GCCGAACATG
AGCAAGCTCG CGAACAACGG GCTGAGGTAC ACCCAGTTCC ACACGACGGC GCTGTGTTCA
CCCACCCGGG CCGCCCTGCT GAGCGGACGT AACCACACCA CGGTCGGAAT GTCCTGTGTC
GCGGAGGCGA CCGAGGGGTT CCCCGGTTTG AACGGGCACA TTCCCGGTGA GGCAGCGCTC
ATCGGCGAGA TTCTGAGCGA CCGGGGCTAC AACACCTATG CGCTCGGCAA GTGGCACTGT
GTCGCCGAGG ACGAGACCAA CATGGCCTCC TCCAAGCGCA ACTGGCCCAC CAGTCGCGGC
TTCGAGCGCT TCTACGGTTT CCTTGGTGGT GAGGCGAACC AGTACTACCC GAACCTCGTG
CAGGACCAGC AGTTCATCGA CCAGACCGCC GACCCGGTCA GCATCGATGA GTGGAAGAAG
GGCAAGGACG GTTATCTCCT CACCGCCGAC CTCGTCGATC GCGCTATCGG CATGATTTCC
GATGCCAAGC AGGTCGCCCC GGACCGGCCG TTTTTCATGT ACTTCTGCCC CGGCGCCAAC
CACGCCCCGC ATCACGTACC GAAAGCCTGG GCAGACAAGT ACAAGGGCAA GTTCGACATG
GGCTACGAGG CCATCCGCGA GAAGATCCTG GCGAAACAGA TCAAGATGGG TATCCTTCCG
AAGGGTACGG AACTCTCTCC GATCAACCCG CTCAGTGACG TGCGCAGTAC GGACGGCGAA
CCGTCGCCGC CGATGAGCGA CGTACGCCCG TGGGATTCGC TCAGCGACGA CGAGAAGAAG
TTGCAGACGC GCATGGCGGA GGTGTTCGCC GGCTTCTCCA GCTACGCGGA CCACGAGATC
GGCCGGCTGA TCTCGTACCT GGAGGAGACC GGGCAACTGG ACAACACCCT CATCTTCGTG
ATCTCGGACA ATGGAGCCTC GGGTGAGGGA GGACCAGATG GTGCGGTCAA TGAGAACACG
TTCTTCAACA GTGTCCCCAG CAGTGTGGAA GAGAACCTGA AGCTGCTCGA CATCCTTGGT
TCCCCAGGTA CGTACAACCA CTACTCGACG GGTTGGGCCT TTGCCTTCAA CACTCCGTTC
AAGCTGTTCA AACAGGACGC GTGGGAGGGA GGCGTCTGCG ACCCGATGAT CGTACACTGG
CCTGCGGGGA TCAAGGCGAA AGGCGAGATG CGCGACCAGT ACGCGCACGT CAGCGACATC
GTGCCGACGG TGTACGAGTG CCTCGGCATC GACCTGCCGG AGACGGTCAA GGGCTTCACC
CAGTGGCCCT TGGAAGGCAC CAGCTTCAAA CACACATTCG AGAAGCCCAA GGCGAAGACG
GCGAAGCGTA GCCAGTTCTA CCAGATGCTG GGCACCCGTG CGCTGTGGCG CGACGGGTGG
AAGGTGGACG CCCTGCATCC GAGTTCACCT GCCGACTGGG GGCACTTCGG GCAGGACAAG
TGGGCGCTCT ACCACACCGA CGTCGACCGT GCCGAGATCC ACGACGTGGC CGACCAACAT
CCCGAACTGG CCGCGGACCT GGTGGGCCTG TGGTACCACG AGGCCGGCAA GTTCTTCGGC
CTGCCGATGG ACGACCGACC TATCGCAGAG ATCCTCAGTA CGTCGCGACC ACAGGTGGCG
CCGCCCCGGG ACCACTACGT CTACTATCCG AACACGCTGG AGGTCCCGGA GGCGGTCGCG
GTCAACATCC GCGGGCGGTC CTACATCATC GCGGCCGATG TCATCATCGA CGGCTCCGAC
GCGGAAGGTG TTCTCTTCGC GCAGGGCTCC AACTTCGGCG GGCACGCACT CTACCTCAAG
GACGGCAAAC TCAAGTACGT CTACAACTAC CTGGGGGAGA ACGAGCAGGT GATCACGGCG
AACAGCGATG TGCCGAAGGG GAAGGTGGTG CTCGGTGTCG CGTTCGAGAA GGAGAAGTTG
ACCACTCCGC CCGGTTCGGA CCGACCCAGT GCGTGCATCG GCAACGCATC GCTGTTCATC
GGCAAGAAAA AGGTCGGTGA ATGTAAGGGT ATGCAAACCC AACTCGGCAA GTTCGCGCTC
GCCGGGGAAG GTTTCAACGT AGGGCGGGAC CGGGGCGCAC CCGTCACCTA TGACTATTCC
GGCGAGCGTC CCTGGAAGCT GACCGGGGCC ACGATCAAAC AGGTCATCGC CGACGTGTCG
GGTGAGGCCT ACGTCGACGT CGAGCGGGAG GCAGCGGCCA TGATGGCGCG GGACTGA
 
Protein sequence
MSKAFKGVIG LGVKDSTPDW DPYAQPQAPK GAPNVLFLVW DDTGFGSWDF FGGPIEMPNM 
SKLANNGLRY TQFHTTALCS PTRAALLSGR NHTTVGMSCV AEATEGFPGL NGHIPGEAAL
IGEILSDRGY NTYALGKWHC VAEDETNMAS SKRNWPTSRG FERFYGFLGG EANQYYPNLV
QDQQFIDQTA DPVSIDEWKK GKDGYLLTAD LVDRAIGMIS DAKQVAPDRP FFMYFCPGAN
HAPHHVPKAW ADKYKGKFDM GYEAIREKIL AKQIKMGILP KGTELSPINP LSDVRSTDGE
PSPPMSDVRP WDSLSDDEKK LQTRMAEVFA GFSSYADHEI GRLISYLEET GQLDNTLIFV
ISDNGASGEG GPDGAVNENT FFNSVPSSVE ENLKLLDILG SPGTYNHYST GWAFAFNTPF
KLFKQDAWEG GVCDPMIVHW PAGIKAKGEM RDQYAHVSDI VPTVYECLGI DLPETVKGFT
QWPLEGTSFK HTFEKPKAKT AKRSQFYQML GTRALWRDGW KVDALHPSSP ADWGHFGQDK
WALYHTDVDR AEIHDVADQH PELAADLVGL WYHEAGKFFG LPMDDRPIAE ILSTSRPQVA
PPRDHYVYYP NTLEVPEAVA VNIRGRSYII AADVIIDGSD AEGVLFAQGS NFGGHALYLK
DGKLKYVYNY LGENEQVITA NSDVPKGKVV LGVAFEKEKL TTPPGSDRPS ACIGNASLFI
GKKKVGECKG MQTQLGKFAL AGEGFNVGRD RGAPVTYDYS GERPWKLTGA TIKQVIADVS
GEAYVDVERE AAAMMARD