Gene SNSL254_A0089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0089 
Symbol 
ID6482289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp95777 
End bp97666 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content46% 
IMG OID642735532 
Productsulfatase 
Protein accessionYP_002039314 
Protein GI194442740 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.995209 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value0.813464 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAATA AAAAAAATCT GTCCGCAGAA GAGACGGATC TTACGCGTAG GAAACTGTTA 
ACCAGTGCCG GTATTCTTGC CGCAGGCGGT ATGTTATCCG GCGCGGTAAA GGCTGATGAA
AAATGCGCCG TCAAGGCGAA ACCGGCGTGG GATAAACCGT TTACTGGCGA AATCCCGGAA
AAATTGCCAG AAGGATATAA TATTCTGTTA GTCGTGACCG ACCAGGAGCG TTTTTTTCCT
ACGTTTCCTT TCCCGGTACC CGGCAGAGAG CGGCTCATGA AAACGGGGGT GACATTCTGT
AATCATCAGA ATACCAGTAA TGTTTGTACG CCTTCCCGCT CCGTATTGTA TACCGGCTTA
CATATGCCCC AGACAAAGAT GTTTGATAAT CTGGGATTAC CCTGGATGCC TTATGACCTT
GACCCCGCTC TTGGAACCAC AGGTCATATG ATGCGGGAAC TGGGATACTA TACGGCCTAT
AAAGGTAAGT GGCATCTTAC AGAAAAACTG GAGAAGCCTT TGCCTGACGA AAAAGATGAG
GATATTGATG TCGGGGATAT TCCTGAACCA GAATTACATA AAATTATGGA AAAATATGGT
TTTGCTGACT ATCACGGCAT CGGCGATATT ATAGGCCATA GTAAAGGCGG CTATTTTTAT
GATTCAACCA CCACGGCTCA GACTATAAAT TGGTTAAGAT GCAAGGGGCA GCCCTTGAAT
GACCAACACA AGCCCTGGTT CCTGGCCGTT AACCTCGTTA ATCCTCATGA CGTCATGTTT
ATTGATACCG ATAAAGAGGG AGAAAAGATA CAGTGGCGTG GCGAGTTGGA TCAGGATGAT
AATACCCTGG CGCCCACGCA GCCGCCGGAA AACGAGCTTT ATCAGGCAAG CTGGCCGAAC
TATCCGCTGC CGGCAAACAG GCATCAATCA TTCAATGAGC AGGGAAGACC GCCGGCGCAT
CTTGAATACC AGACGGCGCG CGCTGCGCTG GAAGGGCAGT TTCCTGATGA AGATCGTCGT
TGGCGTAAAC TGCTTGACTA CTATTTCAAC TGTATCCGCG ATTGTGATAC TCACCTTGAC
CGGATATTAA ATGAACTTGA TGCCCTCAAG TTAACTGATA AAACGATTGT TGTATTTACT
GCCGATCATG GCGAATTAGG CGGAAGCCAT CAGATGCACG GTAAAGGCGC TTCCGTTTAT
AAAGAACAGA TCCATGTACC GATGATTATT TCCCACCCGG CGTACCCCGG TAATAAGAAA
TGTCAGGCGT TGACCTGTCA TCTTGATATC GCGCCGACAT TAGTTGGACT GACCGGTTTG
CCGGAAGAAA AACAGCACCA GGCGTTAGGC AACCGCAAAG GTGTTAATTT TAGCGGATTG
CTAAAAAATC CGGAGGGCGT TGCGGTTAAT GCGGTGAGAA ATGCCAGCTT ATATTGCTAT
GGCATGATCT TGTATACCGA TGCCCATTAT CTCCACCGCG TTATCGCGCT ACAAAGAGAT
AAACAAAAAA CGGTGGCGCA AATCAAGCAG GAAATATCCC ATCTGCATCC TGATTTTAGC
CATCGTTCAG GGACGCGCAT GATTAACGAT GGCCGTTATA AGTTTGCGCG TTATTTCTCG
CTAAGGGAGC ATAATACGCC GGAAACCTGG GAGGATCTTA TTAAGTACAA CGATCTTGAA
CTTTACGATC TTAAAAATGA TCCCGATGAG AACCATAACC TTGCTGCTGA TAAACAGAAA
TATCAGGATC TCATTCTTAC GATGAATGAA AAACTGAATA AAATTATCAA GGACGAAATT
GGCGTGGATG ACGGCAGTTT TATGCCGGAT GCGGCCCATG AGCCGTGGGA TCTTACTATT
GAGCAGTTTA ACCGCATGGC GAAAGATTAA
 
Protein sequence
MSNKKNLSAE ETDLTRRKLL TSAGILAAGG MLSGAVKADE KCAVKAKPAW DKPFTGEIPE 
KLPEGYNILL VVTDQERFFP TFPFPVPGRE RLMKTGVTFC NHQNTSNVCT PSRSVLYTGL
HMPQTKMFDN LGLPWMPYDL DPALGTTGHM MRELGYYTAY KGKWHLTEKL EKPLPDEKDE
DIDVGDIPEP ELHKIMEKYG FADYHGIGDI IGHSKGGYFY DSTTTAQTIN WLRCKGQPLN
DQHKPWFLAV NLVNPHDVMF IDTDKEGEKI QWRGELDQDD NTLAPTQPPE NELYQASWPN
YPLPANRHQS FNEQGRPPAH LEYQTARAAL EGQFPDEDRR WRKLLDYYFN CIRDCDTHLD
RILNELDALK LTDKTIVVFT ADHGELGGSH QMHGKGASVY KEQIHVPMII SHPAYPGNKK
CQALTCHLDI APTLVGLTGL PEEKQHQALG NRKGVNFSGL LKNPEGVAVN AVRNASLYCY
GMILYTDAHY LHRVIALQRD KQKTVAQIKQ EISHLHPDFS HRSGTRMIND GRYKFARYFS
LREHNTPETW EDLIKYNDLE LYDLKNDPDE NHNLAADKQK YQDLILTMNE KLNKIIKDEI
GVDDGSFMPD AAHEPWDLTI EQFNRMAKD