Gene SeSA_A0094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A0094 
Symbol 
ID6516046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp95959 
End bp97848 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content47% 
IMG OID642745268 
Productsulfatase 
Protein accessionYP_002113100 
Protein GI194735205 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.458255 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAATA AAAAAAATCT GTCCGCAGAA GAGACGGATC TTACGCGTAG GAAACTGTTA 
ACCAGCGCCG GTATTCTTGC CGCCGGCGGT ATGCTATCCG GCGCGGTAAA GGCTGATGAA
AAATGCGCCG TCAAGGCGAA ACCGGCGTGG GATAAACCGT TTACCGGCGA AATCCCGGAA
AAATTGCCAG AAGGATATAA TATTCTGTTA GTCGTGACCG ACCAGGAGCG TTTTTTTCCT
ACGTTTCCTT TCCCGGTACC CGGCAGAGAG CGGCTCATGA AAACGGGGGT GACATTCTGT
AATCATCAGA ATACCAGTAA TGTCTGCACG CCTTCCCGCT CCGTATTGTA TACCGGCTTA
CATATGCCCC AGACAAAGAT GTTTGATAAT CTGGGATTGC CCTGGATGCC TTATGACCTT
GACCCCGCTC TTGGAACCAC AGGCCATATG ATGCGGGAAC TGGGATACTA TACGGCCTAT
AAAGGTAAGT GGCATCTTAC AGAAAAACTG GAGAAGCCTT TGCCTGACGA AAAAGATGAG
GATATTGATG TCGGGGATAT TCCTGAACCA GAATTACATA AAATTATGGA AAAATATGGT
TTTGCTGACT ATCACGGCAT CGGCGATATT ATTGGCCATA GTAAAGGCGG CTATTTTTAT
GATTCAACCA CCACGGCTCA GACTATAAAT TGGTTAAGAT GCAAGGGGCA GCCCTTGAAT
GACCAACACA AGCCCTGGTT CCTGGCCGTT AACCTCGTTA ATCCTCATGA CGTCATGTTT
ATTGATACCG ATAAAGAGGG AGAAAAGGTA CAGTGGCGTG GCGAGTTGGA TCAGGATGAT
AATACCCTGG CGCCCACGCA GCCGCCGGAA AACGAGCTTT ATCAGGCAAG CTGGCCGAAC
TATCCGCTGC CGGCAAACAG GCATCAATCA TTCAATGAGC AGGGAAGACC GCCGGCGCAT
CTTGAATACC AGACGGCGCG CGCTGCGCTG GAAGGGCAGT TTCCTGATGA AGATCGTCGT
TGGCGTAAAC TGCTTGACTA CTATTTCAAC TGTATCCGCG ATTGTGATAC TCACCTTGAC
CGGATATTAA ATGAACTTGA TGCCCTCAAG TTAACTGATA AAACGATTGT TGTATTTACT
GCAGATCATG GCGAATTAGG CGGAAGCCAT CAGATGCACG GTAAAGGCGC TTCCGTTTAT
AAAGAACAGA TCCATGTACC GATGATTATT TCCCACCCGG CGTACCCCGG TAATAAGAAA
TGTCAGGCGT TGACCTGTCA TCTTGATATT GCGCCGACAT TAGTTGGGCT GACCGGTTTG
CCGGAAGAAA AACAGCACCA GGCGTTAGGC AACCGCAAAG GCGTTAATTT TAGCGGATTG
CTAAAAAACC CGGAGGGTGT TGCGGTTAAT GCGGTGAGAA ATGCCAGCTT ATATTGCTAT
GGCATGATCT TGTATACCGA TGCCCATTAT CTCCACCGCG TTATCGCGCT ACAAAGAGAT
AAACAAAAAA CGGTGGCGCA AATCAAGCAG GAAATATCCC ATTTGCATCC TGATTTCAGC
CATCGTTCAG GGACGCGGAT GATTAACGAT GGTCGTTATA AGTTTGCGCG TTATTTCTCG
CTAAGGGAGC ATAATACGCC GGAAACCTGG GAGGATCTTA TTAAGTACAA CGATCTTGAA
CTTTACGATC TTAAAAATGA TCCCGATGAG AACCATAACC TTGCTGCTGA TAAACAGAAA
TATCAGGATC TCATACTTAC GATGAATGAA AAACTGAATA AAATTATCAA AGACGAAATT
GGCGTGGATG ACGGCAGTTT TATGCCGGAT GCGGCCCGTG AGCCGTGGGA TCTTACTATT
GAGCAGTTTA ACCGCATGGC GAAAGATTAA
 
Protein sequence
MSNKKNLSAE ETDLTRRKLL TSAGILAAGG MLSGAVKADE KCAVKAKPAW DKPFTGEIPE 
KLPEGYNILL VVTDQERFFP TFPFPVPGRE RLMKTGVTFC NHQNTSNVCT PSRSVLYTGL
HMPQTKMFDN LGLPWMPYDL DPALGTTGHM MRELGYYTAY KGKWHLTEKL EKPLPDEKDE
DIDVGDIPEP ELHKIMEKYG FADYHGIGDI IGHSKGGYFY DSTTTAQTIN WLRCKGQPLN
DQHKPWFLAV NLVNPHDVMF IDTDKEGEKV QWRGELDQDD NTLAPTQPPE NELYQASWPN
YPLPANRHQS FNEQGRPPAH LEYQTARAAL EGQFPDEDRR WRKLLDYYFN CIRDCDTHLD
RILNELDALK LTDKTIVVFT ADHGELGGSH QMHGKGASVY KEQIHVPMII SHPAYPGNKK
CQALTCHLDI APTLVGLTGL PEEKQHQALG NRKGVNFSGL LKNPEGVAVN AVRNASLYCY
GMILYTDAHY LHRVIALQRD KQKTVAQIKQ EISHLHPDFS HRSGTRMIND GRYKFARYFS
LREHNTPETW EDLIKYNDLE LYDLKNDPDE NHNLAADKQK YQDLILTMNE KLNKIIKDEI
GVDDGSFMPD AAREPWDLTI EQFNRMAKD