Gene SeAg_B0093 details


Gene Information

Locus tag: SeAg_B0093
Symbol:
ID: 6795319
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 95769
End bp: 97658
Gene Length: 1890 bp
Protein Length: 629 aa
Translation table: 11
GC content: 47%
IMG OID: 642774404
Product: sulfatase
Protein accession: YP_002145068
Protein GI: 197247800
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
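
The coordinates and lengths listed above are mutually consistent, which a short check makes explicit. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive (the record does not state the convention); the variable names are illustrative only.

```python
# Values copied from the record; coordinates are ASSUMED to be 1-based, inclusive.
start_bp = 95769
end_bp = 97658

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
assert gene_length % 3 == 0           # a CDS should be a whole number of codons

codons = gene_length // 3
protein_length = codons - 1           # the final codon is the stop codon

print(gene_length, protein_length)    # 1890 629
```

Under that assumption the span gives 1890 bp and 629 aa, matching the Gene Length and Protein Length fields.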


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAATA AAAAAAATCT GTCCGCAGAA GAGACGGATC TTACGCGTAG GAAACTGTTA 
ACCAGCGCCG GTATTCTTGC CGCCGGCGGT ATGCTATCCG GCGCGGTAAA GGCTGATGAA
AAATGCGCCG TCAAGGCGAA ACCGGCGTGG GATAAACCGT TTACCGGCGA AATCCCGGAA
AAATTGCCAG AAGGATATAA TATTCTGTTA GTCGTGACCG ACCAGGAGCG TTTTTTTCCT
ACGTTTCCTT TCCCGGTACC CGGCAGAGAG CGGCTCATGA AAACGGGGGT GACATTCTGT
AATCATCAGA ATACCAGTAA TGTCTGTACG CCTTCCCGCT CCGTATTGTA TACCGGCTTA
CATATGCCCC AGACAAAGAT GTTCGATAAT CTGGGATTAC CCTGGATGCC TTATGACCTT
GACCCCGCTC TTGGAACCAC AGGCCATATG ATGCGGGAAC TGGGATACTA TACGGCCTAT
AAAGGTAAGT GGCATCTTAC AGAAAAACTG GAGAAGCCTT TGCCTGACGA AAAAGATGAG
GATATTGATG TCGGGGATAT TCCCGAACCA GAATTACATA AAATTATGGA AAAATATGGT
TTTGCTGACT ATCACGGCAT CGGCGATATT ATAGGCCATA GTAAAGGCGG CTATTTTTAT
GATTCAACCA CCACGGCTCA GACTATAAAT TGGTTAAGAT GCAAGGGGCA GCCCTTGAAT
GACCAACACA AGCCCTGGTT CCTGGCCGTT AACCTCGTTA ATCCTCATGA CGTCATGTTT
ATTGATACCG ATAAAGAGGG AGAAAAGGTA CAGTGGCGTG GCGAGTTGGA TCAGGATGAT
AATACCCTGG CGCCCACGCA GCCGCCGGAA AACGAGCTTT ATCAGGCAAG CTGGCCGAAC
TATCCGCTGC CGGCAAACAG GCATCAATCA TTCAATGAGC AGGGAAGACC GCCGGCGCAT
CTTGAATACC AGACGGCGCG CGCTGCGCTG GAAGGGCAGT TTCCTGATGA AGATCGTCGT
TGGCGTAAAC TGCTTGACTA CTATTTCAAC TGTATCCGCG ATTGTGATAC TCACCTTGAC
CGGATATTAA ATGAACTTGA TGCCCTCAAG TTAACTGATA AAACGATTGT TGTATTTACT
GCCGATCATG GCGAATTAGG CGGAAGCCAT CAGATGCACG GTAAAGGCGC TTCCGTTTAT
AAAGAACAGA TCCATGTACC GATGATTATT TCCCACCCGG CGTACCCCGG TAATAAGAAA
TGTCAGGCGT TGACCTGTCA TCTTGATATC GCGCCGACAT TAGTTGGACT GACCGGTTTG
CCGGAAGAAA AACAGCACCA GGCGTTAGGC AACCGCAAAG GCGTTAATTT TAGCGGATTG
CTAAAAAACC CGGAGGGCGT TGCGGTTAAT GCGGTGAGAA ATGCCAGCTT ATATTGCTAT
GGCATGATCT TGTATACCGA TGCCCATTAT CTCCACCGCG TTATCGCGCT ACAAAGAGAT
AAACAAAAAA CGGTGGCGCA AATCAAGCAG GAAATATCCC ATTTGCATCC TGATTTTAGC
CATCGTTCAG GGACGCGGAT GATTAACGAT GGTCGTTATA AGTTTGCGCG TTATTTCTCG
CTAAGGGAGC ATAATACGCC GGAAACCTGG GAGGATCTTA TTAAGTACAA CGATCTTGAA
CTTTACGATC TTAAAAATGA TCCCGATGAG AACCATAACC TTGCTGCTGA TAAACAGAAA
TATCAGGATC TCATTCTTAC GATGAATGAA AAACTGAATA AAATTATCAA AGACGAAATT
GGCGTGGATG ACGGCAGTTT TATGCCGGAT GCGGCCCATG AGCCGTGGGA TCTTACTATT
GAGCAGTTTA ACCGCATGGC GAAAGATTAA
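
The GC content field can in principle be recomputed from the gene sequence above. The following is a rough sketch, not the database's own method: `gc_content` is a hypothetical helper, and how the record rounds its 47% figure is not stated. The example call uses only the first 20 bases of the sequence.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters in seq.

    Spaces and line breaks (as in the 10-base blocks above) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence: 3 of 20 bases are G or C.
print(gc_content("ATGAGTAATA AAAAAAATCT"))  # 15.0
```

Passing the full 1890-base sequence to the same function gives a whole-gene value to compare with the reported figure.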
 
Protein sequence
MSNKKNLSAE ETDLTRRKLL TSAGILAAGG MLSGAVKADE KCAVKAKPAW DKPFTGEIPE 
KLPEGYNILL VVTDQERFFP TFPFPVPGRE RLMKTGVTFC NHQNTSNVCT PSRSVLYTGL
HMPQTKMFDN LGLPWMPYDL DPALGTTGHM MRELGYYTAY KGKWHLTEKL EKPLPDEKDE
DIDVGDIPEP ELHKIMEKYG FADYHGIGDI IGHSKGGYFY DSTTTAQTIN WLRCKGQPLN
DQHKPWFLAV NLVNPHDVMF IDTDKEGEKV QWRGELDQDD NTLAPTQPPE NELYQASWPN
YPLPANRHQS FNEQGRPPAH LEYQTARAAL EGQFPDEDRR WRKLLDYYFN CIRDCDTHLD
RILNELDALK LTDKTIVVFT ADHGELGGSH QMHGKGASVY KEQIHVPMII SHPAYPGNKK
CQALTCHLDI APTLVGLTGL PEEKQHQALG NRKGVNFSGL LKNPEGVAVN AVRNASLYCY
GMILYTDAHY LHRVIALQRD KQKTVAQIKQ EISHLHPDFS HRSGTRMIND GRYKFARYFS
LREHNTPETW EDLIKYNDLE LYDLKNDPDE NHNLAADKQK YQDLILTMNE KLNKIIKDEI
GVDDGSFMPD AAHEPWDLTI EQFNRMAKD
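
The protein sequence follows from the gene sequence by codon translation. The sketch below is a minimal illustration with a hypothetical `translate` helper. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in permitted start codons), and it does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":                   # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGAGTAATA AAAAAAATCT GTCCGCAGAA"))  # MSNKKNLSAE
```

The final four codons of the gene, GCG AAA GAT TAA, translate the same way to AKD followed by a stop, matching the end of the protein sequence.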