Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B0093 |
Symbol | |
ID | 6795319 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 95769 |
End bp | 97658 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642774404 |
Product | sulfatase |
Protein accession | YP_002145068 |
Protein GI | 197247800 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAATA AAAAAAATCT GTCCGCAGAA GAGACGGATC TTACGCGTAG GAAACTGTTA ACCAGCGCCG GTATTCTTGC CGCCGGCGGT ATGCTATCCG GCGCGGTAAA GGCTGATGAA AAATGCGCCG TCAAGGCGAA ACCGGCGTGG GATAAACCGT TTACCGGCGA AATCCCGGAA AAATTGCCAG AAGGATATAA TATTCTGTTA GTCGTGACCG ACCAGGAGCG TTTTTTTCCT ACGTTTCCTT TCCCGGTACC CGGCAGAGAG CGGCTCATGA AAACGGGGGT GACATTCTGT AATCATCAGA ATACCAGTAA TGTCTGTACG CCTTCCCGCT CCGTATTGTA TACCGGCTTA CATATGCCCC AGACAAAGAT GTTCGATAAT CTGGGATTAC CCTGGATGCC TTATGACCTT GACCCCGCTC TTGGAACCAC AGGCCATATG ATGCGGGAAC TGGGATACTA TACGGCCTAT AAAGGTAAGT GGCATCTTAC AGAAAAACTG GAGAAGCCTT TGCCTGACGA AAAAGATGAG GATATTGATG TCGGGGATAT TCCCGAACCA GAATTACATA AAATTATGGA AAAATATGGT TTTGCTGACT ATCACGGCAT CGGCGATATT ATAGGCCATA GTAAAGGCGG CTATTTTTAT GATTCAACCA CCACGGCTCA GACTATAAAT TGGTTAAGAT GCAAGGGGCA GCCCTTGAAT GACCAACACA AGCCCTGGTT CCTGGCCGTT AACCTCGTTA ATCCTCATGA CGTCATGTTT ATTGATACCG ATAAAGAGGG AGAAAAGGTA CAGTGGCGTG GCGAGTTGGA TCAGGATGAT AATACCCTGG CGCCCACGCA GCCGCCGGAA AACGAGCTTT ATCAGGCAAG CTGGCCGAAC TATCCGCTGC CGGCAAACAG GCATCAATCA TTCAATGAGC AGGGAAGACC GCCGGCGCAT CTTGAATACC AGACGGCGCG CGCTGCGCTG GAAGGGCAGT TTCCTGATGA AGATCGTCGT TGGCGTAAAC TGCTTGACTA CTATTTCAAC TGTATCCGCG ATTGTGATAC TCACCTTGAC CGGATATTAA ATGAACTTGA TGCCCTCAAG TTAACTGATA AAACGATTGT TGTATTTACT GCCGATCATG GCGAATTAGG CGGAAGCCAT CAGATGCACG GTAAAGGCGC TTCCGTTTAT AAAGAACAGA TCCATGTACC GATGATTATT TCCCACCCGG CGTACCCCGG TAATAAGAAA TGTCAGGCGT TGACCTGTCA TCTTGATATC GCGCCGACAT TAGTTGGACT GACCGGTTTG CCGGAAGAAA AACAGCACCA GGCGTTAGGC AACCGCAAAG GCGTTAATTT TAGCGGATTG CTAAAAAACC CGGAGGGCGT TGCGGTTAAT GCGGTGAGAA ATGCCAGCTT ATATTGCTAT GGCATGATCT TGTATACCGA TGCCCATTAT CTCCACCGCG TTATCGCGCT ACAAAGAGAT AAACAAAAAA CGGTGGCGCA AATCAAGCAG GAAATATCCC ATTTGCATCC TGATTTTAGC CATCGTTCAG GGACGCGGAT GATTAACGAT GGTCGTTATA AGTTTGCGCG TTATTTCTCG CTAAGGGAGC ATAATACGCC GGAAACCTGG GAGGATCTTA TTAAGTACAA CGATCTTGAA CTTTACGATC TTAAAAATGA TCCCGATGAG AACCATAACC TTGCTGCTGA TAAACAGAAA TATCAGGATC TCATTCTTAC GATGAATGAA AAACTGAATA AAATTATCAA AGACGAAATT GGCGTGGATG ACGGCAGTTT TATGCCGGAT GCGGCCCATG AGCCGTGGGA TCTTACTATT GAGCAGTTTA ACCGCATGGC GAAAGATTAA
|
Protein sequence | MSNKKNLSAE ETDLTRRKLL TSAGILAAGG MLSGAVKADE KCAVKAKPAW DKPFTGEIPE KLPEGYNILL VVTDQERFFP TFPFPVPGRE RLMKTGVTFC NHQNTSNVCT PSRSVLYTGL HMPQTKMFDN LGLPWMPYDL DPALGTTGHM MRELGYYTAY KGKWHLTEKL EKPLPDEKDE DIDVGDIPEP ELHKIMEKYG FADYHGIGDI IGHSKGGYFY DSTTTAQTIN WLRCKGQPLN DQHKPWFLAV NLVNPHDVMF IDTDKEGEKV QWRGELDQDD NTLAPTQPPE NELYQASWPN YPLPANRHQS FNEQGRPPAH LEYQTARAAL EGQFPDEDRR WRKLLDYYFN CIRDCDTHLD RILNELDALK LTDKTIVVFT ADHGELGGSH QMHGKGASVY KEQIHVPMII SHPAYPGNKK CQALTCHLDI APTLVGLTGL PEEKQHQALG NRKGVNFSGL LKNPEGVAVN AVRNASLYCY GMILYTDAHY LHRVIALQRD KQKTVAQIKQ EISHLHPDFS HRSGTRMIND GRYKFARYFS LREHNTPETW EDLIKYNDLE LYDLKNDPDE NHNLAADKQK YQDLILTMNE KLNKIIKDEI GVDDGSFMPD AAHEPWDLTI EQFNRMAKD
|
| |