Gene SeHA_C0089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0089 
Symbol 
ID6490348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp91213 
End bp93102 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content47% 
IMG OID642740377 
Productsulfatase 
Protein accessionYP_002044051 
Protein GI194451018 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones86 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAATA AAAAAAATCT GTCCGCAGAA GAGACGGATC TTACGCGTAG GAAACTGTTA 
ACCAGTGCCG GTATTCTTGC CGCAGGCGGT ATGTTATCCG GCGCGGTAAA GGCTGATGAA
AAATGCGCCG TCAAGGCGAA ACCGGCGTGG GATAAACCGT TTACTGGCGA AATCCCGGAA
AAATTGCCAG AAGGATATAA TATTCTGTTA GTCGTGACCG ACCAGGAGCG TTTTTTTCCT
ACGTTTCCTT TCCCGGTACC CGGCAGAGAG CGGCTCATGA AAACGGGGGT GACATTCTGT
AATCATCAGA ATACCAGTAA TGTCTGTACG CCTTCCCGCT CCGTATTGTA TACCGGCTTA
CATATGCCCC AGACAAAGAT GTTTGATAAT TTGGGATTAC CCTGGATGCC TTATGACCTT
GACCCCGCTC TTGGAACCAC AGGTCATATG ATGCGGGAAC TGGGATACTA TACGGCCTAT
AAAGGTAAGT GGCATCTTAC AGAAAAACTG GAGAAGCCTT TGCCTGACGA AAAAGATGAG
GATATTGATG TCGGGGATAT TCCTGAACCA GAATTACATA AAATTATGGA AAAATATGGT
TTTGCTGACT ATCACGGCAT CGGCGATATT ATAGGCCATA GTAAAGGCGG CTATTTTTAT
GATTCAACCA CCACGGCTCA GACTATAAAT TGGTTAAGAT GCAAGGGGCA GCCCTTGAAT
GACCAACACA AGCCCTGGTT CCTGGCCGTT AACCTCGTTA ATCCTCATGA CGTCATGTTT
ATTGATACCG ATAAAGAGGG AGAAAAGGTA CAGTGGCGTG GCGAGTTGGA TCAGGATGAT
AATACCCTGG CGCCCACGCA GCCGCCGGAA AACGAGCTTT ATCAGGCAAG CTGGCCGAAC
TATCCGCTGC CGGCAAACAG GCATCAGTCA TTCAATGAGC AGGGAAGACC GCCGGCGCAT
CTTGAATACC AGACGGCGCG CGCTGCGCTG GAAGGGCAGT TTCCTGATGA AGATCGTCGT
TGGCGTAAAC TGCTTGACTA CTATTTCAAC TGTATCCGCG ATTGTGATAC TCACCTTGAC
CGGATATTAA ATGAACTGGA TGCCCTCAAG TTAACTGATA AAACGATTGT TGTATTTACT
GCCGATCATG GCGAATTAGG CGGAAGCCAT CAGATGCACG GTAAAGGCGC TTCCGTTTAT
AAAGAACAGA TCCATGTACC GATGATTATT TCCCACCCGG CGTACCCCGG TAATAAGAAA
TGTCAGGCGT TGACCTGTCA TCTTGATATT GCGCCGACAT TAGTTGGGCT GACCGGTTTG
CCGGAAGAAA AACAGCACCA GGCGTTAGGC AACCGCAAAG GCGTTAATTT TAGCGGATTG
CTAAAAAACC CGGAGAGCGT TGCGGTTAAT GCGGTGAGAA ATGCCAGCTT ATATTGCTAT
GGCATGATCT TGTATACCGA TGCCCATTAT CTCCACCGCG TTATCGCGCT ACAAAGAGAT
AAACAAAAAA CGGTGGCGCA AATCAAGCAG GAAATATCCC ATTTGCATCC TGATTTCAGC
CATCGTTCAG GGACGCGGAT GATTAACGAT GGCCGTTATA AGTTTGCGCG TTATTTCTCG
CTAAGGGAGC ATAATACGCC GGAAACCTGG GAGGATCTTA TTAAGTACAA CGATCTTGAA
CTTTACGATC TTAAAAATGA TCCCGACGAG AACCATAACC TTGCTGCTGA TAAACAGAAA
TATCAGGATC TCATTCTTAC GATGAATGAA AAACTGAATA AAATTATCAA GGACGAAATT
GGTGTGGATG ACGGCAGTTT TATGCCGGAT GCGGCCCATG AGCCGTGGGA TCTTACTATT
GAGCAGTTTA ACCGCATGGC GAAAGATTAA
 
Protein sequence
MSNKKNLSAE ETDLTRRKLL TSAGILAAGG MLSGAVKADE KCAVKAKPAW DKPFTGEIPE 
KLPEGYNILL VVTDQERFFP TFPFPVPGRE RLMKTGVTFC NHQNTSNVCT PSRSVLYTGL
HMPQTKMFDN LGLPWMPYDL DPALGTTGHM MRELGYYTAY KGKWHLTEKL EKPLPDEKDE
DIDVGDIPEP ELHKIMEKYG FADYHGIGDI IGHSKGGYFY DSTTTAQTIN WLRCKGQPLN
DQHKPWFLAV NLVNPHDVMF IDTDKEGEKV QWRGELDQDD NTLAPTQPPE NELYQASWPN
YPLPANRHQS FNEQGRPPAH LEYQTARAAL EGQFPDEDRR WRKLLDYYFN CIRDCDTHLD
RILNELDALK LTDKTIVVFT ADHGELGGSH QMHGKGASVY KEQIHVPMII SHPAYPGNKK
CQALTCHLDI APTLVGLTGL PEEKQHQALG NRKGVNFSGL LKNPESVAVN AVRNASLYCY
GMILYTDAHY LHRVIALQRD KQKTVAQIKQ EISHLHPDFS HRSGTRMIND GRYKFARYFS
LREHNTPETW EDLIKYNDLE LYDLKNDPDE NHNLAADKQK YQDLILTMNE KLNKIIKDEI
GVDDGSFMPD AAHEPWDLTI EQFNRMAKD