Gene SeHA_C0038 details

Gene Information

Locus tag: SeHA_C0038
Symbol:
ID: 6489197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 36895
End bp: 38388
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 50%
IMG OID: 642740329
Product: sulfatase
Protein accession: YP_002044003
Protein GI: 194451527
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
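
The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS that ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(36895, 38388)
print(length)                     # 1494
print(protein_length_aa(length))  # 497
```

Both results match the Gene Length (1494 bp) and Protein Length (497 aa) fields of the record.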


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 81
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAA CAGTCATTGC CAGCATGATT GGACTGGCGC TTTGTGCGAC AAGTACGTTA 
TCTGTCGTGC ACGCGGCTGC CGAAAAACGC CCGAACTTAG TTTTTATTGT GGCAGATGAT
CTTGGCTATG GTGATTTGGC GACTTACGGT CACCAGATAG TGAAAACCCC GAATATTGAC
CGCCTGGCGA AAGAAGGGGT CAAATTCACT GAATACTATG CGCCGGCTCC TTTGTGTTCT
CCTTCCCGCG CCGGGATGTT AACAGGCCGT ATGCCCTTCC GCACCGGTGT ACGCTCCTGG
ATCCCTGAAG GTACTAATGT TTCTATTGGA CGTAACGAAT TGACCATAGC CAATCTGCTG
AAACAGCAGG GTTACGATAC CGCGATGATG GGTAAACTGC ATCTCAACGC CGGGGGCGAT
CGTACCGATC AGCCGCAGCC GAAAGAGTTA GGTTTTGATT ACTCTCTGGT GAACCCGGCT
GGCTTTGTGA CCGATGCTAC GCTGGATAAT GCCAAAGAGC GCCCGCGTTA CGGCGTCGTC
CACCCAACCG GCTGGATGCG TAACGGCAAA CATATTGATC GTGCCGATAG CATCAGCGGC
GAATTTGTCA GTTCTGAAGT GGTGAACTGG CTAGATAACA AGAAAGACGA CAAACCGTTC
TTCTTATACG TTGCCTTTAC GGAAGTTCAC AGCCCACTGG CCTCGCCGAA AAAATACCTC
GATATGTATT CACAATACAT GAGCGAATAC CAGAAGCAGC ATCCCGATCT GTTCTACGGC
GACTGGGCGG ATAAACCGTG GCGCGGTACC GGTGAATATT ACGCCAATAT CAGCTATATG
GATGCGCAAG TCGGTAAAGT ACTGGATAAA ATTAAAGCGA TGGGGGAAGA AGATAATACC
ATTGTCATCT TCACCAGTGA TAACGGCCCG GTGACGCGTG AAGCGCGTAA AGTCTACGAG
CTGAACCTCG CAGGCGAGAC TGACGGTCTG CGCGGACGTA AAGATAACCT GTGGGAAGGC
GGTATTCGCG TTCCGGCAAT TATCAAATAC GGTAAACACA TTCCACAGGG CGTCGTTACC
GATACGCCTG TGTATGGTCT GGACTGGATG CCGACGCTGG CGAACATGAT GGATTTCAAA
CTGCCTACTG ATCGTACATA CGACGGGCAA TCGCTGGTTC CGCTGTTGGA GCAGAAAACG
TTAAAACGTA ATAAGCCGCT GATCTTCGGT ATTGATATGC CGTTCCAGGA TGATCCGACC
GATGAATGGG CGATCCGCGA CGGTGACTGG AAAATGATCA TCGATCGTGA AAATAAACCT
AAATACCTCT ATAACCTGAA AAAAGATCGG TTCGAAACGC TGAACCAAAT TGGCAAACAG
CCTGAAATTG AAAAACAGCT GTATGGCAAG TTCCTGAAAA TGAAACAGGA CATCGATAAC
GACTCGTTAA TGAAAGCCCG TGGCGACAAA CCAACGCCGG TGACCTGGGG CTAA
 
Protein sequence
MKKTVIASMI GLALCATSTL SVVHAAAEKR PNLVFIVADD LGYGDLATYG HQIVKTPNID 
RLAKEGVKFT EYYAPAPLCS PSRAGMLTGR MPFRTGVRSW IPEGTNVSIG RNELTIANLL
KQQGYDTAMM GKLHLNAGGD RTDQPQPKEL GFDYSLVNPA GFVTDATLDN AKERPRYGVV
HPTGWMRNGK HIDRADSISG EFVSSEVVNW LDNKKDDKPF FLYVAFTEVH SPLASPKKYL
DMYSQYMSEY QKQHPDLFYG DWADKPWRGT GEYYANISYM DAQVGKVLDK IKAMGEEDNT
IVIFTSDNGP VTREARKVYE LNLAGETDGL RGRKDNLWEG GIRVPAIIKY GKHIPQGVVT
DTPVYGLDWM PTLANMMDFK LPTDRTYDGQ SLVPLLEQKT LKRNKPLIFG IDMPFQDDPT
DEWAIRDGDW KMIIDRENKP KYLYNLKKDR FETLNQIGKQ PEIEKQLYGK FLKMKQDIDN
DSLMKARGDK PTPVTWG
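
The gene and protein sequences above are shown in space-separated blocks of ten characters. The following is a minimal sketch of how such a block could be cleaned, translated, and checked for GC content. It assumes the amino-acid assignments of translation table 11 (which match the standard genetic code; the tables differ only in permitted start codons) and treats the helper names `clean_sequence`, `translate`, and `gc_percent` as hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate DNA codon by codon; stop at the first stop codon if to_stop."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
prefix = clean_sequence("ATGAAAAAAA CAGTCATTGC CAGCATGATT")
print(translate(prefix))  # MKKTVIASMI
```

The printed prefix agrees with the first block of the protein sequence. Passing the full cleaned gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.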