Gene SeHA_C3443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3443 
Symbol 
ID6490888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3346662 
End bp3348458 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content49% 
IMG OID642743574 
Productarylsulfate sulfotransferase 
Protein accessionYP_002047189 
Protein GI194449141 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.30935 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones80 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGACC AATACCGGAA AACAATACTT GCCGGTGCCG TCGCACTGAC ATGCGGACTC 
ACCGCAGCCA GTACGTTTGC CGCAGGTTTT CAACCGGCAC AGCCCGCAGG AAAATTAGGC
GCAGTCGTTG TCGATCCTTA CGGAAATGCC CCTCTCACCG CGCTGGTGGA ATTAGATAGC
CATATTATTT CAGACGTTAA AGTTACTGTA CATGGCAAAG GGGAAAAAGG CGTTCCTGTT
ACTTATACCG TTGGGAAAGA GTCTTTAGAA ACCTATGACG GTATTCCTAT TTTTGGCCTT
TATCAGAAAT TTGCCAACAA CGTCACGGTA GAATATAAAG AAAACGGCAA AGCCATGAAG
GATGACTATG TGGTGCAGAC GTCCGCCATC GTCAACCATT ATATGGATAA CCGTTCTATT
TCAGATCTCC AGCAAACGAA AGTTATTAAA GTCGCGCCAG GATTTGAAGA TCGCCTTTAT
CTGGTAAATA CCCATACCTT TACGCCGCAG GGCGCTGAAT TTCACTGGCA CGGCGAAAAA
GATAAAAATG CGGGCATTCT TGATGCCGGC CCGGCGGGCG GGGCTTTGCC TTTCGATATC
GCCCCTTATA CGTTTGTGGT CGACACCCAG GGTGAATATC GCTGGTGGCT GGATCAAGAT
ACCTTCTACG ACGGCCACGA TATGAATATC AACAAACGCG GCTACCTGAT GGGTATTCGT
GAAACGCCTC GCGGCACCTT TACCGCGGTG CAGGGCCAAC ACTGGTACGA GTTTGACATG
ATGGGGCAAA TTCTTGCCGA TCACAAACTG CCGCGCGGGT TCCTGGATGC GTCTCATGAA
TCCATCGAAA CCGTGAACGG CACCGTACTG CTGCGCGTCG GCAAACGCGA TTACCGCAAA
GAAGACGGCA TACATGTTCA TACGATTCGT GACCAAATCA TTGAGGTTGA TAAGTCTGGC
CGCGTAGTAG ACGTTTGGGA TTTAACCAAA ATCCTCGACC CTATGCGTGA TGCGCTGCTC
GGCGCGCTGG ATGCGGGCGC AGTATGCGTG AACGTCGATC TGGCCCATGC CGGACAGCAG
GCGAAACTCG AACCGGATAC GCCGTATGGC GATGCGCTTG GCGTTGGTGC CGGTCGTAAC
TGGGCGCACG TCAACTCTAT CGCTTATGAC GCGAAAGACG ACTCCATCAT CCTTTCTTCC
CGCCATCAGG GTATTGTAAA AATTGGTCGC GATAAGCAGG TGAAATGGAT ACTGGCACCG
TCTAAGGGCT GGAATAAGCA GCTAGCCAGT AAATTGCTGA AACCGGTAGA CGATCATGGT
AAGCCGTTGA CCTGTGACGA AAACGGCAAG TGTAAGGACA CCGATTTCGA TTTCACCTAT
ACCCAACATA CGGCATGGCT TTCCAGCAAA GGCACGTTAA CGGTCTTTGA TAACGGCGAT
GGTCGCGGCC TGGAGCAACC GGCTCTACCG ACCATGAAAT ATTCCCGTTT TGTCGAATAT
AAGATCGATG AGAAGAAAGG CACCGTACAA CAAGTTTGGG AATACGGTAA AGAACGTGGA
TATGATTTCT ATAGTCCTAT TACCTCGGTT GTTGAATATC AAAAAGACCG CGACACCATG
TTCGGCTTTG GCGGTTCTAT TAACCTGTTC GACGTTGGTA AACCCACAGT CGGCAAACTG
AATGAGATTG ACTATAAAAC GAAAGAAGTG AAAGTTGAAA TTGATGTGCT GTCGGATAAG
CCCAACCAGA CTCACTATCG TGCGTTACTG GTTCATCCAA CGCAAATGTT TAAATAA
 
Protein sequence
MFDQYRKTIL AGAVALTCGL TAASTFAAGF QPAQPAGKLG AVVVDPYGNA PLTALVELDS 
HIISDVKVTV HGKGEKGVPV TYTVGKESLE TYDGIPIFGL YQKFANNVTV EYKENGKAMK
DDYVVQTSAI VNHYMDNRSI SDLQQTKVIK VAPGFEDRLY LVNTHTFTPQ GAEFHWHGEK
DKNAGILDAG PAGGALPFDI APYTFVVDTQ GEYRWWLDQD TFYDGHDMNI NKRGYLMGIR
ETPRGTFTAV QGQHWYEFDM MGQILADHKL PRGFLDASHE SIETVNGTVL LRVGKRDYRK
EDGIHVHTIR DQIIEVDKSG RVVDVWDLTK ILDPMRDALL GALDAGAVCV NVDLAHAGQQ
AKLEPDTPYG DALGVGAGRN WAHVNSIAYD AKDDSIILSS RHQGIVKIGR DKQVKWILAP
SKGWNKQLAS KLLKPVDDHG KPLTCDENGK CKDTDFDFTY TQHTAWLSSK GTLTVFDNGD
GRGLEQPALP TMKYSRFVEY KIDEKKGTVQ QVWEYGKERG YDFYSPITSV VEYQKDRDTM
FGFGGSINLF DVGKPTVGKL NEIDYKTKEV KVEIDVLSDK PNQTHYRALL VHPTQMFK