Gene SNSL254_A3450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3450 
Symbol 
ID6482518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3343832 
End bp3345628 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content49% 
IMG OID642738739 
Productarylsulfate sulfotransferase 
Protein accessionYP_002042459 
Protein GI194444727 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value0.921011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGACC AATACCGGAA AACAATACTT GCCGGTGCCG TCGCACTGAC ATGCGGACTC 
ACCGCAGCCA GTACGTTTGC CGCAGGTTTT CAACCGGCAC AGCCCGCAGG AAAATTAGGC
GCAGTCGTTG TCGATCCTTA CGGAAATGCC CCTCTCACCG CGCTGGTGGA ATTAGATAGC
CATATTATTT CAGACGTTAA AGTTACTGTA CATGGCAAAG GGGAAAAAGG CGTTCCTGTT
ACTTATACTG TTGGGAAAGA GTCTTTAGAA ACCTATGACG GTATTCCTAT TTTTGGCCTT
TATCAGAAAT TTGCCAACAA CGTCACGGTA GAATATAAAG AAAACGGCAA AGCCATGAAG
GATGACTATG TGGTGCAGAC GTCCGCCATC GTCAACCATT ATATGGATAA CCGTTCTATT
TCAGATCTCC AGCAAACGAA AGTTATTAAA GTTGCGCCAG GATTTGAAGA TCGCCTTTAT
CTGGTAAATA CCCATACCTT TACGCCGCAG GGCGCTGAAT TTCACTGGCA CGGCGAAAAA
GATAAAAATG CGGGCATTCT TGATGCCGGC CCGGCGGGCG GGGCTTTGCC TTTCGATATC
GCCCCTTATA CGTTTGTGGT CGACACCCAG GGTGAATATC GCTGGTGGCT GGATCAAGAT
ACCTTCTACG ACGGCCACGA TATGAATATC AACAAACGCG GCTACCTGAT GGGTATTCGT
GAAACGCCTC GCGGCACCTT TACCGCGGTG CAGGGCCAAC ACTGGTACGA GTTTGACATG
ATGGGGCAAA TTCTTGCCGA TCATAAACTG CCGCGCGGGT TCCTGGATGC GTCTCATGAA
TCCATCGAAA CCGTGAACGG CACCGTACTG CTGCGCGTCG GCAAACGCGA TTACCGCAAA
GAAGACGGCA TACATGTTCA TACTATTCGT GACCAAATCA TTGAGGTCGA TAAGTCTGGC
CGCGTAGTAG ACGTTTGGGA TTTAACCAAA ATCCTCGACC CTATGCGTGA TGCGCTGCTC
GGCGCGCTGG ATGCGGGCGC AGTATGCGTG AACGTCGATC TGGCCCATGC CGGACAGCAG
GCGAAACTTG AACCGGATAC GCCGTATGGT GATGCGCTTG GCGTTGGTGC CGGTCGTAAC
TGGGCGCACG TCAACTCTAT CGCTTATGAC GCGAAAGACG ACTCCATCAT CCTTTCTTCC
CGCCATCAGG GTATTGTAAA AATTGGTCGC GATAAGCAGG TGAAATGGAT ACTGGCACCG
TCTAAAGGCT GGAATAAGCA GCTAGCCAGT AAATTGCTGA AACCGGTAGA CGATCATGGT
AAGCCGTTGA CCTGTGACGA AAACGGCAAG TGTAAGGACA CCGATTTCGA TTTCACCTAT
ACCCAACATA CGGCATGGCT TTCCAGCAAA GGCACGTTAA CGGTCTTTGA TAACGGCGAT
GGTCGCGGCC TGGAGCAACC GGCTCTACCG ACCATGAAAT ATTCCCGTTT TGTCGAATAT
AAGATCGATG AGAAGAAAGG CACCGTACAA CAAGTTTGGG AATACGGTAA AGAACGTGGA
TATGATTTCT ATAGTCCTAT TACCTCGGTT GTTGAATATC AAAAAGACCG CGACACCATG
TTCGGCTTTG GCGGTTCTAT TAACCTGTTC GACGTTGGTA AACCCACAGT CGGCAAACTG
AATGAGATTG ACTATAAAAC GAAAGAAGTG AAAGTTGAAA TTGATGTGCT GTCGGATAAA
CCCAACCAGA CTCACTATCG TGCATTACTG GTTCATCCAA CGCAAATGTT TAAATAA
 
Protein sequence
MFDQYRKTIL AGAVALTCGL TAASTFAAGF QPAQPAGKLG AVVVDPYGNA PLTALVELDS 
HIISDVKVTV HGKGEKGVPV TYTVGKESLE TYDGIPIFGL YQKFANNVTV EYKENGKAMK
DDYVVQTSAI VNHYMDNRSI SDLQQTKVIK VAPGFEDRLY LVNTHTFTPQ GAEFHWHGEK
DKNAGILDAG PAGGALPFDI APYTFVVDTQ GEYRWWLDQD TFYDGHDMNI NKRGYLMGIR
ETPRGTFTAV QGQHWYEFDM MGQILADHKL PRGFLDASHE SIETVNGTVL LRVGKRDYRK
EDGIHVHTIR DQIIEVDKSG RVVDVWDLTK ILDPMRDALL GALDAGAVCV NVDLAHAGQQ
AKLEPDTPYG DALGVGAGRN WAHVNSIAYD AKDDSIILSS RHQGIVKIGR DKQVKWILAP
SKGWNKQLAS KLLKPVDDHG KPLTCDENGK CKDTDFDFTY TQHTAWLSSK GTLTVFDNGD
GRGLEQPALP TMKYSRFVEY KIDEKKGTVQ QVWEYGKERG YDFYSPITSV VEYQKDRDTM
FGFGGSINLF DVGKPTVGKL NEIDYKTKEV KVEIDVLSDK PNQTHYRALL VHPTQMFK