Gene SNSL254_A0036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0036 
Symbol 
ID6485736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp35332 
End bp37050 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content46% 
IMG OID642735480 
Productarylsulfotransferase 
Protein accessionYP_002039262 
Protein GI194446355 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.719743 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value0.23108 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGA AAAGTTCGTC AATGGTTAAC ATGCCCGCAC CGCGTGAGCC GATTAACCAG 
AAAATCGATA CCAATAACGC ACTGGTTTTA AACCATAACG CCATATATGA ACAACGATTA
GCGGAGATCA CGCAATCTAA TACCTGTGAC AAGGCCATTG TCACCGTAAA TCCCTACGGG
ACCGCCCCGT TGAGTCTCTA TCTGGGGGTT TGGATGGATG AAGCTGCCGC GCTTGAGATC
AATGTTGTTG ATAGCGAAGC GACGACAGAG GCAGTGCGTT ATCAATATGA TGTACATCCG
GGCGCTAACC TTATTCCTGT GTGTGGGATG GTATCCGCGG TGAATAATCA GATTACCCTA
CGCCTTGCCT CGCAAATTGT CGGGCAATAT ACAGTAATGA CAGACGCATT ACCGCCCACG
GATTCGGCTA ACGTGAGCCT CGGCTTCCCC ATTATTAGCG TCTCCTGTCC TGCGCAGCAG
GCCTCGCTGA TGGAGGAAGG GCTTTATTTC TCTACTTATT TTGATCGGTA TAATCTGGCT
TTTGATCATA ACGGGATTGT CCGGTGGTAT GTCAGCCAGG ATATCCCATC TTATAATTTT
GTCAGAATGG ATAACGGCCA TTTCCTGGCG ACGTCACAGG GAATAAACCA TTGTCTGAAT
ATGTATGAAT TTGACATTAT GGGACGGGTT TATACGGTTT ATCTTCTCGA CAATGAGTTC
CATCACTCCA TTCTTCCCAT TGAGAACAAT CTGGCAGTTG CGCCTTCAGA ATATAGCAAT
GGACGGCCAG ATGGTTACTC AACCGGGAAA GATGGCGTTT CTATTATTAA CTTATCTACC
GGACTTGAAG TCGCCTATTA CGATATGCTG TATGTGATGG ATTATTCCAG ATCGCCGCGT
CCTTCCGGAA GCGCGCCAGG TCAGGACGTA TCAATGGATG ACTGGCTGCA TATCAACCAA
AGCTATATTA ATGAACCCAA CAATTTGCTG ATCTGTTCCG GTCGACATCA GAGCGCGATT
TTTGGCGTAA ATGTGGATTC CGGCGAACTG CGCTTTATTA TGGCGAACCA TGAGAATTGG
TCTGACGAAT TCAAGCAATA CTTATTAACC CCTGTCGATG ATGATGGTGT CCCGCTGTAC
GATCTTACCT CGCCGGGAGG GATTGATGCG GCAGATAAGA ATTTCTGGAC CTGGGGGCAG
CATAACATTG TTGAAATTCC AAACGATGAG CCTGGTATCC TGGAGTTTAT GGTCTTTGAT
AATGGTAACT ATCGTTCACG CGAAGATGCG AAAAGTCTGT TGCCGCTCGA TAACTTCAGC
CGGGTGGTGC AGTTTAAAAT AAACCTAAAT ACGATGACCG TAACGCGTCC GTATGAGTAT
GGTAAAACGG AAGTCGGGAA CCGGGGCTAT AGCAGTTTTG TGAGCGCTAA GCATTTATTG
ACTAATGGTC ACCTGGTTAT TCACTTCGGC GCGACGACGG TTGATGAGTT TGAACATACC
ATTACCGCGC AACCAGGTTC CAGCGATCTT GTCGATCCGG ATGAAGGGCA ACAGGCGTTA
GGTCGACTGG TATTACAAGA AATCAATAAA GAGACGAAAG AGGTCTTATT CGAAGCGATG
GTGACGTCGG GCTATTTCAA GAACGAAGAG ACGAATGGCA CGAATTATCG TTATGATATT
TCTGCATTTC GGGTATACAA AATGCCGCTG TTTGCATAA
 
Protein sequence
MNKKSSSMVN MPAPREPINQ KIDTNNALVL NHNAIYEQRL AEITQSNTCD KAIVTVNPYG 
TAPLSLYLGV WMDEAAALEI NVVDSEATTE AVRYQYDVHP GANLIPVCGM VSAVNNQITL
RLASQIVGQY TVMTDALPPT DSANVSLGFP IISVSCPAQQ ASLMEEGLYF STYFDRYNLA
FDHNGIVRWY VSQDIPSYNF VRMDNGHFLA TSQGINHCLN MYEFDIMGRV YTVYLLDNEF
HHSILPIENN LAVAPSEYSN GRPDGYSTGK DGVSIINLST GLEVAYYDML YVMDYSRSPR
PSGSAPGQDV SMDDWLHINQ SYINEPNNLL ICSGRHQSAI FGVNVDSGEL RFIMANHENW
SDEFKQYLLT PVDDDGVPLY DLTSPGGIDA ADKNFWTWGQ HNIVEIPNDE PGILEFMVFD
NGNYRSREDA KSLLPLDNFS RVVQFKINLN TMTVTRPYEY GKTEVGNRGY SSFVSAKHLL
TNGHLVIHFG ATTVDEFEHT ITAQPGSSDL VDPDEGQQAL GRLVLQEINK ETKEVLFEAM
VTSGYFKNEE TNGTNYRYDI SAFRVYKMPL FA