Gene SNSL254_A0042 details


Gene Information

Locus tag: SNSL254_A0042
Symbol:
ID: 6485781
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 44303
End bp: 46018
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 50%
IMG OID: 642735486
Product: arylsulfotransferase
Protein accession: YP_002039268
Protein GI: 194446358
COG category:
COG ID:
TIGRFAM ID:
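The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(44303, 46018)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1716 571
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.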


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value: 0.176316
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATACGT TAACTACAAC GTCTGTTGTC CTTCCTGCGC CGCGTCCGGC GATTAATCAG 
GGTATCGATA TCAATAATGA AATGGTGCTT AACCATACCG CTATTTATGA AAATTGCCTT
GCGCAGGTCA CGCAAGAGAA TACGGTAGAA AATGCGCTCA TGTTGTTAGA CCCTTACGGC
ACGGCGCCTT TAAGCGCCTA TGCCGGCGTC TGGAGTCTGG AACCGGCTGA GATCATGGTC
ACGGTCCAGG ATGCGGCAAA AACGGCGATG CCGATAGAAC ATCTTTACAC CCTTACGCCA
GGCGCAAATC TGTTACCGGT TCTGGGGCTG GTAGCGGATA CTGAAAACCG TATTGTCTTT
TCTCAGGCAG ATACGCCGCT TGCCGTCTAT ACGCTCATCA CACAGCCATT ACCGCCGGTA
GATTCCGCGG AGGTCGTATT AGGTTTTCCG ATTATCAACG TGACGCAACC TGCTACCGAT
GCGGACAAGA TGGCGCCAGG GTTTTATTTT ATTACGCATT TCGATCGCTA TAATTACGCA
TTAGATCAGA ATGGTCTGGT GCGCTGGTAC GTTACTCAGG ATTACCCGTC TTATAATTTT
GTTCGAATTG ATAATGGCCA TTTCCTCACT ACTTCAGAAG CGAAAAATAC CTATCTGGAT
ATGTATGAGT TCGACATGAT GGGGCGTCTT CACACATTCT ATAATCTCGA TAATCAATTT
CACCATTCTA TCTGGCCGTG GGACAGCAAT ACCATTGTTG CGCCCTCTGA ATATACCTCG
GGTCGGCCCG ACGATTTGAA AACCAATGAA GACGGCGTAT CGGTTGTCGA TCTGACTACC
GGACTGGAGA CGGCTTACTA CGATATGGCG AAGGTGCTGG ATACGACGCG GGTTTCCCGT
CCTTCAGGTA CGGCGCCGGG AGAAGACCCG ACGGTTAAAG ACTGGCTGCA TATAAACCAG
AGCTACGTGA ATGAGACGAA TCAGTTGTTA ATTGCGTCCG GGCGTCATCA GAGCGCGGTG
TTTGGCGTCG ATCTGCAAAC GCAAGCGCTA CGCTTTATTT TGTCAACGCA TGAAGACTGG
GACGACGCTT ATCAGCCTTA TCTTTTAACC CCGGTCGACA GTGAAGGTGT GGCGCTTTAT
GACTTTAGCA AACAGGAGGA TATTGACGCG GCCGACCGTG ACTTTTGGAC GTGGGGCCAG
CATAACGTCG TTGAAATCGC CAATAATACG CCGGGTATAG TGGAGTTTAT GGTATTTGAT
AACGGTAACT ACCGTTCGCG TGATGACAGC AAAAGCCTGT TACCGCCGGA TAACTACAGC
CGCATTGTCC ATTTCGTGGT GAATATGAAT GAGATGACCG TTATGCGGCC ATTTGAATAC
GGCAAGGAGC TGGGCGCGCG TGGCTACAGT AGCTGCGTTA GCGCGAAAGC GATCCAGCAG
AATGGCAATA TTGTGGTGCA TTTTGCCGAC TGCACGTTTG ATGAAAATGG CCGCGCCATC
TCTTGCCAGC CTGGCGAGAG CGATATTATC GATCCGCAGG CGGGCAGTGA GGCTATGGGG
CTGCTAATTT TACAGGAGAT TGCGCCTACG GAGAAAACCG TGCTTTTTGA AGCGACCATG
ACGTCAGGTT ACTACAAAAA CGCGGAAACG AACGGGGAAG GCTATCGCTA CGATATTACC
AGTTTCCGGG TGTATAAAAT GGATCTGTAC GCGTAG
 
Protein sequence
MNTLTTTSVV LPAPRPAINQ GIDINNEMVL NHTAIYENCL AQVTQENTVE NALMLLDPYG 
TAPLSAYAGV WSLEPAEIMV TVQDAAKTAM PIEHLYTLTP GANLLPVLGL VADTENRIVF
SQADTPLAVY TLITQPLPPV DSAEVVLGFP IINVTQPATD ADKMAPGFYF ITHFDRYNYA
LDQNGLVRWY VTQDYPSYNF VRIDNGHFLT TSEAKNTYLD MYEFDMMGRL HTFYNLDNQF
HHSIWPWDSN TIVAPSEYTS GRPDDLKTNE DGVSVVDLTT GLETAYYDMA KVLDTTRVSR
PSGTAPGEDP TVKDWLHINQ SYVNETNQLL IASGRHQSAV FGVDLQTQAL RFILSTHEDW
DDAYQPYLLT PVDSEGVALY DFSKQEDIDA ADRDFWTWGQ HNVVEIANNT PGIVEFMVFD
NGNYRSRDDS KSLLPPDNYS RIVHFVVNMN EMTVMRPFEY GKELGARGYS SCVSAKAIQQ
NGNIVVHFAD CTFDENGRAI SCQPGESDII DPQAGSEAMG LLILQEIAPT EKTVLFEATM
TSGYYKNAET NGEGYRYDIT SFRVYKMDLY A
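The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, which this sketch does not model). The `translate` helper is hypothetical. It removes the whitespace used to group the sequence into blocks of ten, and it stops at the first stop codon.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGAATACGT TAACTACAAC GTCTGTTGTC"))  # MNTLTTTSVV
```

The output matches the first ten residues of the protein sequence above. Translating the final bases `GATCTGTAC GCGTAG` the same way gives `DLYA`, consistent with the end of the protein and the terminal TAG stop codon.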