Gene SNSL254_A4260 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4260 
Symbol 
ID6483629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4150856 
End bp4152679 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content47% 
IMG OID642739510 
Productarylsulfotransferase 
Protein accessionYP_002043209 
Protein GI194444263 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.790155 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTTA AATATGCTTT AACTTCTCTC GCATTATCTG TTGCAATTTT GTCATCAGTA 
CCTTCTACTG CTTTCGCCAT CGGCGGCGCC AGCGGCGCTA AAGTGGACTA TCAGGTCCAG
GGAAAAATTG GCGAAGTTGT TATGAACCCC TATGATATCG CGCCGCTAAC CGCCGTTATT
CGTAATGGCG GTTACCAGTT ACGTGACGTG CATGTACGGA TTGTACCCAA AGAAAATGGC
CAGGAGATCG CGTATAAAGT TAATAATAAA TACCTTTTAA CGTATGGCGG TATTCCCGTC
TTTGGTCTTT ACCCGGATTA TGTCAATACC GTTGAAGTTG AATATACAAG GATCCAGGGT
AGTAAAACCG AAAATGTAAA AGAAAGCTAT AAAATGTATG CGCCGCCTGC CTATATTGAA
TCAGCGGGTA CAAAAGAAGA ACAATCAGCA CTCTTTACTA TCGATGTTAA AAAGGTTTCC
CCAGAATTTA AAGATCGCTT GTATCTTTTG AATAATACGA AAGATAAGTC TGGGAATGGA
ACGCGTACTG TCTGGAACAA CCCTACTGGG GGGGCATTAG AATGGAACTT CACTACAGCT
AACGCTATTA TCGACACCTC CGGTGATATT CGTTGGTTTA TGAATCCAAG TTCAATTTAT
GATTTAAAGT CAATTTATCG TGCTGGCGTT ATGATGGGCT TTAAACAAAA CCAGGATGGC
GCACTATCGT GGGGCTACGG TCAGCGTTAT GTGAAATACG ATATCATGGG GCGTGAAATC
TTTAACCGCC GCCTGCCGGA TAATTATAAC GATTTTTCAC ACTCAATGGA TAACGCGGCC
AACGGTCACT ACTTCCTGCG TGTAGCCAGC TCTAACTATA AACGTCCTGA TGGGAAAAAT
GTTCGTACCG TGCGTGATGT GATTGCCGAA GTTGATCAGA ACGGCGTGGT AGTGGATGAA
TGGCGTCTGT TTGATATCCT CGATCCTTAT CGTGATGTGA TAATGAAAAC CCTCGATCAG
GGCGCAGTGT GCCTGAATAT CGACGCCAGC CAGTCCGGCC ATACGTTGAG CGAAGAAGAT
CTGGCGGCGC TGGACTCCTC CGACAAATTC GGGGATATCG TGGGTAGTGG GGCTGGCCGC
AACTGGGCGC ATGTCAACAG CGTCGACTAT GACAGTGAAG ATGATTCCAT CATCATCAGC
TCCCGCCACC AGAGTGCGAT TATCAAAATC GGCCGCGATA AGAAAGTGAA GTGGATACTG
GGTACGCCTG CTGGCTGGAA AGCGCCATTT AATGCCGCAA TTCTGACGCC AGTGGATAGC
AAAGGCCAAA AAATCGCCTG CCAGGACAGT GGCTGCGAGG GTGACTTCGA CTGGACATGG
ACGCAACATA CGGCCTTTAA AATTGATAGT AAGAGTAAAG GCGATATCTT ATACCTTTCC
GCTTTCGACA ATGGCGATGG CCGCGGCTTA GAACAGCCTG CTATGCAGAG TATGAAATAC
AGCCGCTCCG TGATTTACAA AATCGACCAG AAAAACAAGA CAGTCCAACA GATCTGGCAA
TACGGTAAAG AGCGCGGGAA CGAGTGGTTT AGCCCGGTAA CCTCTATCAC CGAGTACCAG
ACTGACAAGA ATTCTGTGTT CGTGTATTCC GCAACAGCAG GTGGTGCGTT TGATTTGTCG
GTAGGCGCAT TTACCAGCTT GCCTAATCCG TATCTGGAAG AGTTCAAATG GGGAGAAAAA
GAGCCTGCGG TTGAAATGCA AATACATGGT GCGCGTGGAT ATCAGGCTAT GCCATTTAGC
CTGACCAAAG CGCTTACTGA GTAG
 
Protein sequence
MKFKYALTSL ALSVAILSSV PSTAFAIGGA SGAKVDYQVQ GKIGEVVMNP YDIAPLTAVI 
RNGGYQLRDV HVRIVPKENG QEIAYKVNNK YLLTYGGIPV FGLYPDYVNT VEVEYTRIQG
SKTENVKESY KMYAPPAYIE SAGTKEEQSA LFTIDVKKVS PEFKDRLYLL NNTKDKSGNG
TRTVWNNPTG GALEWNFTTA NAIIDTSGDI RWFMNPSSIY DLKSIYRAGV MMGFKQNQDG
ALSWGYGQRY VKYDIMGREI FNRRLPDNYN DFSHSMDNAA NGHYFLRVAS SNYKRPDGKN
VRTVRDVIAE VDQNGVVVDE WRLFDILDPY RDVIMKTLDQ GAVCLNIDAS QSGHTLSEED
LAALDSSDKF GDIVGSGAGR NWAHVNSVDY DSEDDSIIIS SRHQSAIIKI GRDKKVKWIL
GTPAGWKAPF NAAILTPVDS KGQKIACQDS GCEGDFDWTW TQHTAFKIDS KSKGDILYLS
AFDNGDGRGL EQPAMQSMKY SRSVIYKIDQ KNKTVQQIWQ YGKERGNEWF SPVTSITEYQ
TDKNSVFVYS ATAGGAFDLS VGAFTSLPNP YLEEFKWGEK EPAVEMQIHG ARGYQAMPFS
LTKALTE