Gene SeSA_A4188 details

Gene Information

Locus tag: SeSA_A4188
Symbol:
ID: 6516589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 4064499
End bp: 4066322
Gene Length: 1824 bp
Protein Length: 607 aa
Translation table: 11
GC content: 47%
IMG OID: 642749154
Product: arylsulfotransferase
Protein accession: YP_002116906
Protein GI: 194735990
COG category:
COG ID:
TIGRFAM ID:
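
The coordinate and length fields above are consistent with each other, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly) and that the last codon is a stop codon, as the TAG at the end of the gene sequence suggests. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4064499
end_bp = 4066322

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1824
print(protein_length)   # 607
```

Both results match the Gene Length (1824 bp) and Protein Length (607 aa) fields.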


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTTA AATATGCTTT AACTTCTCTC GCATTATCTG TTGCAATTTT GTCATCAGTA 
CCTTCTACTG CTTTCGCCAT CGGCGGCGCC AGCGGCGCTA AAGTGGACTA TCAGGTCCAG
GGGAAAATTG GCGAAGTTGT TATGAACCCC TATGATATCG CGCCGCTAAC CGCCGTTATT
CGTAATGGCG GTTACCAGTT ACGTGACGTG CATGTACGGA TTGTACCCAA AGAAAATGGC
CAGGAGATCG CGTATAAAGT TAATAATAAA TACCTTTTAA CGTATGGCGG TATTCCCGTC
TTTGGTCTTT ACCCAGATTA TGTCAATACC GTTGAAGTTG AATATACAAG AATCCAGGGT
AGTAAAACCG AAAATATAAA AGAAAGCTAT AAAATGTATG CACCGCCTGC CTATATTGAA
TCAGCGGGTA CAAAGGAAGA ACAATCAGCA CTCTTTACTA TCGATGTTAA AAAGGTTTCC
CCAGAATTTA AAGATCGCTT GTATCTTTTG AATAATACGA AAGATAAGTC TGGGAATGGA
ACGCGTACTG TCTGGAACAA CCCTACTGGG GGTGCATTAG AATGGAACTT CACTACAGCT
AACGCTATTA TCGACACCTC CGGTGATATT CGTTGGTTTA TGAATCCAAG TTCAATTTAT
GATTTAAAGT CAATTTATCG TGCTGGCGTT ATGATGGGCT TTAAACAAAA CCAGGATGGC
GCACTATCGT GGGGCTACGG TCAGCGTTAT GTGAAATACG ATATCATGGG GCGTGAAATC
TTTAACCGCC GCCTGCCGGA TAATTATAAC GATTTTTCAC ACTCAATGGA TAACGCGGCC
AACGGTCACT ACTTCCTGCG TGTAGCCAGC TCTAACTATA AACGTCCTGA TGGAAAAAAT
GTTCGTACCG TGCGTGATGT GATTGCCGAA GTTGATCAGA ACGGCGTGGT AGTGGATGAA
TGGCGTCTGT TTGATATCCT CGATCCTTAT CGTGATGTGA TAATGAAAAC CCTCGATCAG
GGCGCTGTGT GCCTGAATAT CGACGCCAGC CAGTCCGGCC ATACGTTGAG CGAAGAAGAT
CTGGCGGCGC TGGACTCCTC CGACAAATTC GGGGATATCG TGGGTAGTGG GGCTGGCCGC
AACTGGGCGC ATGTCAACAG CGTCGACTAT GACAGTGAAG ATGATTCCAT CATCATCAGC
TCCCGCCACC AGAGTGCGAT TATCAAAATC GGCCGCGATA AGAAAGTGAA GTGGATACTG
GGTACGCCTG CTGGCTGGAA AGCGCCATTT AATGCCGCAA TTCTGACGCC AGTGGATAGC
AAAGGCCAAA AAATCGCCTG CCAGGACAGT GGCTGCGAGG GTGACTTCGA CTGGACATGG
ACGCAACATA CGGCCTTTAA AATTGATAGT AAGAGTAAAG GCGATATCTT ATACCTTTCC
GCTTTCGACA ATGGCGATGG CCGCGGCTTA GAACAGCCTG CTATGCAGAG TATGAAATAC
AGCCGCTCCG TGATTTACAA AATCGACCAG AAAAACAAGA CCGTCCAACA GATCTGGCAA
TACGGTAAAG AGCGCGGGAA CGAGTGGTTT AGCCCGGTAA CCTCTATCAC CGAGTACCAG
ACTGACAAGA ATTCTGTGTT CGTGTATTCC GCAACAGCAG GTGGTGCGTT TGATTTGTCG
GTAGGCGCAT TTACCAGCTT GCCTAATCCG TATCTGGAAG AGTTCAAATG GGGAGAAAAA
GAGCCTGTGG TCGAAATGCA AATACATGGT GCGCGTGGAT ATCAGGCTAT GCCATTTAGC
CTGACCAAAG CGCTTACTGA GTAG
 
Protein sequence
MKFKYALTSL ALSVAILSSV PSTAFAIGGA SGAKVDYQVQ GKIGEVVMNP YDIAPLTAVI 
RNGGYQLRDV HVRIVPKENG QEIAYKVNNK YLLTYGGIPV FGLYPDYVNT VEVEYTRIQG
SKTENIKESY KMYAPPAYIE SAGTKEEQSA LFTIDVKKVS PEFKDRLYLL NNTKDKSGNG
TRTVWNNPTG GALEWNFTTA NAIIDTSGDI RWFMNPSSIY DLKSIYRAGV MMGFKQNQDG
ALSWGYGQRY VKYDIMGREI FNRRLPDNYN DFSHSMDNAA NGHYFLRVAS SNYKRPDGKN
VRTVRDVIAE VDQNGVVVDE WRLFDILDPY RDVIMKTLDQ GAVCLNIDAS QSGHTLSEED
LAALDSSDKF GDIVGSGAGR NWAHVNSVDY DSEDDSIIIS SRHQSAIIKI GRDKKVKWIL
GTPAGWKAPF NAAILTPVDS KGQKIACQDS GCEGDFDWTW TQHTAFKIDS KSKGDILYLS
AFDNGDGRGL EQPAMQSMKY SRSVIYKIDQ KNKTVQQIWQ YGKERGNEWF SPVTSITEYQ
TDKNSVFVYS ATAGGAFDLS VGAFTSLPNP YLEEFKWGEK EPVVEMQIHG ARGYQAMPFS
LTKALTE
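
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the database's own pipeline; the helper names (`clean`, `translate`, `gc_percent`) are hypothetical. Translation table 11 assigns codons to amino acids in the same way as the standard genetic code and differs in the start codons it permits, which does not matter here because the gene starts with ATG. Only the first and last rows of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAAATTTA AATATGCTTT AACTTCTCTC GCATTATCTG TTGCAATTTT GTCATCAGTA"
last_row = "CTGACCAAAG CGCTTACTGA GTAG"

print(translate(first_row))  # MKFKYALTSLALSVAILSSV
print(translate(last_row))   # LTKALTE
```

The two outputs match the start and the end of the protein sequence. Passing the full 1824 bp gene sequence to `translate` and `gc_percent` would be the way to check the Protein Length and GC content fields.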