Gene SeAg_B4211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B4211 
Symbol 
ID6795754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp4105853 
End bp4107676 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content47% 
IMG OID642778322 
Productarylsulfotransferase 
Protein accessionYP_002148906 
Protein GI197250771 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTTA AATATGCTTT AACTTCTCTC GCATTATCTG TTGCAATTTT GTCATCAGTA 
CCTTCTACTG CTTTCGCCAT CGGCGGCGCC AGCGGCGCTA AAGTGGACTA TCAGGTCCAG
GGAAAAATTG GCGAAGTTGT TATGAACCCC TATGATATCG CGCCGCTAAC CGCCGTTATT
CGTAATGGCG GTTACCAGTT ACGTGACGTG CATGTACGGA TTGTCCCCAA AGAAAATGGC
CAGGAGATCG CGTATAAAGT TAATAATAAA TACCTTTTAA CGTATGGCGG TATCCCCGTC
TTTGGTCTTT ACCCAGATTA TGTCAATACC GTTGAAGTTG AATATACAAG GATCCAGGGT
AGTAAAACCG AAAATATAAA AGAAAGCTAT AAAATGTATG CACCGCCTGC CTATATTGAA
TCAGCGGGTA CAAAAGAAGA ACAATCAGCA CTCTTTACTA TCGATGTTAA AAAGGTTTCC
CCAGAATTTA AAGATCGCTT GTATCTTTTG AATAATACGA AAGATAAGTC TGGGAATGGA
ACGCGTACTG TCTGGAACAA CCCTACTGGG GGTGCATTAG AATGGAACTT CACTACAGCT
AACGCTATTA TCGACACCTC CGGTGATATT CGTTGGTTTA TGAATCCAAG TTCAATTTAT
GATTTAAAGT CAATTTATCG TGCTGGCGTT ATGATGGGCT TTAAACAAAA CCAGGATGGC
GCACTATCGT GGGGCTACGG TCAGCGTTAT GTGAAATACG ATATCATGGG GCGTGAAATC
TTTAACCGCC GCCTGCCGGA TAATTATAAC GATTTTTCAC ACTCAATGGA TAACGCGGCC
AACGGTCACT ACTTCCTGCG TGTAGCCAGC TCTAACTATA AACGCCCTGA TGGGAAAAAT
GTTCGTACCG TGCGTGATGT GATTGCCGAA GTTGATCAGA ACGGCGTGGT AGTGGATGAA
TGGCGTCTGT TTGATATCCT CGATCCTTAT CGTGATGTGA TAATGAAAAC CCTCGATCAG
GGCGCTGTGT GCCTGAATAT CGACGCCAGC CAGTCCGGCC ATACGTTGAG CGAAGAAGAT
CTGGCGGCGC TGGACTCCTC CGACAAATTC GGGGATATCG TGGGTAGTGG GGCTGGCCGC
AACTGGGCGC ATGTCAACAG TGTCGACTAT GACAGTGAAG ATGATTCCAT CATCATCAGC
TCCCGCCACC AGAGTGCGAT TATCAAAATC GGCCGCGATA AGAAAGTGAA GTGGATACTG
GGTACGCCTG CTGGCTGGAA AGCGCCATTT AATGCCGCAA TTCTGACGCC AGTGGATAGC
AAAGGCCAAA AAATTGCTTG CCAGGACAGT GGCTGCGAGG GTGACTTCGA CTGGACATGG
ACGCAACATA CGGCCTTTAA AATTGATAGT AAGAGTAAAG GCGATATCTT ATACCTTTCC
GCTTTCGACA ATGGCGATGG CCGCGGCTTA GAACAGCCTG CTATGCAGAG TATGAAATAC
AGCCGCTCCG TGATTTACAA AATCGACCAG AAAAACAAGA CCGTCCAACA GATCTGGCAA
TACGGTAAAG AGCGCGGGAA CGAGTGGTTT AGCCCGGTAA CCTCTATCAC CGAGTACCAG
ACTGACAAGA ATTCTGTGTT CGTGTATTCC GCAACAGCAG GTGGTGCGTT TGATTTGTCG
GTAGGCGCAT TTACCAGCTT GCCTAATCCG TATCTGGAAG AGTTCAGATG GGGAGAAAAA
GAACCTGCGG TCGAAATGCA AATACATGGT GCGCGTGGAT ATCAGGCTAT GCCATTTAGC
CTGACCAAAG CGCTTACTGA GTAG
 
Protein sequence
MKFKYALTSL ALSVAILSSV PSTAFAIGGA SGAKVDYQVQ GKIGEVVMNP YDIAPLTAVI 
RNGGYQLRDV HVRIVPKENG QEIAYKVNNK YLLTYGGIPV FGLYPDYVNT VEVEYTRIQG
SKTENIKESY KMYAPPAYIE SAGTKEEQSA LFTIDVKKVS PEFKDRLYLL NNTKDKSGNG
TRTVWNNPTG GALEWNFTTA NAIIDTSGDI RWFMNPSSIY DLKSIYRAGV MMGFKQNQDG
ALSWGYGQRY VKYDIMGREI FNRRLPDNYN DFSHSMDNAA NGHYFLRVAS SNYKRPDGKN
VRTVRDVIAE VDQNGVVVDE WRLFDILDPY RDVIMKTLDQ GAVCLNIDAS QSGHTLSEED
LAALDSSDKF GDIVGSGAGR NWAHVNSVDY DSEDDSIIIS SRHQSAIIKI GRDKKVKWIL
GTPAGWKAPF NAAILTPVDS KGQKIACQDS GCEGDFDWTW TQHTAFKIDS KSKGDILYLS
AFDNGDGRGL EQPAMQSMKY SRSVIYKIDQ KNKTVQQIWQ YGKERGNEWF SPVTSITEYQ
TDKNSVFVYS ATAGGAFDLS VGAFTSLPNP YLEEFRWGEK EPAVEMQIHG ARGYQAMPFS
LTKALTE