Gene SeAg_B0037 details


Gene Information

Locus tag: SeAg_B0037
Symbol:
ID: 6796044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 35338
End bp: 37056
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 46%
IMG OID: 642774349
Product: arylsulfotransferase
Protein accession: YP_002145013
Protein GI: 197251278
COG category:
COG ID:
TIGRFAM ID:
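
The coordinate and length fields above are related by simple arithmetic, which the following sketch checks. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record; 1-based inclusive coordinates are an assumption.
START_BP = 35338
END_BP = 37056


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


print(gene_length(START_BP, END_BP))                  # 1719
print(protein_length(gene_length(START_BP, END_BP)))  # 572
```

Both results agree with the Gene Length and Protein Length fields of the record.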


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAGA AAAGTTCGTC AATGGTTAAC ATGCCCGCAC CGCGTGAGCC GATTAACCAG 
AAAATCGATA CCAATAACGC ATTGGTGTTA AATCATAACG CCATATATGA ACAACGATTA
GCGGAGATCA CGCAATCTAA TACCTGTGAC AAGGCCATTG TCACCGTAAA TCCCTACGGG
ACCGCCCCGT TGAGTCTCTA TCTGGGGGTT TGGATGGATG AAGCTGCCGC GCTTGAGATC
AATGTTGTTG ATAGCGAAGC GACGACAGAG GAAGTGCGTT ATCAATATGA TGTACATCCG
GGCGCTAACC TTATTCCTGT GTGTGGGATG GTATCCGCGG TGAATAATCA GATTACCCTA
CGCCTTGCCT CGCAAATTGT CGGGCAATAT ACAGTAATGA CAGACGCATT ACCGCCCACG
GATTCGGCTA ACGTGAGCCT CGGTTTCCCT ATTATTAGCG TCTCCTGTCC TGCGCAGCAG
GCCTCGCTGA TGGAGGAAGG ACTTTATTTC TCCACTTATT TTGATCGGTA TAATCTGGCT
TTTGATCATA ACGGGATTGT CCGGTGGTAT GTAAGTCAGG AAATCCCTTC TTATAATTTT
GTCAGAATGG ATAACGGCCA TTTCCTGGCG ACGTCACAGG GAATAAACCA TTGCCTGAAT
ATGTATGAAT TTGACATTAT GGGACGGGTT TATACTGTTT ATCTTCTCGA CAATGAGTTC
CATCACTCCA TTCTTCCCAT TGAGAACAAT CTGGCGATTG CGCCTTCAGA ATATAGCAAT
GGACGGCCAG ACGGTTACTC AACCGGGAAA GATGGCGTTT CTATTATTAA CTTATCTACC
GGACTTGAAG TCGCCTATTA CGATATGCTG TATGTGATGG ATTATTCCAG ATCGCCGCGT
CCTTCCGGAA GCGCGCCAGG TCAGGATGTA TCAATGGATG ACTGGCTGCA TATCAACCAA
AGCTATATTA ATGAACCCAA CAATTTGCTG ATCTGTTCCG GTCGACATCA GAGCGCGATT
TTTGGCGTAA ATGTGGATTC CGGCGAACTG CGCTTTATTA TGGCGAACCA TGAGGATTGG
TCTGACGAAT TCAAGCAATA CTTATTAACC CCTGTCGATG ATGATGGTGT CCCGCTGTAC
GATCTTACCT CGCCGGGAGG GATTGATGCG GCAGATAAGA ATTTCTGGAC CTGGGGGCAG
CATAACATTG TTGAAATTCC AAACGATGAG CCTGGTATCC TGGAGTTTAT GGTCTTTGAT
AATGGTAACT ATCGTTCACG CGAAGATGCG AAAAGTCTGT TGCCGCTCGA TAACTTCAGC
CGGGTGGTGC AGTTTAAAAT AAACCTAAAC ACGATGACCG TAACGCGTCC GTATGAATAT
GGTAAAACGG AAGTCGGGAA CCGGGGCTAT AGCAGTTTTG TGAGCGCTAA GCATTTATTG
ACTAATGGTC ACCTGGTTAT TCACTTCGGC GCGACGACGG TTGATGAGTT TGAACATACC
ATTACCGCGC AACCAGGTTC CAGCGATCTT GTCGATCCGG ATGAAGGGCA ACAGGCGTTA
GGCCGACTGG TATTACAAGA AATCAATAAA GAGACGAAAG AGGTTTTATT CGAAGCGATG
GTGACGTCGG GCTATTTCAA GAACGAAGAG ACGAATGGCA CGAATTATCG TTATGATATT
TCTGCATTTC GGGTATACAA AATGCCGCTG TTTGCATAA
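
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before the sequence can be analysed. A minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence (the record does not state how its 46% figure was computed); the helper names are hypothetical:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAATAAGA AAAGTTCGTC AATGGTTAAC ATGCCCGCAC CGCGTGAGCC GATTAACCAG"
print(len(clean_sequence(first_line)))  # 60
print(gc_percent(first_line))
```

Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field.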
 
Protein sequence
MNKKSSSMVN MPAPREPINQ KIDTNNALVL NHNAIYEQRL AEITQSNTCD KAIVTVNPYG 
TAPLSLYLGV WMDEAAALEI NVVDSEATTE EVRYQYDVHP GANLIPVCGM VSAVNNQITL
RLASQIVGQY TVMTDALPPT DSANVSLGFP IISVSCPAQQ ASLMEEGLYF STYFDRYNLA
FDHNGIVRWY VSQEIPSYNF VRMDNGHFLA TSQGINHCLN MYEFDIMGRV YTVYLLDNEF
HHSILPIENN LAIAPSEYSN GRPDGYSTGK DGVSIINLST GLEVAYYDML YVMDYSRSPR
PSGSAPGQDV SMDDWLHINQ SYINEPNNLL ICSGRHQSAI FGVNVDSGEL RFIMANHEDW
SDEFKQYLLT PVDDDGVPLY DLTSPGGIDA ADKNFWTWGQ HNIVEIPNDE PGILEFMVFD
NGNYRSREDA KSLLPLDNFS RVVQFKINLN TMTVTRPYEY GKTEVGNRGY SSFVSAKHLL
TNGHLVIHFG ATTVDEFEHT ITAQPGSSDL VDPDEGQQAL GRLVLQEINK ETKEVLFEAM
VTSGYFKNEE TNGTNYRYDI SAFRVYKMPL FA
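
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below builds the codon table from the NCBI amino acid string, which for table 11 uses the same codon assignments as the standard code. It does not treat alternative start codons specially, which is sufficient here because this gene begins with ATG. The `translate` helper is illustrative and is not the database's own tooling.

```python
from itertools import product

# NCBI codon order: bases T, C, A, G at each position, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted CDS, dropping one trailing stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence from the record.
print(translate("ATGAATAAGA AAAGTTCGTC AATGGTTAAC"))  # MNKKSSSMVN
```

The output matches the first block of the protein sequence, and the final codons of the gene (TTT GCA TAA) translate to the closing residues FA followed by the stop.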