Gene SeSA_A0036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A0036 
Symbol 
ID6516767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp35340 
End bp37058 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content46% 
IMG OID642745211 
Productarylsulfotransferase 
Protein accessionYP_002113043 
Protein GI194736972 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.453558 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.16743 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGA AAAGTTCGTC AATGGTTAAC TTGCCCGCAC CGCGTGAGCC GATTAACCAG 
AAAATCGATA CCAATAACGC ATTGGTGTTA AACCATAACG CCATATATGA ACAACGATTA
GCGGAGATCA CGCAATCTAA TACCTGTGAC AAGGCCATTG TCACCGTAAA TCCCTACGGG
ACCGCCCCGT TGAGTCTCTA TCTGGGGATT TGGATTGATG AAGCTGCCGC GCTTGAGATC
AATGTTGTTG ATAGCGAAGC GACGACAGAG GCAGTGCGTT ATCAATATGA TGTACATCCG
GGCGCTAACC TTATTCCTGT GTGTGGGATG GTATCCGCGG TGAATAATCA GATTACCCTA
CGCCTTGCCT CGCAAATTGT CGGGCAATAT ACAGTAATGA CAGACGCATT ACCGCCCACG
GATTCGGCTA ACGTGAGCCT CGGTTTCCCT ATTATTAGCG TCTCCTGTCC TGCGCAGCAG
GCCTCGCTGA TGGAGGAAGG GCTTTATTTC TCCACTTATT TTGATCGGTA TAATCTGGCT
TTTGATCATA ACGGGATTGT CCGGTGGTAT GTAAGTCAGG AAATCCCTTC TTATAATTTT
GTCAGAATGG ATAATGGCCA TTTCCTGGCG ACGTCACAGG GAATAAACCA TTGTCTGAAT
ATGTATGAAT TTGACATTAT GGGACGGGTT TATACGGTTT ATCTTCTCGA CAATGAGTTC
CATCACTCCA TTCTTCCCAT TGAGAACAAT CTGGCGATTG CGCCTTCAGA ATATAGCAAT
GGACGGCCAG ATGGTTACTC AACCGGGAAA GATGGCGTTT CTATTATTAA CTTATCTACC
GGACTTGAAG TCGCCTATTA CGATATGCTG TATGTGATGG ATTATTCCAG ATCGCCGCGT
CCTTCCGGAA GCGCGCCAGG TCAGGACGTA TCAATGGATG ACTGGCTGCA TATCAACCAA
AGCTATATTA ATGAACCCAA CAATTTGCTG ATCTGTTCCG GTCGACATCA GAGCGCGATT
TTTGGCGTAA ATGTGGATTC CGGCGAACTG CGCTTTATTA TGGCGAACCA TGAGGATTGG
TCTGACGAAT TCAGGCAATA CTTATTAACC CCTGTCGATG ATGATGGCGT CCCGCTGTAT
GATCTTACCT CGCCGGGAGG GATTGATGCG GCAGATAAGA ATTTCTGGAC CTGGGGGCAG
CATAACATTG TTGAAATTCC AAATGATGAG CCTGGTATCC TGGAGTTTAT GGTCTTTGAT
AATGGTAACT ATCGTTCACG CGAAGATGCG AAAAGTCTGT TGCCGCTCGA TAACTTCAGC
CGGGTGGTGC AGTTTAAAAT AAACCTAAAC ACGATGACCG TAACGCGTCC GTATGAATAT
GGTAAAACGG AAGTCGGGAA CCGGGGCTAT AGCAGTTTTG TGAGCGCTAA GCATTTATTG
ACTAATGGTC ACCTGGTTAT TCACTTCGGC GCGACGACGG TTGATGAGTT TGAACATACC
ATTACCGCGC AACCAGGTTC CAGCGATCTT GTCGATCCGG ATGAAGGGCA ACAGGCGTTA
GGCCGACTGG TATTACAAGA AATCAATAAA GAGACGAAAG AGGTTTTATT CGAAGCGATG
GTGACGTCGG GCTATTTCAA GAACGAAGAG ACGAATGGCA CGAATTATCG TTATGATATT
TCTGCATTTC GGGTATACAA AATGCCGCTG TTTGCATAA
 
Protein sequence
MNKKSSSMVN LPAPREPINQ KIDTNNALVL NHNAIYEQRL AEITQSNTCD KAIVTVNPYG 
TAPLSLYLGI WIDEAAALEI NVVDSEATTE AVRYQYDVHP GANLIPVCGM VSAVNNQITL
RLASQIVGQY TVMTDALPPT DSANVSLGFP IISVSCPAQQ ASLMEEGLYF STYFDRYNLA
FDHNGIVRWY VSQEIPSYNF VRMDNGHFLA TSQGINHCLN MYEFDIMGRV YTVYLLDNEF
HHSILPIENN LAIAPSEYSN GRPDGYSTGK DGVSIINLST GLEVAYYDML YVMDYSRSPR
PSGSAPGQDV SMDDWLHINQ SYINEPNNLL ICSGRHQSAI FGVNVDSGEL RFIMANHEDW
SDEFRQYLLT PVDDDGVPLY DLTSPGGIDA ADKNFWTWGQ HNIVEIPNDE PGILEFMVFD
NGNYRSREDA KSLLPLDNFS RVVQFKINLN TMTVTRPYEY GKTEVGNRGY SSFVSAKHLL
TNGHLVIHFG ATTVDEFEHT ITAQPGSSDL VDPDEGQQAL GRLVLQEINK ETKEVLFEAM
VTSGYFKNEE TNGTNYRYDI SAFRVYKMPL FA