Gene SeD_A0035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0035 
Symbol 
ID6874555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp35340 
End bp37058 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content46% 
IMG OID642783293 
Productarylsulfotransferase 
Protein accessionYP_002213987 
Protein GI198242535 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.0785305 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGA AAAGTTCGTC AATGGTTAAC ATGCCCGCAC CGCGTGAGCC TATTAACCAG 
AAAATCGATA CCAATAACGC ACTGGTTTTA AACCATAACG CCATATATGA ACAACGATTA
GCGGAGATCA CGCAATCTAA TACCTGTGAC AAGGCCATTG TCACCGTAAA TCCCTACGGG
ACCGCCCCGT TGAGTCTCTA TCTGGGGGTT TGGATGGATG AAGCTGCCGC GCTTGAGATC
AATGTTGTTG ATAGCGAAGC GACGACAGAG GCAGTGCGTT ATCAATATGA TGTACATCCG
GGCGCTAACC TTATTCCTGT GTGTGGGATG GTATCCGCGG TGAATAATCA GATTACCCTA
CGCCTTGCCT CGCAAATTGT CGGGCAATAT ACAGTAATGA CAGACGCATT ACCGCCCACG
GATTCGGCTA ACGTGAGCCT CGGCTTCCCT ATTATTAGCG TCTCCTGTCC TGCGCAGCAG
GCCTCGCTGA TGGAGGAAGG ACTTTATTTC TCCACTTATT TTGATCGGTA TAATCTGGCT
TTTGATCATA ACGGGATTGT CCGGTGGTAT GTAAGTCAGG AAATCCCTTC TTATAATTTT
GTCAGAATGG ATAACGGCCA TTTCCTGGCG ACGTCACAGG GAATAAACCA TTGTCTGAAT
ATGTATGAAT TTGACATTAT GGGACGGGTT TATACGGTTT ATCTTCTCGA CAATGAGTTC
CATCACTCCA TTCTTCCCAT TGAGAACAAT CTGGCGATTG CGCCTTCAGA ATATAGCAAT
GGACGGCCAG ACGGTTACTC AACCGGGAAA GATGGCGTTT CTATTATTAA CTTATCTACC
GGACTTGAAG TCGCCTATTA CGATATGCTG TATGTGATGG ATTATTCCAG ATCGCCGCGT
CCTTCCGGAA GCGCGCCAGG TCAGGACGTA TCAATGGATG ACTGGCTGCA TATCAACCAA
AGCTATATTA ATGAACCCAA CAATTTGCTG ATCTGTTCCG GTCGACATCA GAGCGCGATT
TTTGGCGTAA ATGTGGATTC CGGCGAACTG CGCTTTATTA TGGCGAACCA TGAGGATTGG
TCTGACGAAT TCAAGCAATA CTTATTAACC CCTGTCGATG ATGATGGTGT CCCGCTGTAC
GATCTTACCT CGCCGGGAGG GATTGATGCG GCAGATAAGA ATTTCTGGAC CTGGGGGCAG
CATAACATTG TTGAAATTCC AAACGATGAG CCTGGTATCC TGGAGTTTAT GGTCTTTGAT
AATGGTAACT ATCGTTCACG CGAAGATGCG AAAAGTCTGT TGCCGCTCGA TAACTTCAGC
CGGGTGGTGC AGTTTAAAAT AAACCTAAAC ACGATGACCG TAACGCGTCC GTATGAATAT
GGTAAAACGG AAGTCGGGAA CCGGGGCTAT AGCAGTTTTG TGAGCGCTAA GCATTTATTG
ACTAATGGTC ACCTGGTTAT TCACTTCGGC GCGACGACGG TTGATGAGTT TGAACATACC
ATTACCGCGC AACCAGGTTC CAGCGATCTT GTCGATCCGG ATGAAGGGCA ACAGGCGTTA
GGTCGACTGG TATTACAAGA AATCAATAAA GAGACGAAAG AGGTCTTATT CGAAGCGATG
GTGACGTCGG GCTATTTCAA GAACGAAGAG ACGAATGGCA CGAATTATCG TTATGATATT
TCTGCATTTC GGGTATACAA AATGCCGCTG TTTGCATAA
 
Protein sequence
MNKKSSSMVN MPAPREPINQ KIDTNNALVL NHNAIYEQRL AEITQSNTCD KAIVTVNPYG 
TAPLSLYLGV WMDEAAALEI NVVDSEATTE AVRYQYDVHP GANLIPVCGM VSAVNNQITL
RLASQIVGQY TVMTDALPPT DSANVSLGFP IISVSCPAQQ ASLMEEGLYF STYFDRYNLA
FDHNGIVRWY VSQEIPSYNF VRMDNGHFLA TSQGINHCLN MYEFDIMGRV YTVYLLDNEF
HHSILPIENN LAIAPSEYSN GRPDGYSTGK DGVSIINLST GLEVAYYDML YVMDYSRSPR
PSGSAPGQDV SMDDWLHINQ SYINEPNNLL ICSGRHQSAI FGVNVDSGEL RFIMANHEDW
SDEFKQYLLT PVDDDGVPLY DLTSPGGIDA ADKNFWTWGQ HNIVEIPNDE PGILEFMVFD
NGNYRSREDA KSLLPLDNFS RVVQFKINLN TMTVTRPYEY GKTEVGNRGY SSFVSAKHLL
TNGHLVIHFG ATTVDEFEHT ITAQPGSSDL VDPDEGQQAL GRLVLQEINK ETKEVLFEAM
VTSGYFKNEE TNGTNYRYDI SAFRVYKMPL FA