Gene SeD_A3545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3545 
Symbol 
ID6872325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3401554 
End bp3403350 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content49% 
IMG OID642786535 
Productarylsulfate sulfotransferase 
Protein accessionYP_002217172 
Protein GI198242400 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value0.949086 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGACC AATACCGGAA AACAATACTT GCCGGCGCCG TCGCACTGAC ATGCGGACTC 
ACCGCAGCCA GTACGTTTGC CGCAGGTTTT CAACCGGCAC AGCCCGCAGG GAAATTAGGC
GCAGTCGTTG TCGATCCTTA CGGAAATGCC CCTCTCACCG CGCTGGTGGA ATTAGATAGC
CATATTATTT CAGACGTTAA AGTTACTGTA CATGGCAAAG GGGAAAAAGG CGTTCCTGTT
ACTTATACCG TTGGGAAAGA GTCTTTAGAA ACCTATGACG GTATTCCTAT TTTTGGCCTT
TATCAGAAAT TTGCCAACAA CGTCACGGTA GAATATAAAG AAAACGGCAA AGCCATGAAG
GATGACTATG TGGTGCAGAC GTCCGCCATC GTCAACCATT ATATGGATAA CCGTTCTATT
TCAGATCTTC AGCAAACGAA AGTTATTAAA GTCGCGCCAG GATTTGAAGA TCGCCTTTAT
CTGGTAAATA CCCATACCTT TACGCCGCAG GGCGCTGAAT TTCACTGGCA CGGCGAAAAA
GATAAAAATG CGGGTATTCT TGATGCCGGT CCGGCGGGCG GTGCTTTGCC TTTCGATATC
GCCCCTTATA CGTTTGTGGT TGACACCCAG GGTGAATACC GCTGGTGGCT GGATCAAGAT
ACCTTCTACG ACGGCCACGA TATGAATATC AACAAACGCG GCTACCTGAT GGGTATTCGT
GAAACGCCTC GCGGCACCTT TACCGCGGTG CAGGGCCAAC ACTGGTACGA GTTTGACATG
ATGGGGCAAA TTCTTGCCGA TCACAAACTG CCGCGCGGGT TCCTGGATGC GTCTCATGAA
TCCATCGAAA CCGTGAACGG CACCGTACTG CTGCGCGTCG GCAAACGCGA TTACCGCAAA
GAAGACGGCA TACATGTTCA TACGATTCGT GACCAAATCA TTGAGGTTGA TAAGTCTGGC
CGCGTAGTAG ACGTTTGGGA TTTAACCAAA ATCCTCGACC CTATGCGTGA TGCGCTGCTC
GGCGCGCTGG ATGCGGGCGC AGTATGCGTG AACGTCGATC TGGCCCATGC CGGACAGCAG
GCGAAACTCG AACCGGATAC GCCGTATGGC GATGCGCTTG GCGTTGGTGC CGGTCGTAAC
TGGGCGCACG TCAACTCTAT CGCTTATGAC GCGAAAGACG ACTCCATCAT CCTTTCTTCC
CGCCATCAGG GTATTGTAAA AATTGGTCGC GATAAGCAGG TGAAATGGAT ACTGGCACCG
TCTAAAGGCT GGAATAAGCA GCTAGCCAGT AAATTGCTGA AACCGGTAGA CGATCATGGT
AAGCCGTTGA CCTGTGACGA AAACGGCAAG TGTAAGGACA CCGATTTCGA TTTCACCTAT
ACCCAACATA CGGCATGGCT TTCCAGCAAA GGCACGTTAA CGGTCTTTGA TAACGGCGAT
GGTCGCGGCC TGGAGCAACC GGCTCTACCG ACCATGAAAT ATTCCCGTTT TGTCGAATAT
AAGATCGATG AGAAGAAAGG CACCGTACAA CAAGTTTGGG AATACGGTAA AGAACGTGGA
TATGATTTCT ATAGTCCTAT TACCTCGGTT GTTGAATATC AAAAAGACCG CGACACCATG
TTCGGCTTTG GCGGTTCTAT TAACCTGTTC GACGTTGGTA AACCCACAGT CGGCAAACTG
AATGAGATTG ACTATAAAAC GAAAGAAGTG AAAGTTGAAA TTGATGTGCT GTCGGATAAA
CCCAACCAGA CTCACTATCG TGCATTACTG GTTCATCCAA CGCAAATGTT TAAATAA
 
Protein sequence
MFDQYRKTIL AGAVALTCGL TAASTFAAGF QPAQPAGKLG AVVVDPYGNA PLTALVELDS 
HIISDVKVTV HGKGEKGVPV TYTVGKESLE TYDGIPIFGL YQKFANNVTV EYKENGKAMK
DDYVVQTSAI VNHYMDNRSI SDLQQTKVIK VAPGFEDRLY LVNTHTFTPQ GAEFHWHGEK
DKNAGILDAG PAGGALPFDI APYTFVVDTQ GEYRWWLDQD TFYDGHDMNI NKRGYLMGIR
ETPRGTFTAV QGQHWYEFDM MGQILADHKL PRGFLDASHE SIETVNGTVL LRVGKRDYRK
EDGIHVHTIR DQIIEVDKSG RVVDVWDLTK ILDPMRDALL GALDAGAVCV NVDLAHAGQQ
AKLEPDTPYG DALGVGAGRN WAHVNSIAYD AKDDSIILSS RHQGIVKIGR DKQVKWILAP
SKGWNKQLAS KLLKPVDDHG KPLTCDENGK CKDTDFDFTY TQHTAWLSSK GTLTVFDNGD
GRGLEQPALP TMKYSRFVEY KIDEKKGTVQ QVWEYGKERG YDFYSPITSV VEYQKDRDTM
FGFGGSINLF DVGKPTVGKL NEIDYKTKEV KVEIDVLSDK PNQTHYRALL VHPTQMFK