Gene SeSA_A3378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A3378 
Symbol 
ID6516789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp3260738 
End bp3262534 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content49% 
IMG OID642748374 
Productarylsulfate sulfotransferase 
Protein accessionYP_002116147 
Protein GI194735024 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.953621 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGACC AATACCGGAA AACAATACTT GCCGGTGCCG TCGCACTGAC ATGCGGACTC 
ACCGCAGCCA GTACATTTGC CGCAGGTTTT CAACCGGCGC AGCCCGCAGG AAAATTAGGC
GCAGTCGTTG TCGATCCTTA CGGTAATGCC CCTCTCACCG CACTGGTGGA ATTAGATAGC
CATGTTATTT CAGACGTTAA AGTTACTGTA CATGGCAAAG GGGAAAAAGG CGTTCCCGTT
ACTTATACTG TTGGGAAAGA GTCTTTAGAA ACCTATGACG GTATTCCTAT TTTTGGCCTT
TATCAGAAAT TTGCCAACAA CGTCACGGTA GAATATAAAG AAAACGGCAA AGCCATGAAG
GATGACTATG TGGTGCAGAC GTCCGCCATC GTCAACCATT ATATGGATAA CCGTTCTATT
TCAGATCTCC AGCAAACGAA AGTTATTAAA GTTGCGCCAG GATTTGAAGA TCGCCTTTAT
CTGGTAAATA CCCATACCTT TACGCCGCAG GGCGCTGAAT TTCACTGGCA CGGCGAAAAA
GATAAAAATG CGGGCATTCT TGATGCCGGT CCGGCGGGCG GGGCTTTGCC TTTCGATATC
GCCCCTTATA CGTTTGTGGT CGACACCCAG GGTGAATACC GCTGGTGGCT GGATCAAGAT
ACCTTCTACG ACGGCCACGA TATGAATATC AACAAACGCG GCTATCTGAT GGGTATTCGT
GAAACGCCTC GCGGCACCTT TACCGCGGTG CAGGGCCAAC ACTGGTACGA GTTTGACATG
ATGGGGCAAA TTCTTGCCGA TCATAAACTG CCGCGCGGGT TCCTGGATGC GTCTCATGAA
TCCATCGAAA CCGTGAACGG CACCGTACTG CTGCGCGTCG GCAAACGCGA TTACCGCAAA
GAAGACGGCA TACATGTTCA TACTATTCGT GACCAAATCA TTGAGGTTGA TAAGTCTGGC
CGCGTAGTAG ACGTTTGGGA TTTAACCAAA ATCCTCGACC CTATGCGTGA TGCGCTGCTC
GGCGCGCTGG ATGCGGGCGC AGTATGCGTG AACGTCGATC TGGCCCATGC CGGACAGCAG
GCGAAACTTG AACCGGATAC GCCGTATGGC GATGCGCTTG GCGTTGGTGC CGGTCGTAAC
TGGGCGCACG TCAACTCTAT CGCTTATGAC GCGAAAGACG ACTCCATCAT CCTTTCTTCC
CGCCATCAGG GTATTGTAAA AATTGGTCGC GATAAGCAGG TGAAATGGAT ACTGGCACCG
TCTAAAGGCT GGAATAAGCA GCTAGCCAGT AAATTGCTGA AACCGGTAGA CGATCATGGT
AAGCCGTTGA CCTGTGACGA AAATGGCAAG TGTAAGGACA CCGATTTCGA TTTCACCTAT
ACCCAACATA CGGCATGGCT TTCCAGCAAA GGCACGTTAA CGGTCTTTGA TAACGGCGAT
GGTCGCGGCC TGGAGCAACC GGCTCTACCG ACCATGAAAT ATTCCCGTTT TGTCGAATAT
AAGATCGATG AGAAGAAAGG CACCGTACAA CAAGTTTGGG AATACGGTAA AGAACGTGGG
TATGATTTCT ATAGTCCTAT TACCTCGGTT GTTGAATATC AAAAAGACCG CGACACCATG
TTCGGCTTTG GCGGTTCTAT TAACCTGTTC GACGTTGGTA AACCCACAGT CGGCAAACTG
AATGAGATTG ACTATAAAAC GAAAGAAGTG AAAGTTGAAA TTGATGTGCT GTCGGATAAA
CCCAACCAGA CTCACTATCG TGCGTTACTG GTTCATCCAA CGCAAATGTT TAAATAA
 
Protein sequence
MFDQYRKTIL AGAVALTCGL TAASTFAAGF QPAQPAGKLG AVVVDPYGNA PLTALVELDS 
HVISDVKVTV HGKGEKGVPV TYTVGKESLE TYDGIPIFGL YQKFANNVTV EYKENGKAMK
DDYVVQTSAI VNHYMDNRSI SDLQQTKVIK VAPGFEDRLY LVNTHTFTPQ GAEFHWHGEK
DKNAGILDAG PAGGALPFDI APYTFVVDTQ GEYRWWLDQD TFYDGHDMNI NKRGYLMGIR
ETPRGTFTAV QGQHWYEFDM MGQILADHKL PRGFLDASHE SIETVNGTVL LRVGKRDYRK
EDGIHVHTIR DQIIEVDKSG RVVDVWDLTK ILDPMRDALL GALDAGAVCV NVDLAHAGQQ
AKLEPDTPYG DALGVGAGRN WAHVNSIAYD AKDDSIILSS RHQGIVKIGR DKQVKWILAP
SKGWNKQLAS KLLKPVDDHG KPLTCDENGK CKDTDFDFTY TQHTAWLSSK GTLTVFDNGD
GRGLEQPALP TMKYSRFVEY KIDEKKGTVQ QVWEYGKERG YDFYSPITSV VEYQKDRDTM
FGFGGSINLF DVGKPTVGKL NEIDYKTKEV KVEIDVLSDK PNQTHYRALL VHPTQMFK