Gene SeSA_A0043 details

Gene Information

Locus tag: SeSA_A0043
Symbol:
ID: 6516705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 44311
End bp: 46026
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 50%
IMG OID: 642745218
Product: arylsulfotransferase
Protein accession: YP_002113050
Protein GI: 194735149
COG category:
COG ID:
TIGRFAM ID:
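
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon; neither assumption is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 44311
END_BP = 46026
GENE_LENGTH_BP = 1716
PROTEIN_LENGTH_AA = 571


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon.

    Assumes a complete, unspliced CDS whose last codon is a stop.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 46026 - 44311 + 1 gives 1716 bp, and 1716 / 3 - 1 gives 571 aa, matching the listed values.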


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.79648
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.228771
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATACGT TAACTGCAAC GTCTGTTGTC CTTCCTGCGC CGCGTCCGGC GATTAATCAG 
GGTATCGATA TCAATAATGA AATGGTGCTT AACCATACTG CTATTTATGA AAATTGCCTT
ACGCAGGTCA CGCAAGAGAA TACGGTAGAA AATGCGCTGA TGTTGTTAGA CCCTTACGGC
ACGGCGCCTT TAAGCGCTTA TGCCGGGGTC TGGAGTCTGG AACCGGCTGA GATCATAGTC
ACGGTCCAGG ATGCGGCAAA AACGGCGATG CCGATAGAAC ATCTTTATAC CCTTACGCCA
GGCGCAAATC TGTTGCCGGT TCTGGGGCTG GTAGCGGATA CTGAAAACCG TATTGTCTTT
TCTCAGGCAG ATACGCCGCT TGCCGTCTAT ACGCTCATCA CACAGCCATT ACCGCCGGTA
GATTCCGCGG AGGTCGTATT AGGTTTTCCG ATTATCAACG TGACGCAACC TGCTACCGAT
GCGGACAAGA TGGCGCCAGG GTTTTATTTT ATTACGCATT TCGATCGCTA TAATTACGCA
TTAGATCAGA ATGGTCTGGT GCGCTGGTAC GTTACTCAGG ATTATCCGTC TTATAATTTT
GTTCGAATTG ATAATGGCCA TTTCCTCACC ACTTCAGAAG CGAAAAATAC CTATCTGGAT
ATGTATGAGT TCGACATGAT GGGGCGTCTT CACACATTCT ATAATCTCGA TAATCAATTT
CACCATTCTA TCTGGCCGTG GGATAGCAAT ACCATTGTTG CGCCCTCTGA ATATACCTCG
GGTCGGCCCG ACGATTTGAA AACCAATGAA GACGGCGTAT CGGTTGTCGA TCTGACTACC
GGGCTGGAGA CCGCTTACTA CGATATGGCG AAAGTGCTGG ATACGACGCG GGTTTCCCGT
CCTTCAGGTA CGGCGCCGGG AGAAGACCCG ACGGTTAAAG ACTGGCTGCA CATAAACCAG
AGCTACGTGA ATGAGACGAA TCAGTTGTTA ATTGCGTCCG GGCGTCATCA GAGCGCGGTG
TTTGGCGTCG ATCTACAAAC GCAAGCGCTA CGCTTTATTT TGTCAACGCA TGAAGACTGG
GACGACGCTT ATCAGCCTTA TCTTTTAACC CCGGTCAACA GTGAAGGTGT GGCGCTTTAC
GACTTTAGCA AACAGGAGGA TATCGACGCG GCTGACCGTG ACTTTTGGAC GTGGGGCCAG
CATAACGTCG TTGAAATCGC CAATAATACG CCGGGCATGG TGGAGTTTAT GGTGTTTGAT
AACGGTAACT ACCGTTCGCG TGATGACAGC AAAAGTCTGT TACCGCCGGA TAACTACAGC
CGCATTGTCC ATTTCGTGGT GAATATGAAT GAGATGACCG TTATGCGGCC ATTTGAATAC
GGCAAGGAGC TGGGCGCGCG TGGCTACAGT AGCTGCGTTA GCGCGAAAGC GATCCAGCAG
AATGGCAATA TTGTGGTGCA TTTTGCTGAC TGCACGTTTG ATGAAAATGG CCGCGCCATC
TCTTGCCAGC CTGGCGAGAG CGATATTATC GATCCGCAGG CGGGCAGCGA GGCGATGGGG
CTGCTAATTT TACAGGAGAT TGCGCCTACG GAGAAAACCG TGCTTTTTGA AGCGACCATG
ACGTCAGGTT ACTACAAAAA CGCGGAAACG AACGGGGAAG GCTATCGCTA CGATATTACC
AGTTTCCGGG TGTATAAAAT GGATCTGTAC GCGTAG
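
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be stripped of whitespace before any length or composition check. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as a short demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten).
first_line = "ATGAATACGT TAACTGCAAC GTCTGTTGTC CTTCCTGCGC CGCGTCCGGC GATTAATCAG"

if __name__ == "__main__":
    seq = clean_sequence(first_line)
    print(len(seq))  # six blocks of ten bases -> 60
    print(f"{gc_fraction(seq):.2f}")
```

Applying the same two functions to the full sequence would allow a comparison with the listed gene length and GC content.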
 
Protein sequence
MNTLTATSVV LPAPRPAINQ GIDINNEMVL NHTAIYENCL TQVTQENTVE NALMLLDPYG 
TAPLSAYAGV WSLEPAEIIV TVQDAAKTAM PIEHLYTLTP GANLLPVLGL VADTENRIVF
SQADTPLAVY TLITQPLPPV DSAEVVLGFP IINVTQPATD ADKMAPGFYF ITHFDRYNYA
LDQNGLVRWY VTQDYPSYNF VRIDNGHFLT TSEAKNTYLD MYEFDMMGRL HTFYNLDNQF
HHSIWPWDSN TIVAPSEYTS GRPDDLKTNE DGVSVVDLTT GLETAYYDMA KVLDTTRVSR
PSGTAPGEDP TVKDWLHINQ SYVNETNQLL IASGRHQSAV FGVDLQTQAL RFILSTHEDW
DDAYQPYLLT PVNSEGVALY DFSKQEDIDA ADRDFWTWGQ HNVVEIANNT PGMVEFMVFD
NGNYRSRDDS KSLLPPDNYS RIVHFVVNMN EMTVMRPFEY GKELGARGYS SCVSAKAIQQ
NGNIVVHFAD CTFDENGRAI SCQPGESDII DPQAGSEAMG LLILQEIAPT EKTVLFEATM
TSGYYKNAET NGEGYRYDIT SFRVYKMDLY A