Gene SeAg_B0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B0043 
Symbol 
ID6793141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp44309 
End bp46024 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content50% 
IMG OID642774355 
Productarylsulfotransferase 
Protein accessionYP_002145019 
Protein GI197248163 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACGT TGACTGCAAC GTCTGTTGTC CTTCCTGCGC CGCGTCCGGC GATTAATCAG 
GGTATCGATA TCAATAATGA AATGGTGCTT AACCATACCG CTATTTATGA AAATTGCCTT
GCGCAGGTCA CGCAAGAGAA TACGGTAGAA AATGCGCTGA TGTTGTTAGA CCCTTACGGC
ACGGCGCCTT TAAGCGCTTA TGCCGGGGTC TGGAGTCTGG AATCAGCTGA GATCATAGTC
ACGGTCCAGG ATGCGGCAAA AACGGCGATG CCGGTAGAAC ATCTTTACAC CCTTACGCCA
GGCGCAAATC TGTTGCCGGT TCTGGGGCTG GTAGCGGATA CTGAAAACCG TATTGTCTTT
TCTCAGGCAG ATACGCCGCT TGCCGTCTAT ACGCTCACCA CACAGCCATT ACCGCCGGCA
GATTCCGCGG AAGTCGTATT AGGTTTTCCG ATTATCAACG TGACGCAACC TGCTACCGAT
GCGGACAAGA TGGCGCCAGG GTTTTATTTT ATTACGCATT TCGATCGCTA TAATTACGCA
TTAGATCAGA ATGGTCTGGT GCGCTGGTAC GTTACTCAGG ATTATCCGTC TTATAATTTT
GTTCGAATTG ATAATGGCCA TTTCCTCACT ACTTCAGAAG CGAAAAATAC CTATCTGGAT
ATGTATGAGT TCGACATGAT GGGGCGTCTT CACACATTCT ATAATCTCGA TAATCAATTT
CACCATTCTA TCTGGCCGTG GGATAGCAAT ACCATTGTTG CGCCCTCTGA ATATACCTCG
GGTCGGCCCG ACGATTTGAA AACCAATGAA GACGGCGTAT CGGTTGTCGA TCTGACTACC
GGACTGGAGA CGGCTTACTA CGATATGGCG AAGGTGCTGG ATACGACGCG GGTTTCCCGT
CCTTCAGGTA CGGCGCCGGG AGAAGACCCG ACGGTTAAAG ACTGGCTGCA TATAAACCAG
AGCTACGTGA ATGAGACGAA TCAGTTGTTA ATTGCGTCCG GGCGTCATCA GAGCGCGGTG
TTTGGCGTCG ATCTGCAAAC GCAAGCGCTA CGCTTTATTT TGTCAACGCA TGAAGACTGG
GACGACGCTT ATCAGCCTTA TCTTTTAACC CCGGTCGACA GTGAAGGTGT GGCGCTTTAT
GACTTTAGCA AACAGGAGGA TATCGACGCG GCCGACCGTG ACTTTTGGAC TTGGGGCCAG
CATAACGTCG TTGAAATCGC CAATAATACG CCGGGTATAG TGGAGTTTAT GGTATTTGAT
AACGGTAACT ACCGTTCGCG TGATGACAGC AAAAGCCTGT TACCGCCGGA TAACTACAGC
CGCATTGTCC ATTTCGTGGT GAATATGAAT GAGATGACCG TTATGCGGCC ATTTGAATAC
GGCAAGGAGC TGGGCGCGCG TGGCTACAGT AGCTGCGTTA GCGCGAAAGC GATCCAGCAG
AATGGCAATA TTGTGGTGCA TTTTGCCGAC TGCACGTTTG ATGAAAATGG CCGCGCCATC
TCTTGCCAGC CTGGCGAGAG CGATATTATC GATCCGCAGG CGGGCAGCGA GGCGATGGGG
CTGCTAATTT TACAGGAGAT TGCGCCTACG GAGAAAACCG TGCTTTTTGA AGCGACCATG
ACGTCAGGTT ACTACAAAAA CGCGGAAACG AACGGGGAAG GCTATCGCTA CGATATTACC
AGTTTCCGGG TGTATAAAAT GGATCTGTAC GCGTAG
 
Protein sequence
MNTLTATSVV LPAPRPAINQ GIDINNEMVL NHTAIYENCL AQVTQENTVE NALMLLDPYG 
TAPLSAYAGV WSLESAEIIV TVQDAAKTAM PVEHLYTLTP GANLLPVLGL VADTENRIVF
SQADTPLAVY TLTTQPLPPA DSAEVVLGFP IINVTQPATD ADKMAPGFYF ITHFDRYNYA
LDQNGLVRWY VTQDYPSYNF VRIDNGHFLT TSEAKNTYLD MYEFDMMGRL HTFYNLDNQF
HHSIWPWDSN TIVAPSEYTS GRPDDLKTNE DGVSVVDLTT GLETAYYDMA KVLDTTRVSR
PSGTAPGEDP TVKDWLHINQ SYVNETNQLL IASGRHQSAV FGVDLQTQAL RFILSTHEDW
DDAYQPYLLT PVDSEGVALY DFSKQEDIDA ADRDFWTWGQ HNVVEIANNT PGIVEFMVFD
NGNYRSRDDS KSLLPPDNYS RIVHFVVNMN EMTVMRPFEY GKELGARGYS SCVSAKAIQQ
NGNIVVHFAD CTFDENGRAI SCQPGESDII DPQAGSEAMG LLILQEIAPT EKTVLFEATM
TSGYYKNAET NGEGYRYDIT SFRVYKMDLY A