Gene SeAg_B4344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B4344 
Symbol 
ID6795359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp4233273 
End bp4235063 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content48% 
IMG OID642778447 
Productarylsulfotransferase 
Protein accessionYP_002149026 
Protein GI197249056 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTATTTA CAAGGAAAGT CTTACCTGTA TTATGCTGCC TGTGTTTGAG CGGCAGCGTT 
TTGGCCTCAG GCGTTTTAGA CCCAAACAGG CCAATGGTCG CATCAGCAGA TGTCATTCCA
GTACATGAAG GGCCATTAGG TATGGTCGAT GTCGCTCCCT ACGGCGGCGT TTTCCCATTA
ACAGCAATCA TTAATAAAGC CAATCATAAT GTACAGGACG TGAAGGTTAC CGTTTTAGGG
AAAGGGGAAA AAGGTATCCC GATCAGTTAT GATGTCGGCC CGCAGGCTAT AAATACCCAT
GACGGCATAC CTGTATTTGG CTTGTATCCA GATTATGTCA ATAAGGTTAA AGTTGACTGG
ACTGAAGAAG GTAAAAAACA AACTTATACG TGGTCCATTT ACGCCGCACC GGTATCATTA
CCCTCTACTA CCGGGCAAAC TGCCGTTCTT CCTACAGTAG AACCGGTTAA AGTCGATAGC
TCGCTTAAAA ATCGCTTATA TCTTTTTAAC CATATAACAG GGATGCCAAG AGCCGGCCAC
ATTATGCATG TCGCAGGCGG CGCGGCGAAC TGGGATTATA CCGGTATCAA CTGGATTAGC
GATACGAATG GCGATGTTCG CGGCTATATG AATATTGATA AATTCCGTAA CCAGGATGAT
ATAACGCGCT TTGGTTCCAT GATGAGCTTC CATCAGGTTA ACGATGGCAA TCTTATTTTT
GGCCAGGGTC AACGTTACTT TAAATATGAT TTCTTAGGCC GCGTTATTTC CGATAAACGA
CTGCCAAAAG GATTTATTGA TTTTTCGCAC GCCATTACCG AAACGCCGAA AGGCACCTAC
CTGCTGCGTG TCGCAAAAGA AAATTATCCA TTAAATGGTA AATACACCAT CAATACGGTG
CGTGATCATA TTCTTGAAGT TGACCAGAAC GGCGATACCG TCGATTACTG GGATCTGCCA
AAAATCCTCG ACCCCTATCG TGACGACGTT ATTCTGGCGA TGGATCAGGG AGCGGTATGT
TTGAGCGTCG ATGCCGAACA TTCCGGTCAG GTCATGACCA AAGAGCAGCT TGCAAAACAA
CCCTTCGGCG ATATCGCGGG TTCCGGCCCG GGCCGCAACT GGGCGCATGT TAACTCCGTC
AGCTACGATC CTCGCGACGA CAGCATTATC ATTAGCTCGC GCCACCAGTC TGCCATCATC
AAAATTGGCC GCGATAAAAA AGTGAAATGG ATACTTTCCG ATCCATCCGG CTGGAAAGGC
GAACTGGCGA AAAAAGTGCT GAAACCCGTA GACAGCAATG GTAAACCGCT AACCTGCGAA
GCGCACCACT GCGACGGTGG ATTTGACTGG ACATGGACAC AACATACCGG TTGGTTAGTG
CCATCCAAAA GCACCGGAGG TAAAATCGTC GTGACCGCCT TTGATAACGG CGATGCGCGC
GGCATGGAAC AACCGGCCAT GCCATCAATG AAATATTCCC GCGGCGTGGA ATATCAAATT
GACGAAAAAA ATATGACGGT TTCCCAAATG TGGGAATATG GTAAAGAGCG CGGTTTTGAC
TGGTACAGCG CCATTACTTC CGTCACGGAA TATCGCCCGG AAACCAAAAC GATGTTCATG
TACTCGGCTA CAGCGGGAAT GAGCGGTACA AAACCGATCG TTTCCGTTCT GGATGAAGTC
AAAGACGGCA CTCAGGATGT GATGCTGGAG CTAAAAGTAC ACAGTAACCG TGCCGGTATG
CTGGGTTATC GGGCGCTGAT TATCGATCCA GAGCAGATGT TTAAAAAATA A
 
Protein sequence
MLFTRKVLPV LCCLCLSGSV LASGVLDPNR PMVASADVIP VHEGPLGMVD VAPYGGVFPL 
TAIINKANHN VQDVKVTVLG KGEKGIPISY DVGPQAINTH DGIPVFGLYP DYVNKVKVDW
TEEGKKQTYT WSIYAAPVSL PSTTGQTAVL PTVEPVKVDS SLKNRLYLFN HITGMPRAGH
IMHVAGGAAN WDYTGINWIS DTNGDVRGYM NIDKFRNQDD ITRFGSMMSF HQVNDGNLIF
GQGQRYFKYD FLGRVISDKR LPKGFIDFSH AITETPKGTY LLRVAKENYP LNGKYTINTV
RDHILEVDQN GDTVDYWDLP KILDPYRDDV ILAMDQGAVC LSVDAEHSGQ VMTKEQLAKQ
PFGDIAGSGP GRNWAHVNSV SYDPRDDSII ISSRHQSAII KIGRDKKVKW ILSDPSGWKG
ELAKKVLKPV DSNGKPLTCE AHHCDGGFDW TWTQHTGWLV PSKSTGGKIV VTAFDNGDAR
GMEQPAMPSM KYSRGVEYQI DEKNMTVSQM WEYGKERGFD WYSAITSVTE YRPETKTMFM
YSATAGMSGT KPIVSVLDEV KDGTQDVMLE LKVHSNRAGM LGYRALIIDP EQMFKK