Gene SeAg_B1026 details

Gene Information

Locus tag: SeAg_B1026
Symbol:
ID: 6793995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 1028045
End bp: 1029805
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 54%
IMG OID: 642775296
Product: lon protease (S16) proteolytic domain protein
Protein accession: YP_002145938
Protein GI: 197250157
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1067] Predicted ATP-dependent protease
TIGRFAM ID:
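
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the helper names `span_length` and `coding_codons` are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 1028045
END_BP = 1029805
GENE_LENGTH_BP = 1761
PROTEIN_LENGTH_AA = 586


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```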


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000023789
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGACCATTA CGAAACTTGC ATGGCGTGAT CTGGTTCCGG ATAGCGAAAG CTATCAGGAG 
ATATTTGCAC AGCCACACGC GACTGACGAA AACGACACCT TACTCAGTGA TACTCAGCCA
CGACTGCAAT TTGCGCTTGA GCAACTTATA CAGCCGTGGG CATCATCCTC TTTTATGCTG
ACTAAAGCGC CTGAAGAGCA AGAGTATCTC ACTTTACTTT CAGATGCCGT CCGCGCTCTG
CAAACCGATG CCGGACAATT AACCGGCGGA CATTATGACG TTTCCGGGCA TACTGTTCAT
TACCGCGCCG CGCAGAATGC GCAAGACAAC TTTGCCACCG TCACACAAGT CGTCAGCGCG
GACTGGGTCG AAGCCGAACA GCTCTTTGGT TGCCTGCGGC AGTATAACGG CGACATTATC
CTGCAGCCGG GACTGGTTCA TCAGGCGAAC GGCGGCGTGC TGATTATTTC CTTACGAACC
CTTCTGGCGC AGCCGTTACT GTGGATGCGT CTGAAAGCCA TCGTTAGCCG CGAGCGTTTT
GACTGGGTGG CCTTTGACGA GTCGCGTCCA TTACCGGTCT CCGTGCCATC CATGCCGCTC
AAACTGAAGG TGATTCTGGT TGGCGAACGT GAATCACTGG CTGATTTTCA GGAGATGGAA
CCGGAGCTCG CGGAACAGGC TATCTACAGT GAATTTGAAG ACAATTTACA GATAGCGGAC
GCAGAAGCTA TGACCCTGTG GTGTCAATGG GTGACGCGTA TCGCTTTACG CGATAATTTG
CCGCCCCCGG CACCGGACGC CTGGCCCGTC CTGATACGCG AGGCTGTGCG CTATACCGGC
GAACAGGATA CGCTGCCTCT TTGCCCACTG TGGATAGCCC GCCAGTTTAA GGAAGCGGCG
CCTTTATGCG AAGGCGATAC CTGCGGCGCA GAAGCGCTCA GTCTGATGCT TGCCCGACGC
GAATGGCGAG AAGGCTTTCT GGCGGAGCGG ATGCAGGATG AGATTCTGCA AGAGCAGATC
CTGATTGAAA CCGAAGGCGA ACGCGTTGGA CAAATCAATG CGCTTTCCGT CATTGAGTTT
CCCGGACATC CGCGCGCCTT TGGCGAACCG TCGCGAATTA GCTGTGTTGT GCATATCGGC
GATGGCGAAT TTAACGATAT TGAGCGCAAG GCCGAACTTG GCGGGAATAT CCACGCTAAG
GGAATGATGA TTATGCAGGC CTTCCTGATG TCTGAGTTGC AGCTGGAGCA ACAAATTCCC
TTCTCTGCCT CGTTAACCTT TGAGCAGTCC TACAGCGAAG TGGATGGCGA TAGCGCCTCA
ATGGCGGAAT TATGTGCGCT CATCAGCGCG CTGGCCAATG TGCCAGTGAA TCAAAACATT
GCGATTACCG GCTCGGTCGA TCAGTTTGGT CGCGCGCAAC CGGTGGGCGG GCTAAACGAA
AAAATTGAAG GTTTCTTCGC CATCTGCGAG CAGCGGGAAT TAAACGGTAA ACAGGGCGTA
ATTATCCCTG CCGCCAACGT CCGCCATCTC AGTCTTAAAT CTGAACTGCT GCAAGCGGTT
AAAGAAGAGA AGTTCACTAT CTGGGCGGTA GACGACGTGA CCGACGCCTT ACCGTTACTG
TTAAATCTGG TGTGGGATGG CGAAGGTCAA ACGACGTTGA TGCAGACTAT CCAGGAGCGT
ATCGCGCAGG CGACGCAACA GGAAGGCCGT CATCGTTTCC CGTGGCCATT ACGTTGGCTG
AACGCTTTTA TTCCGAACTG A
 
Protein sequence
MTITKLAWRD LVPDSESYQE IFAQPHATDE NDTLLSDTQP RLQFALEQLI QPWASSSFML 
TKAPEEQEYL TLLSDAVRAL QTDAGQLTGG HYDVSGHTVH YRAAQNAQDN FATVTQVVSA
DWVEAEQLFG CLRQYNGDII LQPGLVHQAN GGVLIISLRT LLAQPLLWMR LKAIVSRERF
DWVAFDESRP LPVSVPSMPL KLKVILVGER ESLADFQEME PELAEQAIYS EFEDNLQIAD
AEAMTLWCQW VTRIALRDNL PPPAPDAWPV LIREAVRYTG EQDTLPLCPL WIARQFKEAA
PLCEGDTCGA EALSLMLARR EWREGFLAER MQDEILQEQI LIETEGERVG QINALSVIEF
PGHPRAFGEP SRISCVVHIG DGEFNDIERK AELGGNIHAK GMMIMQAFLM SELQLEQQIP
FSASLTFEQS YSEVDGDSAS MAELCALISA LANVPVNQNI AITGSVDQFG RAQPVGGLNE
KIEGFFAICE QRELNGKQGV IIPAANVRHL SLKSELLQAV KEEKFTIWAV DDVTDALPLL
LNLVWDGEGQ TTLMQTIQER IAQATQQEGR HRFPWPLRWL NAFIPN
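
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which several alternative codons, TTG among them, can initiate translation and are then read as methionine. The sketch below illustrates that relationship. It assumes the usual codon-to-amino-acid assignments and the set of initiation codons commonly listed for table 11; `translate` and the constants are illustrative names, not something provided by the database.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed initiation codons for table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        amino_acid = CODON_TABLE.get(codon, "X")  # 'X' for ambiguous codons
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("TTGACCATTA CGAAACTTGC A"))  # MTITKLA
```

Passing the full gene sequence, with the spaces and line breaks left in, should give a string that can be compared against the protein sequence once its whitespace is removed.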