Gene SeAg_B3539 details

Gene Information

Locus tag: SeAg_B3539
Symbol:
ID: 6796860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 3431728
End bp: 3433095
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 54%
IMG OID: 642777670
Product: serine endoprotease
Protein accession: YP_002148272
Protein GI: 197247455
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
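
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, but both agree with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed to be counted in the gene length)
    # does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(3431728, 3433095)
print(length)                     # 1368
print(protein_length_aa(length))  # 455
```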


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0314705
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAC ACACCCAGCT GTTAAGTGCA TTAGCGTTAA GTGTCGGATT AACTCTTTCG 
GCGCCGTTTC CAGCCCTTGC ATCAATACCA GGCCAGGTGC CAGGCCAGGC GACGCTGCCA
AGCCTTGCCC CTATGCTGGA GAAAGTGCTG CCTGCTGTCG TCAGCGTAAA AGTCGAGGGA
ACCGCCGCCC AGAGCCAAAA AGTGCCGGAG GAGTTTAAAA AATTCTTTGG CGAGGATCTG
CCAGACCAGC CGTCCCAGCC GTTTGAAGGA CTCGGTTCGG GGGTGATTAT CGATGCCGCG
AAAGGCTATG TATTAACCAA TAATCATGTG ATTAATCAGG CACAGAAGAT CAGCATTCAA
CTGAATGACG GACGCGAATT CGACGCGAAG CTGATCGGCG GCGACGACCA GAGCGATATC
GCTCTGTTAC AAATTCAGAA TCCCAGCAAG TTAACGCAAA TTGCCATCGC CGATTCCGAC
AAACTCCGCG TCGGCGATTT CGCCGTGGCG GTCGGTAATC CGTTTGGCCT TGGGCAAACC
GCCACCTCCG GGATTATTTC AGCGCTAGGA CGCAGCGGGC TTAATCTGGA AGGGCTTGAG
AACTTTATTC AAACCGATGC CTCTATTAAC CGCGGCAACT CCGGCGGCGC GCTGCTTAAC
CTGAACGGCG AGCTGATCGG GATTAATACC GCGATCCTCG CGCCAGGGGG CGGGAGCATC
GGCATTGGCT TTGCTATTCC TTCCAATATG GCGCAGACGC TGGCGCAGCA GTTGATTCAG
TTCGGCGAAA TCAAACGCGG ATTGCTGGGA ATTAAAGGCA CTGAAATGAC CGCTGATATC
GCTAAGGCAT TCAAACTGAA CGTTCAGCGT GGCGCTTTTG TCAGCGAGGT TTTACCCAAT
TCAGGTTCGG CGAAGGCCGG GGTGAAATCC GGAGACGTGA TTATCAGTCT TAACGGTAAG
CCGCTGAATA GCTTTGCCGA ACTGCGTTCG CGTATCGCCA CCACCGAACC GGGCACGAAA
GTGAAGCTGG GCCTGCTGCG CGATGGTAAG CCGCTGGAGG TGGAAGTCAC GCTGGATTCC
AATACCTCTT CTTCCGCCAG CGCCGAAATG ATCGCCCCGG CGTTGCAAGG CGCGACGTTG
AGCGACGGCC AGCTGAAAGA CGGGACGAAA GGCGTTAAGG TTGATAGCGT CGAAAAAAGC
AGTCCTGCCG CGCAGGCCGG TTTGCAAAAA GATGATGTTA TCATCGGCGT TAATCGCGAT
CGCATCAGTT CTATCGCCGA AATGCGCAAA GTGATGGCGG CAAAACCGTC CATCATTGCT
CTTCAGGTAG TACGCGGCAA CGAGAACATT TATCTATTGC TGCGCTAA
 
Protein sequence
MKKHTQLLSA LALSVGLTLS APFPALASIP GQVPGQATLP SLAPMLEKVL PAVVSVKVEG 
TAAQSQKVPE EFKKFFGEDL PDQPSQPFEG LGSGVIIDAA KGYVLTNNHV INQAQKISIQ
LNDGREFDAK LIGGDDQSDI ALLQIQNPSK LTQIAIADSD KLRVGDFAVA VGNPFGLGQT
ATSGIISALG RSGLNLEGLE NFIQTDASIN RGNSGGALLN LNGELIGINT AILAPGGGSI
GIGFAIPSNM AQTLAQQLIQ FGEIKRGLLG IKGTEMTADI AKAFKLNVQR GAFVSEVLPN
SGSAKAGVKS GDVIISLNGK PLNSFAELRS RIATTEPGTK VKLGLLRDGK PLEVEVTLDS
NTSSSASAEM IAPALQGATL SDGQLKDGTK GVKVDSVEKS SPAAQAGLQK DDVIIGVNRD
RISSIAEMRK VMAAKPSIIA LQVVRGNENI YLLLR
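
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which start codons are permitted), so a standard codon table is enough for this ATG-initiated gene. The function names are illustrative, and the spaces that group the sequence into blocks of ten are stripped before processing.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first position
# varies slowest). Table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAAAAAAC ACACCCAGCT GTTAAGTGCA"))  # MKKHTQLLSA
```

Passing the full gene sequence to `translate` and `gc_percent` would allow comparison with the listed protein sequence and the reported GC content.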