Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3611 |
Symbol | |
ID | 6482231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 3497474 |
End bp | 3498841 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642738886 |
Product | serine endoprotease |
Protein accession | YP_002042603 |
Protein GI | 194445588 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.40714 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 83 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC ACACCCAGCT GTTAAGTGCA TTAGCGTTAA GTGTCGGGTT AACTCTTTCG GCGCCGTTTC CAGCCCTTGC ATCGATACCA GGCCAGGTGC CAGGCCAGGC GACGCTGCCA AGCCTTGCCC CTATGCTGGA GAAAGTGCTG CCTGCTGTCG TCAGCGTAAA AGTCGAGGGA ACCGCCGCCC AGAGCCAAAA AGTGCCGGAG GAGTTTAAAA AATTCTTTGG CGAGGATCTG CCAGACCAGC CGTCCCAGCC GTTTGAAGGA CTCGGTTCGG GGGTGATTAT CGATGCCGCG AAAGGCTATG TATTAACCAA TAATCATGTG ATTAATCAGG CACAGAAGAT CAGTATTCAA CTGAATGACG GACGCGAATT CGACGCGAAG CTGATCGGCG GCGACGACCA GAGCGATATC GCTCTGTTAC AAATTCAGAA TCCCAGCAAG TTAACGCAAA TTGCCATCGC CGATTCCGAC AAACTCCGCG TCGGCGATTT CGCCGTGGCG GTCGGTAATC CGTTTGGTCT TGGACAAACC GCCACCTCCG GGATTATTTC AGCGCTGGGA CGCAGCGGGC TTAATCTGGA AGGGCTTGAG AACTTTATTC AAACCGATGC CTCTATTAAC CGCGGCAACT CCGGCGGCGC GCTGCTTAAC CTGAACGGCG AGCTGATCGG GATTAATACC GCAATCCTCG CGCCAGGGGG CGGGAGCATC GGCATTGGCT TTGCTATTCC TTCCAATATG GCGCAGACGC TGGCGCAGCA GTTGATTCAG TTCGGCGAAA TCAAACGCGG ATTGCTGGGA ATTAAAGGCA CTGAAATGAC CGCTGATATC GCTAAGGCAT TCAAACTGAA CGTTCAGCGT GGCGCTTTTG TCAGCGAGGT TTTACCCAAT TCAGGTTCGG CGAAGGCCGG GGTGAAATCC GGAGACGTGA TTATCAGTCT TAACGGTAAG CCGCTGAATA GCTTTGCCGA ACTGCGTTCA CGTATCGCCA CCACCGAACC GGGCACGAAA GTGAAGCTGG GCCTGCTGCG CGATGGTAAG CCGCTGGAGG TGGAAGTCAC GCTGGATTCC AATACCTCTT CTTCCGCCAG TGCCGAAATG ATCGCCCCGG CGTTGCAAGG CGCGACGTTG AGCGACGGCC AACTGAAAGA CGGGACGAAA GGCGTTAAGG TTGATAGCGT CGAAAAAAGC AGTCCTGCCG CGCAGGCCGG TTTGCAAAAA GATGATGTTA TCATCGGCGT TAACCGCGAT CGTATCAGTT CTATCGCCGA AATGCGTAAA GTGATGGCGG CAAAACCGTC CATCATTGCT CTTCAGGTAG TACGCGGCAA CGAGAACATT TATCTATTGC TGCGCTAA
|
Protein sequence | MKKHTQLLSA LALSVGLTLS APFPALASIP GQVPGQATLP SLAPMLEKVL PAVVSVKVEG TAAQSQKVPE EFKKFFGEDL PDQPSQPFEG LGSGVIIDAA KGYVLTNNHV INQAQKISIQ LNDGREFDAK LIGGDDQSDI ALLQIQNPSK LTQIAIADSD KLRVGDFAVA VGNPFGLGQT ATSGIISALG RSGLNLEGLE NFIQTDASIN RGNSGGALLN LNGELIGINT AILAPGGGSI GIGFAIPSNM AQTLAQQLIQ FGEIKRGLLG IKGTEMTADI AKAFKLNVQR GAFVSEVLPN SGSAKAGVKS GDVIISLNGK PLNSFAELRS RIATTEPGTK VKLGLLRDGK PLEVEVTLDS NTSSSASAEM IAPALQGATL SDGQLKDGTK GVKVDSVEKS SPAAQAGLQK DDVIIGVNRD RISSIAEMRK VMAAKPSIIA LQVVRGNENI YLLLR
|
| |