Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A0230 |
Symbol | degP |
ID | 6485532 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 246642 |
End bp | 248078 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642735667 |
Product | serine endoprotease |
Protein accession | YP_002039449 |
Protein GI | 194443684 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAACACA TGAAAAAAAC CACATTAGCA ATGAGTGCAC TGGCTCTGAG TTTAGGTTTG GCATTGTCGC CTCTGTCTGC CACGGCGGCT GAAACGTCCT CTTCAGCAAT GACTGCCCAG CAGATGCCAA GCCTGGCACC GATGCTCGAA AAAGTGATGC CATCGGTGGT CAGTATTAAT GTTGAAGGTA GCACCACGGT GAATACGCCG CGTATGCCGC GTAATTTCCA GCAGTTCTTT GGCGATGACT CCCCGTTCTG CCAGGACGGT TCTCCGTTCC AGAATTCTCC GTTCTGCCAG GGCGGCGGTA ACGGCGGCAA CGGCGGTCAA CAACAGAAAT TCATGGCGCT GGGCTCCGGC GTAATTATTG ACGCCGCGAA GGGCTACGTC GTCACCAACA ACCACGTGGT TGATAACGCC AGCGTGATTA AAGTACAGCT TAGCGATGGG CGTAAATTCG ATGCTAAAGT GGTGGGCAAA GATCCGCGTT CTGATATCGC GCTGATTCAA ATTCAGAATC CGAAGAACCT GACGGCGATT AAGCTGGCGG ACTCCGACGC GCTGCGCGTG GGGGATTATA CCGTCGCTAT TGGTAACCCG TTTGGTCTGG GCGAAACGGT GACGTCAGGT ATCGTTTCGG CGCTGGGGCG TAGCGGCCTG AACGTAGAAA ATTACGAGAA CTTTATTCAG ACCGACGCCG CGATTAACCG CGGTAACTCC GGCGGCGCGC TGGTGAACCT GAACGGTGAG CTGATCGGTA TTAACACCGC GATTCTGGCG CCGGACGGCG GCAACATCGG TATCGGCTTC GCTATCCCCA GTAACATGGT GAAAAACCTG ACGTCGCAGA TGGTGGAATA CGGCCAGGTG AAACGCGGCG AACTGGGGAT CATGGGGACT GAGCTGAATT CCGAATTGGC GAAAGCGATG AAAGTCGACG CCCAGCGAGG CGCGTTCGTC AGCCAGGTGA TGCCGAATTC GTCCGCGGCG AAAGCGGGTA TCAAAGCCGG GGATGTCATT ACCTCGCTGA ACGGTAAACC GATCAGCAGC TTTGCGGCGC TGCGCGCTCA GGTCGGCACT ATGCCGGTCG GCAGCAAAAT CAGCCTCGGT CTGCTGCGTG AAGGTAAAGC GATTACGGTG AATCTGGAAC TGCAGCAGAG CAGCCAGAGT CAGGTTGATT CCAGCACCAT CTTCAGCGGG ATTGAAGGCG CTGAAATGAG CAATAAAGGC CAGGATAAAG GCGTTGTGGT GAGCAGCGTG AAAGCGAACT CACCCGCCGC GCAAATTGGC CTCAAAAAAG GCGATGTGAT TATCGGCGCT AACCAGCAGC CGGTGAAAAA TATCGCCGAG CTGCGTAAGA TTCTCGACAG CAAGCCGTCG GTTCTGGCGC TGAATATTCA GCGTGGTGAT AGTTCTATTT ATTTGCTGAT GCAGTAA
|
Protein sequence | MKHMKKTTLA MSALALSLGL ALSPLSATAA ETSSSAMTAQ QMPSLAPMLE KVMPSVVSIN VEGSTTVNTP RMPRNFQQFF GDDSPFCQDG SPFQNSPFCQ GGGNGGNGGQ QQKFMALGSG VIIDAAKGYV VTNNHVVDNA SVIKVQLSDG RKFDAKVVGK DPRSDIALIQ IQNPKNLTAI KLADSDALRV GDYTVAIGNP FGLGETVTSG IVSALGRSGL NVENYENFIQ TDAAINRGNS GGALVNLNGE LIGINTAILA PDGGNIGIGF AIPSNMVKNL TSQMVEYGQV KRGELGIMGT ELNSELAKAM KVDAQRGAFV SQVMPNSSAA KAGIKAGDVI TSLNGKPISS FAALRAQVGT MPVGSKISLG LLREGKAITV NLELQQSSQS QVDSSTIFSG IEGAEMSNKG QDKGVVVSSV KANSPAAQIG LKKGDVIIGA NQQPVKNIAE LRKILDSKPS VLALNIQRGD SSIYLLMQ
|
| |