Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A3540 |
Symbol | |
ID | 6517354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 3413647 |
End bp | 3415002 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642748526 |
Product | serine endoprotease |
Protein accession | YP_002116296 |
Protein GI | 194735465 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC ACACCCAGCT GTTAAGTGCA TTAGCGTTAA GTGTCGGGTT AACTCTTTCG GCGCCGTTTC CAGCCCTTGC ATCGATACCA GGCCAGGCGA CGCTGCCAAG CCTTGCCCCT ATGCTGGAGA AAGTGCTGCC TGCTGTCGTC AGCGTAAAAG TCGAGGGAAC CGCCGCCCAG AGCCAAAAAG TGCCGGAGGA GTTTAAAAAA TTCTTTGGCG AGGATCTGCC AGACCAGCCG TCCCAGCCGT TTGAAGGACT CGGTTCGGGG GTGATTATCG ATGCCGCGAA AGGCTATGTA TTAACCAATA ATCATGTGAT TAATCAGGCA CAAAAGATCA GCATTCAACT GAATGACGGA CGCGAATTCG ACGCGAAGCT GATCGGCGGC GACGACCAGA GCGATATCGC TCTGTTACAA ATTCAGAATC CCAGCAAGTT AACGCAAATT GCCATCGCCG ATTCCGACAA ACTCCGCGTC GGCGATTTCG CCGTGGCGGT CGGTAATCCG TTTGGTCTTG GACAAACCGC CACCTCCGGG ATTATTTCAG CGCTGGGACG CAGCGGGCTT AATCTGGAAG GGCTTGAGAA CTTTATTCAA ACCGATGCCT CTATTAACCG CGGCAACTCC GGCGGCGCGC TGCTTAACCT GAACGGCGAG CTGATCGGGA TTAATACCGC GATCCTCGCG CCAGGTGGCG GGAGTATCGG CATTGGCTTT GCTATTCCTT CCAATATGGC GCAGACGCTG GCGCAGCAGT TGATTCAGTT CGGCGAAATC AAACGCGGAT TGCTGGGAAT TAAAGGCACT GAAATGACCG CTGATATCGC CAAGGCATTC AAACTGAACG TTCAGCGTGG CGCTTTTGTC AGCGAGGTTT TACCCAATTC AGGTTCGGCG AAGGCCGGGG TGAAATCCGG AGACGTGATT ATCAGTCTTA ACGGTAAGCC GCTGAATAGC TTTGCCGAAC TGCGTTCACG TATCGCCACC ACCGAACCGG GCACGAAAGT GAAGCTGGGC CTGCTGCGCG ATGGTAAGCC GCTGGAGGTG GAAGTCACGC TGGATTCCAA TACCTCTTCT TCCGCCAGCG CCGAAATGAT CGCCCCGGCG TTGCAAGGCG CGACGTTGAG CGACGGCCAG CTGAAAGACG GGACGAAAGG CGTTAAGGTT GATAGCGTCG AAAAAAGCAG TCCTGCCGCG CAGGCCGGTT TGCAAAAAGA TGATGTTATC ATCGGCGTTA ACCGCGATCG TATCAGTTCT ATCGCCGAAA TGCGCAAAGT GATGGCGGCA AAACCGTCCA TCATTGCTCT TCAGGTAGTA CGCGGCAACG AGAACATTTA TCTATTGCTA CGCTAA
|
Protein sequence | MKKHTQLLSA LALSVGLTLS APFPALASIP GQATLPSLAP MLEKVLPAVV SVKVEGTAAQ SQKVPEEFKK FFGEDLPDQP SQPFEGLGSG VIIDAAKGYV LTNNHVINQA QKISIQLNDG REFDAKLIGG DDQSDIALLQ IQNPSKLTQI AIADSDKLRV GDFAVAVGNP FGLGQTATSG IISALGRSGL NLEGLENFIQ TDASINRGNS GGALLNLNGE LIGINTAILA PGGGSIGIGF AIPSNMAQTL AQQLIQFGEI KRGLLGIKGT EMTADIAKAF KLNVQRGAFV SEVLPNSGSA KAGVKSGDVI ISLNGKPLNS FAELRSRIAT TEPGTKVKLG LLRDGKPLEV EVTLDSNTSS SASAEMIAPA LQGATLSDGQ LKDGTKGVKV DSVEKSSPAA QAGLQKDDVI IGVNRDRISS IAEMRKVMAA KPSIIALQVV RGNENIYLLL R
|
| |