Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | STER_2002 |
Symbol | |
ID | 4437395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus thermophilus LMD-9 |
Kingdom | Bacteria |
Replicon accession | NC_008532 |
Strand | + |
Start bp | 1854162 |
End bp | 1855397 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 639677562 |
Product | trypsin-like serine protease |
Protein accession | YP_821300 |
Protein GI | 116628681 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.172847 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TTAACTGGAA GAAAATAGTC GCGCCAATTG CAATGCTAAT TATTGGCTTA CTAGGTGGTT TACTTGGTGC CTTTATCCTA CTAACAGCAG CCGGGGTATC TTTTACCAAT ACAACAGATA CTGGAGCAAA AACGGCTAAG ACCGTCTACA CCAATATAAC AGATACAACT AAGGCTGTTA AGAAAGTACA AAATGCCGTT GTTTCTGTCA TCAATTATCA AGAAGGTTCA TCTTCAGATT CTCTAAATGA CCTTTATGGC CGTATCTTTG GCGGAGGGGA CAGTTCTGAT TCTAGCCAAG AAAATTCAAA AGATTCAGAT GGCCTGCAGG TCGCTGGTGA AGGTTCTGGA GTCATCTATA AAAAAGATGG CAAAGAAGCC TACATCGTAA CCAATAACCA CGTTGTCGAT GGGGCTAAAA AACTCGAAAT CATGCTTTCG GATGGTTCGA AAATTACTGG TGAACTTGTT GGTAAAGACA CTTACTCTGA CCTAGCAGTT GTCAAAGTAT CTTCAGATAA AATAACAACT GTTGCAGAAT TTGCAGACTC AAACTCCCTT ACTGTTGGTG AAAAAGCAAT TGCTATTGGT AGCCCACTTG GTACCGAATA CGCCAACTCA GTAACAGAAG GAATCGTTTC TAGCCTTAGC CGTACTATAA CGATGCAAAA CGATAATGGT GAAACTGTAT CAACAAACGC TATCCAAACA GATGCAGCCA TTAACCCTGG TAACTCTGGT GGTGCCCTAG TCAATATTGA AGGACAAGTT ATCGGTATTA ACTCAAGTAA AATTTCATCA ACGTCTGCAG TCGCTGGTAG TGCTGTTGAA GGTATGGGGT TTGCCATTCC ATCAAACGAT GTTGTTGAAA TCATCAATCA ATTAGAAAAA GATGGTAAAG TTACACGACC AGCACTAGGG ATCTCAATAG CAGATCTTAA TAGCCTTTCT AGCAGCGCAA CTTCTAAATT AGATTTACCA GATGAGGTCA AATCCGGTGT TGTTGTCGGT AGTGTTCAGA AAGGTATGCC AGCTGACGGT AAACTTCAAG AATATGATGT TATCACTGAG ATTGATGGTA AGAAAATCAG CTCAAAAACT GATATTCAAA CCAATCTTTA CAGCCATAGT ATCGGAGATA CTATCAAGGT AACCTTCTAT CGTGGTAAAG ATAAGAAAAC TGTAGATCTT AAATTAACAA AATCTACAGA AGACATATCT GATTAA
|
Protein sequence | MKKINWKKIV APIAMLIIGL LGGLLGAFIL LTAAGVSFTN TTDTGAKTAK TVYTNITDTT KAVKKVQNAV VSVINYQEGS SSDSLNDLYG RIFGGGDSSD SSQENSKDSD GLQVAGEGSG VIYKKDGKEA YIVTNNHVVD GAKKLEIMLS DGSKITGELV GKDTYSDLAV VKVSSDKITT VAEFADSNSL TVGEKAIAIG SPLGTEYANS VTEGIVSSLS RTITMQNDNG ETVSTNAIQT DAAINPGNSG GALVNIEGQV IGINSSKISS TSAVAGSAVE GMGFAIPSND VVEIINQLEK DGKVTRPALG ISIADLNSLS SSATSKLDLP DEVKSGVVVG SVQKGMPADG KLQEYDVITE IDGKKISSKT DIQTNLYSHS IGDTIKVTFY RGKDKKTVDL KLTKSTEDIS D
|
| |