Gene STER_2002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSTER_2002 
Symbol 
ID4437395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus thermophilus LMD-9 
KingdomBacteria 
Replicon accessionNC_008532 
Strand
Start bp1854162 
End bp1855397 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content39% 
IMG OID639677562 
Producttrypsin-like serine protease 
Protein accessionYP_821300 
Protein GI116628681 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.172847 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TTAACTGGAA GAAAATAGTC GCGCCAATTG CAATGCTAAT TATTGGCTTA 
CTAGGTGGTT TACTTGGTGC CTTTATCCTA CTAACAGCAG CCGGGGTATC TTTTACCAAT
ACAACAGATA CTGGAGCAAA AACGGCTAAG ACCGTCTACA CCAATATAAC AGATACAACT
AAGGCTGTTA AGAAAGTACA AAATGCCGTT GTTTCTGTCA TCAATTATCA AGAAGGTTCA
TCTTCAGATT CTCTAAATGA CCTTTATGGC CGTATCTTTG GCGGAGGGGA CAGTTCTGAT
TCTAGCCAAG AAAATTCAAA AGATTCAGAT GGCCTGCAGG TCGCTGGTGA AGGTTCTGGA
GTCATCTATA AAAAAGATGG CAAAGAAGCC TACATCGTAA CCAATAACCA CGTTGTCGAT
GGGGCTAAAA AACTCGAAAT CATGCTTTCG GATGGTTCGA AAATTACTGG TGAACTTGTT
GGTAAAGACA CTTACTCTGA CCTAGCAGTT GTCAAAGTAT CTTCAGATAA AATAACAACT
GTTGCAGAAT TTGCAGACTC AAACTCCCTT ACTGTTGGTG AAAAAGCAAT TGCTATTGGT
AGCCCACTTG GTACCGAATA CGCCAACTCA GTAACAGAAG GAATCGTTTC TAGCCTTAGC
CGTACTATAA CGATGCAAAA CGATAATGGT GAAACTGTAT CAACAAACGC TATCCAAACA
GATGCAGCCA TTAACCCTGG TAACTCTGGT GGTGCCCTAG TCAATATTGA AGGACAAGTT
ATCGGTATTA ACTCAAGTAA AATTTCATCA ACGTCTGCAG TCGCTGGTAG TGCTGTTGAA
GGTATGGGGT TTGCCATTCC ATCAAACGAT GTTGTTGAAA TCATCAATCA ATTAGAAAAA
GATGGTAAAG TTACACGACC AGCACTAGGG ATCTCAATAG CAGATCTTAA TAGCCTTTCT
AGCAGCGCAA CTTCTAAATT AGATTTACCA GATGAGGTCA AATCCGGTGT TGTTGTCGGT
AGTGTTCAGA AAGGTATGCC AGCTGACGGT AAACTTCAAG AATATGATGT TATCACTGAG
ATTGATGGTA AGAAAATCAG CTCAAAAACT GATATTCAAA CCAATCTTTA CAGCCATAGT
ATCGGAGATA CTATCAAGGT AACCTTCTAT CGTGGTAAAG ATAAGAAAAC TGTAGATCTT
AAATTAACAA AATCTACAGA AGACATATCT GATTAA
 
Protein sequence
MKKINWKKIV APIAMLIIGL LGGLLGAFIL LTAAGVSFTN TTDTGAKTAK TVYTNITDTT 
KAVKKVQNAV VSVINYQEGS SSDSLNDLYG RIFGGGDSSD SSQENSKDSD GLQVAGEGSG
VIYKKDGKEA YIVTNNHVVD GAKKLEIMLS DGSKITGELV GKDTYSDLAV VKVSSDKITT
VAEFADSNSL TVGEKAIAIG SPLGTEYANS VTEGIVSSLS RTITMQNDNG ETVSTNAIQT
DAAINPGNSG GALVNIEGQV IGINSSKISS TSAVAGSAVE GMGFAIPSND VVEIINQLEK
DGKVTRPALG ISIADLNSLS SSATSKLDLP DEVKSGVVVG SVQKGMPADG KLQEYDVITE
IDGKKISSKT DIQTNLYSHS IGDTIKVTFY RGKDKKTVDL KLTKSTEDIS D