Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | STER_0709 |
Symbol | |
ID | 4437391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus thermophilus LMD-9 |
Kingdom | Bacteria |
Replicon accession | NC_008532 |
Strand | + |
Start bp | 643235 |
End bp | 646600 |
Gene Length | 3366 bp |
Protein Length | 1121 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 639676407 |
Product | CRISPR-system-like protein |
Protein accession | YP_820161 |
Protein GI | 116627542 |
COG category | [S] Function unknown |
COG ID | [COG3513] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01865] CRISPR-associated protein, Csn1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGACT TAGTTTTAGG ACTTGATATC GGTATAGGTT CTGTTGGTGT GGGTATCCTT AACAAAGTGA CAGGAGAAAT TATCCATAAA AACTCACGCA TCTTCCCAGC AGCTCAAGCG GAAAATAACC TAGTACGTAG AACGAATCGT CAAGGAAGAC GCTTGGCACG ACGTAAAAAA CATCGTAGAG TTCGTTTAAA TCGTCTATTT GAGGAAAGTG GATTAATCAC TGATTTTACG AAGATTTCAA TTAATCTTAA CCCATATCAA TTACGAGTTA AGGGCTTGAC CGATGAATTG TCTAATGAAG AACTGTTTAT CGCTCTTAAA AATATGGTGA AACACCGTGG GATTAGTTAC CTCGATGATG CTAGTGATGA CGGAAATTCA TCAGTAGGAG ACTATGCACA AATTGTTAAG GAAAATAGTA AACAATTAGA AACTAAGACA CCGGGACAGA TACAGTTGGA ACGCTACCAA ACATATGGTC AATTACGTGG TGATTTTACT GTTGAGAAAG ATGGCAAAAA ACATCGCTTG ATTAATGTCT TTCCAACATC AGCTTATCGT TCAGAAGCCT TAAGGATACT GCAAACTCAA CAAGAATTTA ATCCACAGAT TACAGATGAA TTTATTAATC GTTATCTCGA AATTTTAACT GGAAAACGGA AATATTATCA TGGACCCGGA AATGAAAAGT CACGGACTGA TTATGGTCGT TACAGAACGA GTGGAGAAAC TTTAGACAAT ATTTTTGGAA TTCTAATTGG GAAATGTACA TTTTATCCAG ACGAGTTTAG AGCAGCAAAA GCTTCCTACA CGGCTCAAGA ATTCAATTTG CTAAATGATT TGAACAATCT AACAGTTCCT ACTGAAACCA AAAAGTTGAG CAAAGAACAG AAGAATCAAA TCATTAATTA TGTCAAAAAT GAAAAGGCAA TGGGGCCAGC GAAACTTTTT AAATATATCG CTAAGTTACT TTCTTGTGAT GTTGCAGATA TCAAGGGATA CCGTATCGAC AAATCAGGTA AGGCTGAGAT TCATACTTTC GAAGCCTATC GAAAAATGAA AACGCTTGAA ACCTTAGATA TTGAGCAAAT GGATAGAGAA ACGCTTGATA AATTAGCCTA TGTCTTAACA TTAAACACTG AGAGGGAAGG TATTCAAGAA GCCTTAGAAC ATGAATTTGC TGATGGTAGC TTTAGCCAGA AGCAAGTTGA CGAATTGGTT CAATTCCGCA AAGCAAATAG TTCCATTTTT GGAAAAGGAT GGCATAATTT TTCTGTCAAA CTGATGATGG AGTTAATTCC AGAATTGTAT GAGACGTCAG AAGAGCAAAT GACTATCCTA ACACGACTTG GAAAACAAAA AACGACTTCG TCTTCAAATA AAACAAAATA TATAGATGAA AAACTATTAA CTGAAGAAAT CTATAATCCT GTTGTTGCTA AGTCTGTTCG CCAGGCTATA AAAATCGTAA ATGCGGCGAT TAAAGAATAC GGAGACTTTG ACAATATTGT CATCGAAATG GCTCGTGAAA CAAATGAAGA TGATGAAAAG AAAGCTATTC AAAAGATTCA AAAAGCCAAC AAAGATGAAA AAGATGCAGC AATGCTTAAG GCTGCTAACC AATATAATGG AAAGGCTGAA TTACCACATA GTGTTTTCCA CGGTCATAAG CAATTAGCGA CTAAAATCCG CCTTTGGCAT CAGCAAGGAG AACGTTGCCT TTATACTGGT AAGACAATCT CAATCCATGA TTTGATAAAT AATTCTAATC AGTTTGAAGT AGATCATATT TTACCTCTTT CTATCACATT CGATGATAGC CTTGCAAATA AGGTTTTGGT TTATGCAACT GCTAACCAAG AAAAAGGACA ACGAACACCT TATCAGGCTT TAGATAGTAT GGATGATGCG TGGTCTTTCC GTGAATTAAA AGCTTTTGTA CGTGAGTCAA AAACACTTTC GAACAAGAAA AAAGAATACC TCCTTACAGA AGAAGATATT TCAAAGTTTG ATGTTCGAAA GAAATTTATT GAACGAAATC TTGTAGATAC AAGATACGCT TCAAGAGTTG TCCTCAATGC CCTTCAAGAA CACTTTAGAG CTCACAAGAT TGATACAAAA GTTTCCGTGG TTCGTGGCCA ATTTACATCT CAATTGAGAC GCCATTGGGG AATTGAGAAG ACTCGTGATA CTTATCATCA CCATGCTGTC GATGCATTGA TTATTGCCGC CTCAAGTCAG TTGAATTTGT GGAAAAAACA AAAGAATACC CTTGTAAGTT ATTCAGAAGA CCAACTCCTT GATATTGAAA CAGGTGAACT TATTAGTGAT GATGAGTACA AGGAATCTGT GTTCAAAGCC CCTTATCAAC ATTTTGTTGA TACATTGAAG AGTAAAGAAT TTGAAGACAG TATCTTATTC TCTTATCAAG TGGATTCTAA GTTTAATCGT AAAATATCAG ATGCCACTAT TTATGCGACA AGACAGGCTA AAGTAGGAAA AGATAAGGCG GATGAAACTT ATGTCTTAGG GAAAATCAAA GATATCTATA CTCAGGATGG TTATGATGCC TTTATGAAGA TTTATAAGAA GGATAAGTCA AAATTCCTCA TGTATCGTCA CGACCCACAA ACCTTTGAGA AAGTTATCGA GCCAATTTTA GAGAACTATC CTAATAAGCA AATAAATGAA AAAGGGAAAG AGGTACCATG TAATCCTTTC CTAAAATATA AAGAAGAACA TGGCTATATT CGTAAATATA GTAAAAAAGG CAATGGTCCT GAAATCAAGA GTCTTAAATA CTATGATAGT AAGCTAGGCA ATCATATTGA TATTACTCCG AAGGATAGTA ACAATAAAGT TGTCTTACAG TCAGTTTCTC CATGGAGAGC GGATGTCTAT TTCAATAAGA CTACTGGAAA ATACGAAATC CTCGGATTGA AATATGCTGA TTTACAATTT GAAAAAGGGA CAGGAACATA TAAGATTTCC CAGGAAAAAT ACAATGACAT TAAGAAAAAA GAGGGTGTAG ATTCTGATTC AGAATTCAAG TTTACACTTT ATAAAAATGA TTTGTTACTC GTTAAAGATA CAGAAACAAA AGAACAACAG CTTTTCCGTT TTCTTTCTCG AACTATGCCT AAACAAAAGC ATTATGTTGA ATTAAAACCT TATGATAAAC AGAAATTTGA AGGAGGTGAG GCGTTAATTA AAGTGTTGGG TAACGTTGCT AATAGTGGTC AATGCAAAAA AGGGCTAGGA AAATCAAATA TTTCTATTTA TAAGGTAAGA ACAGATGTCC TAGGAAATCA GCATATCATC AAAAATGAGG GTGATAAGCC TAAGCTAGAT TTTTAA
|
Protein sequence | MSDLVLGLDI GIGSVGVGIL NKVTGEIIHK NSRIFPAAQA ENNLVRRTNR QGRRLARRKK HRRVRLNRLF EESGLITDFT KISINLNPYQ LRVKGLTDEL SNEELFIALK NMVKHRGISY LDDASDDGNS SVGDYAQIVK ENSKQLETKT PGQIQLERYQ TYGQLRGDFT VEKDGKKHRL INVFPTSAYR SEALRILQTQ QEFNPQITDE FINRYLEILT GKRKYYHGPG NEKSRTDYGR YRTSGETLDN IFGILIGKCT FYPDEFRAAK ASYTAQEFNL LNDLNNLTVP TETKKLSKEQ KNQIINYVKN EKAMGPAKLF KYIAKLLSCD VADIKGYRID KSGKAEIHTF EAYRKMKTLE TLDIEQMDRE TLDKLAYVLT LNTEREGIQE ALEHEFADGS FSQKQVDELV QFRKANSSIF GKGWHNFSVK LMMELIPELY ETSEEQMTIL TRLGKQKTTS SSNKTKYIDE KLLTEEIYNP VVAKSVRQAI KIVNAAIKEY GDFDNIVIEM ARETNEDDEK KAIQKIQKAN KDEKDAAMLK AANQYNGKAE LPHSVFHGHK QLATKIRLWH QQGERCLYTG KTISIHDLIN NSNQFEVDHI LPLSITFDDS LANKVLVYAT ANQEKGQRTP YQALDSMDDA WSFRELKAFV RESKTLSNKK KEYLLTEEDI SKFDVRKKFI ERNLVDTRYA SRVVLNALQE HFRAHKIDTK VSVVRGQFTS QLRRHWGIEK TRDTYHHHAV DALIIAASSQ LNLWKKQKNT LVSYSEDQLL DIETGELISD DEYKESVFKA PYQHFVDTLK SKEFEDSILF SYQVDSKFNR KISDATIYAT RQAKVGKDKA DETYVLGKIK DIYTQDGYDA FMKIYKKDKS KFLMYRHDPQ TFEKVIEPIL ENYPNKQINE KGKEVPCNPF LKYKEEHGYI RKYSKKGNGP EIKSLKYYDS KLGNHIDITP KDSNNKVVLQ SVSPWRADVY FNKTTGKYEI LGLKYADLQF EKGTGTYKIS QEKYNDIKKK EGVDSDSEFK FTLYKNDLLL VKDTETKEQQ LFRFLSRTMP KQKHYVELKP YDKQKFEGGE ALIKVLGNVA NSGQCKKGLG KSNISIYKVR TDVLGNQHII KNEGDKPKLD F
|
| |