Gene STER_0709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSTER_0709 
Symbol 
ID4437391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus thermophilus LMD-9 
KingdomBacteria 
Replicon accessionNC_008532 
Strand
Start bp643235 
End bp646600 
Gene Length3366 bp 
Protein Length1121 aa 
Translation table11 
GC content35% 
IMG OID639676407 
ProductCRISPR-system-like protein 
Protein accessionYP_820161 
Protein GI116627542 
COG category[S] Function unknown 
COG ID[COG3513] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01865] CRISPR-associated protein, Csn1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGACT TAGTTTTAGG ACTTGATATC GGTATAGGTT CTGTTGGTGT GGGTATCCTT 
AACAAAGTGA CAGGAGAAAT TATCCATAAA AACTCACGCA TCTTCCCAGC AGCTCAAGCG
GAAAATAACC TAGTACGTAG AACGAATCGT CAAGGAAGAC GCTTGGCACG ACGTAAAAAA
CATCGTAGAG TTCGTTTAAA TCGTCTATTT GAGGAAAGTG GATTAATCAC TGATTTTACG
AAGATTTCAA TTAATCTTAA CCCATATCAA TTACGAGTTA AGGGCTTGAC CGATGAATTG
TCTAATGAAG AACTGTTTAT CGCTCTTAAA AATATGGTGA AACACCGTGG GATTAGTTAC
CTCGATGATG CTAGTGATGA CGGAAATTCA TCAGTAGGAG ACTATGCACA AATTGTTAAG
GAAAATAGTA AACAATTAGA AACTAAGACA CCGGGACAGA TACAGTTGGA ACGCTACCAA
ACATATGGTC AATTACGTGG TGATTTTACT GTTGAGAAAG ATGGCAAAAA ACATCGCTTG
ATTAATGTCT TTCCAACATC AGCTTATCGT TCAGAAGCCT TAAGGATACT GCAAACTCAA
CAAGAATTTA ATCCACAGAT TACAGATGAA TTTATTAATC GTTATCTCGA AATTTTAACT
GGAAAACGGA AATATTATCA TGGACCCGGA AATGAAAAGT CACGGACTGA TTATGGTCGT
TACAGAACGA GTGGAGAAAC TTTAGACAAT ATTTTTGGAA TTCTAATTGG GAAATGTACA
TTTTATCCAG ACGAGTTTAG AGCAGCAAAA GCTTCCTACA CGGCTCAAGA ATTCAATTTG
CTAAATGATT TGAACAATCT AACAGTTCCT ACTGAAACCA AAAAGTTGAG CAAAGAACAG
AAGAATCAAA TCATTAATTA TGTCAAAAAT GAAAAGGCAA TGGGGCCAGC GAAACTTTTT
AAATATATCG CTAAGTTACT TTCTTGTGAT GTTGCAGATA TCAAGGGATA CCGTATCGAC
AAATCAGGTA AGGCTGAGAT TCATACTTTC GAAGCCTATC GAAAAATGAA AACGCTTGAA
ACCTTAGATA TTGAGCAAAT GGATAGAGAA ACGCTTGATA AATTAGCCTA TGTCTTAACA
TTAAACACTG AGAGGGAAGG TATTCAAGAA GCCTTAGAAC ATGAATTTGC TGATGGTAGC
TTTAGCCAGA AGCAAGTTGA CGAATTGGTT CAATTCCGCA AAGCAAATAG TTCCATTTTT
GGAAAAGGAT GGCATAATTT TTCTGTCAAA CTGATGATGG AGTTAATTCC AGAATTGTAT
GAGACGTCAG AAGAGCAAAT GACTATCCTA ACACGACTTG GAAAACAAAA AACGACTTCG
TCTTCAAATA AAACAAAATA TATAGATGAA AAACTATTAA CTGAAGAAAT CTATAATCCT
GTTGTTGCTA AGTCTGTTCG CCAGGCTATA AAAATCGTAA ATGCGGCGAT TAAAGAATAC
GGAGACTTTG ACAATATTGT CATCGAAATG GCTCGTGAAA CAAATGAAGA TGATGAAAAG
AAAGCTATTC AAAAGATTCA AAAAGCCAAC AAAGATGAAA AAGATGCAGC AATGCTTAAG
GCTGCTAACC AATATAATGG AAAGGCTGAA TTACCACATA GTGTTTTCCA CGGTCATAAG
CAATTAGCGA CTAAAATCCG CCTTTGGCAT CAGCAAGGAG AACGTTGCCT TTATACTGGT
AAGACAATCT CAATCCATGA TTTGATAAAT AATTCTAATC AGTTTGAAGT AGATCATATT
TTACCTCTTT CTATCACATT CGATGATAGC CTTGCAAATA AGGTTTTGGT TTATGCAACT
GCTAACCAAG AAAAAGGACA ACGAACACCT TATCAGGCTT TAGATAGTAT GGATGATGCG
TGGTCTTTCC GTGAATTAAA AGCTTTTGTA CGTGAGTCAA AAACACTTTC GAACAAGAAA
AAAGAATACC TCCTTACAGA AGAAGATATT TCAAAGTTTG ATGTTCGAAA GAAATTTATT
GAACGAAATC TTGTAGATAC AAGATACGCT TCAAGAGTTG TCCTCAATGC CCTTCAAGAA
CACTTTAGAG CTCACAAGAT TGATACAAAA GTTTCCGTGG TTCGTGGCCA ATTTACATCT
CAATTGAGAC GCCATTGGGG AATTGAGAAG ACTCGTGATA CTTATCATCA CCATGCTGTC
GATGCATTGA TTATTGCCGC CTCAAGTCAG TTGAATTTGT GGAAAAAACA AAAGAATACC
CTTGTAAGTT ATTCAGAAGA CCAACTCCTT GATATTGAAA CAGGTGAACT TATTAGTGAT
GATGAGTACA AGGAATCTGT GTTCAAAGCC CCTTATCAAC ATTTTGTTGA TACATTGAAG
AGTAAAGAAT TTGAAGACAG TATCTTATTC TCTTATCAAG TGGATTCTAA GTTTAATCGT
AAAATATCAG ATGCCACTAT TTATGCGACA AGACAGGCTA AAGTAGGAAA AGATAAGGCG
GATGAAACTT ATGTCTTAGG GAAAATCAAA GATATCTATA CTCAGGATGG TTATGATGCC
TTTATGAAGA TTTATAAGAA GGATAAGTCA AAATTCCTCA TGTATCGTCA CGACCCACAA
ACCTTTGAGA AAGTTATCGA GCCAATTTTA GAGAACTATC CTAATAAGCA AATAAATGAA
AAAGGGAAAG AGGTACCATG TAATCCTTTC CTAAAATATA AAGAAGAACA TGGCTATATT
CGTAAATATA GTAAAAAAGG CAATGGTCCT GAAATCAAGA GTCTTAAATA CTATGATAGT
AAGCTAGGCA ATCATATTGA TATTACTCCG AAGGATAGTA ACAATAAAGT TGTCTTACAG
TCAGTTTCTC CATGGAGAGC GGATGTCTAT TTCAATAAGA CTACTGGAAA ATACGAAATC
CTCGGATTGA AATATGCTGA TTTACAATTT GAAAAAGGGA CAGGAACATA TAAGATTTCC
CAGGAAAAAT ACAATGACAT TAAGAAAAAA GAGGGTGTAG ATTCTGATTC AGAATTCAAG
TTTACACTTT ATAAAAATGA TTTGTTACTC GTTAAAGATA CAGAAACAAA AGAACAACAG
CTTTTCCGTT TTCTTTCTCG AACTATGCCT AAACAAAAGC ATTATGTTGA ATTAAAACCT
TATGATAAAC AGAAATTTGA AGGAGGTGAG GCGTTAATTA AAGTGTTGGG TAACGTTGCT
AATAGTGGTC AATGCAAAAA AGGGCTAGGA AAATCAAATA TTTCTATTTA TAAGGTAAGA
ACAGATGTCC TAGGAAATCA GCATATCATC AAAAATGAGG GTGATAAGCC TAAGCTAGAT
TTTTAA
 
Protein sequence
MSDLVLGLDI GIGSVGVGIL NKVTGEIIHK NSRIFPAAQA ENNLVRRTNR QGRRLARRKK 
HRRVRLNRLF EESGLITDFT KISINLNPYQ LRVKGLTDEL SNEELFIALK NMVKHRGISY
LDDASDDGNS SVGDYAQIVK ENSKQLETKT PGQIQLERYQ TYGQLRGDFT VEKDGKKHRL
INVFPTSAYR SEALRILQTQ QEFNPQITDE FINRYLEILT GKRKYYHGPG NEKSRTDYGR
YRTSGETLDN IFGILIGKCT FYPDEFRAAK ASYTAQEFNL LNDLNNLTVP TETKKLSKEQ
KNQIINYVKN EKAMGPAKLF KYIAKLLSCD VADIKGYRID KSGKAEIHTF EAYRKMKTLE
TLDIEQMDRE TLDKLAYVLT LNTEREGIQE ALEHEFADGS FSQKQVDELV QFRKANSSIF
GKGWHNFSVK LMMELIPELY ETSEEQMTIL TRLGKQKTTS SSNKTKYIDE KLLTEEIYNP
VVAKSVRQAI KIVNAAIKEY GDFDNIVIEM ARETNEDDEK KAIQKIQKAN KDEKDAAMLK
AANQYNGKAE LPHSVFHGHK QLATKIRLWH QQGERCLYTG KTISIHDLIN NSNQFEVDHI
LPLSITFDDS LANKVLVYAT ANQEKGQRTP YQALDSMDDA WSFRELKAFV RESKTLSNKK
KEYLLTEEDI SKFDVRKKFI ERNLVDTRYA SRVVLNALQE HFRAHKIDTK VSVVRGQFTS
QLRRHWGIEK TRDTYHHHAV DALIIAASSQ LNLWKKQKNT LVSYSEDQLL DIETGELISD
DEYKESVFKA PYQHFVDTLK SKEFEDSILF SYQVDSKFNR KISDATIYAT RQAKVGKDKA
DETYVLGKIK DIYTQDGYDA FMKIYKKDKS KFLMYRHDPQ TFEKVIEPIL ENYPNKQINE
KGKEVPCNPF LKYKEEHGYI RKYSKKGNGP EIKSLKYYDS KLGNHIDITP KDSNNKVVLQ
SVSPWRADVY FNKTTGKYEI LGLKYADLQF EKGTGTYKIS QEKYNDIKKK EGVDSDSEFK
FTLYKNDLLL VKDTETKEQQ LFRFLSRTMP KQKHYVELKP YDKQKFEGGE ALIKVLGNVA
NSGQCKKGLG KSNISIYKVR TDVLGNQHII KNEGDKPKLD F