Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | STER_1476 |
Symbol | |
ID | 4437922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus thermophilus LMD-9 |
Kingdom | Bacteria |
Replicon accession | NC_008532 |
Strand | - |
Start bp | 1379106 |
End bp | 1379975 |
Gene Length | 870 bp |
Protein Length | 289 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 639677080 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_820831 |
Protein GI | 116628212 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.095696 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGGTT GGAGAACAGT AGTCGTAAAC ATACATTCTA AACTGTCATA TAAGAATAAT CACCTTATTT TTAGAAACTC ATATAAAACT GAGATGATTC ATCTTTCTGA AATTGATATA CTTTTATTGG AAACAACTGA CATTGTATTG ACTACTATGT TAGTGAAGAG ATTAGTTGAT GAAAATATTC TAGTTATTTT CTGTGATGAT AAACGATTGC CGACAGCTTT TCTTACACCG TATTATGCAC GTCATGATTC TAGTCTGCAA ATAGCTAGAC AAATTGCGTG GAAGGAAAAT GTTAAATGTG AGGTATGGAC TGCTATTATT GCGCAAAAGA TATTGAATCA ATCTTACTAT TTGGGAGAAT GTTCCTTCTT TGAGAAATCT CAATCTATTA TGGAATTATA TCATGGATTA GAACGGTTTG ATCCTAGTAA TCGTGAGGGT CACTCTGCCA GAATTTATTT TAATACGCTT TTCGGTAATG ATTTTACTAG GGAAAGTGAT AATGATATCA ATGCAGCGCT GGATTATGGT TATACCTTGT TATTGAGTAT GTTTGCACGT GAGGTTGTCG TCTGTGGTTG TATGACACAA ATTGGTTTAA AACATGCTAA CCAGTTTAAT CAATTTAACC TTGCTAGTGA TATTATGGAG CCATTTAGGC CAATTATAGA TAGAATTGTT TATCAAAATC GACATAATAA TTTCGTCAAA ATCAAAAAAG AACTTTTTTC GATCTTTTCA GAAACCTATC TCTATAATGG AAAAGAGATG TATTTATCAA ACATTGTTAG TGATTATACA AAAAAAGTTA TAAAAGCGTT AAATCAGTTA GGTGAGGAAA TTCCTGAATT TAGAATATGA
|
Protein sequence | MAGWRTVVVN IHSKLSYKNN HLIFRNSYKT EMIHLSEIDI LLLETTDIVL TTMLVKRLVD ENILVIFCDD KRLPTAFLTP YYARHDSSLQ IARQIAWKEN VKCEVWTAII AQKILNQSYY LGECSFFEKS QSIMELYHGL ERFDPSNREG HSARIYFNTL FGNDFTRESD NDINAALDYG YTLLLSMFAR EVVVCGCMTQ IGLKHANQFN QFNLASDIME PFRPIIDRIV YQNRHNNFVK IKKELFSIFS ETYLYNGKEM YLSNIVSDYT KKVIKALNQL GEEIPEFRI
|
| |