Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | STER_0710 |
Symbol | |
ID | 4437392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus thermophilus LMD-9 |
Kingdom | Bacteria |
Replicon accession | NC_008532 |
Strand | + |
Start bp | 646777 |
End bp | 647688 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 639676408 |
Product | hypothetical protein |
Protein accession | YP_820162 |
Protein GI | 116627543 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0265881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTGGA GAGTTGTACA TGTCAGTCAA AGCGAGAAGA TGCGCTTAAA GCTTGATAAC TTATTAGTGC AAAAGATGGG ACAAGAGTTT ACGGTGCCAC TAAGTGATAT TTCGATAATC GTTGCAGAAG GTGGGGATAC AGTTGTTACC CTTCGTCTAT TAAGTGCCTT AAGTAAATAT AATATTGCTT TAGTTGTTTG TGATAATGAA CATTTACCAA CAGGGATTTA TCACTCACAA AATGGGCACT TTAGAGCGTA CAAGCGCTTG AAAGAACAGC TGGATTGGTC TCAGAAACAA AAGGACAAGG CATGGCAGAT TGTAACTTAT TATAAAATCA ATAACCAAGA GGATGTCCTA GCCATGTTTG AAAAAAGTCT GGACAACATT AGATTACTTT CAGACTATAA AGAGCAGATA GAACCTGGTG ATAGAACGAA TAGAGAGGGA CATGCTGCCA AGGTCTACTT TAATGAGCTC TTTGGTAAAC AATTTGTCAG AGTAACTCAG AAAGAAGCTG ATGTCATCAA TGCTGGTTTA AACTATGGCT ATGCTATCAT GAGGGCTCAG ATGGCTAGAA TAGTGGCGGG TTATGGTTTA AATGGCCTAT TAGGAATCTT CCATAAAAAT GAATACAATC AGTTTAATTT GGTTGACGAC TTGATGGAGC CATTTAGACA GATTGTAGAT GTTTGGGTAT ATGATAATCT ACGAGATCAG GAATTCCTTA AGTATGAGTA TAGGTTGGGA TTGACAGATT TACTCAATGC TAAAATCAAA TATGGCAAAG AGACCTGTTC AGTGACAGTT GCTATGGACA AATATGTCAA AGGCTTTATC AAATATATTT CGGAAAAAGA TAGCAGTAAA TTCCACTGCC CAGTGGTATC AAGTTTAGAG TGGAGAAAAT AA
|
Protein sequence | MTWRVVHVSQ SEKMRLKLDN LLVQKMGQEF TVPLSDISII VAEGGDTVVT LRLLSALSKY NIALVVCDNE HLPTGIYHSQ NGHFRAYKRL KEQLDWSQKQ KDKAWQIVTY YKINNQEDVL AMFEKSLDNI RLLSDYKEQI EPGDRTNREG HAAKVYFNEL FGKQFVRVTQ KEADVINAGL NYGYAIMRAQ MARIVAGYGL NGLLGIFHKN EYNQFNLVDD LMEPFRQIVD VWVYDNLRDQ EFLKYEYRLG LTDLLNAKIK YGKETCSVTV AMDKYVKGFI KYISEKDSSK FHCPVVSSLE WRK
|
| |