Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1023 |
Symbol | |
ID | 6092453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1070046 |
End bp | 1071005 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642488219 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001739056 |
Protein GI | 170288818 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAGTG TTTATCTATT TTCAAGCGGA ACGTTAAAGA GAAAAGCGAA TACTATCTGT CTTGAATCAG AGGCGGGAAG AAAGTACATA CCTGTCGAAA ATGTGATGGA TATAAAGGTT TTTGGGGAGG TTGATCTCAA CAAAAGATTC CTTGAGTTTC TTTCTCAGAA AAGAATTCCT ATTCACTTCT TCAACAGAGA GGGGTATTAT GTAGGCACTT TTTATCCCAG AGAGTATTTA AACAGCGGTT TTCTGATACT GAAACAGGCA GAACACTACA TCAACCAAGA AAAGAGAATG CTCATAGCAA GAGAAATAGT TTCAAGATCG ATTCAAAACA TGATCGACTT TTTGAAAAAA CGAAAAGTCC AGGCTGATTC ACTAACGAGG TATAAAAAGA AAGCAGAAGA GGCGAGCAAT GTATCAGAGT TGATGGGAAT AGAAGGAAAC GCAAGAGAAG GGTACTACTC GATTATGGAC AATCTCGTGT CGGATGAAAG ATTCCGCATA GAGAAGAGAA CAAGAAGACC CCCTAAAAAC TTCGCCAATA CACTCATCAG TTTTGGAAAC TCGCTTCTTT ACACCACCGT TTTGAGTCTC ATCTATCAAA CACATCTGGA CCCGAGGATA GGATATCTCC ATGAGACGAA TTTCAGAAGG TTCTCACTCA ATCTTGATAT AGCAGAGCTG TTCAAACCAG CCGTGGTGGA TAGGTTGTTT TTGAATCTCG TCAACACTCG TCAAATAAAC GAAAGGCATT TCGATGAAAT CTCAGAGGGT CTCATGCTCA ACGATCAGGG AAAAAGTCTG TTTATCAAAA ATTACGAACA AATTTTGAGG GAAACGGTTT TTCACAAAAA GTTGAATCGG TACGTTTCCA TGAGATCTCT GATAAAGATG GAACTTCATA AACTGGAGAA GCACCTCATA GGTGAACAGG TTTTCGGATC TGAGGAATGA
|
Protein sequence | MESVYLFSSG TLKRKANTIC LESEAGRKYI PVENVMDIKV FGEVDLNKRF LEFLSQKRIP IHFFNREGYY VGTFYPREYL NSGFLILKQA EHYINQEKRM LIAREIVSRS IQNMIDFLKK RKVQADSLTR YKKKAEEASN VSELMGIEGN AREGYYSIMD NLVSDERFRI EKRTRRPPKN FANTLISFGN SLLYTTVLSL IYQTHLDPRI GYLHETNFRR FSLNLDIAEL FKPAVVDRLF LNLVNTRQIN ERHFDEISEG LMLNDQGKSL FIKNYEQILR ETVFHKKLNR YVSMRSLIKM ELHKLEKHLI GEQVFGSEE
|
| |