Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_1082 |
Symbol | |
ID | 5170637 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | - |
Start bp | 1110514 |
End bp | 1111497 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640563599 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001244672 |
Protein GI | 148270212 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAAGAA ACTATTATGT TTTTTCCTCT GGAAGAATAC GAAGGCGAGA AAATAGTATC TTAATAGAAT ATCAGGATAG AGATGGGAAA CAGCAAAAAA GGTTCATTCC GGTAGAAAAC GTTGATCAGA TATTTTTCTT AGGTGAGGTT GATTTGAATT CAAAATTTCT GGATTTTGCT GCAAAAAACA ATATTGTTCT CCATTTTTTC AATTATTATG GATATTACAC CGGCTCTTTT TATCCAAGGG AAAAATTTCT CTCAGGAGAA CTTTTGGTAA GACAGGTGGA ACACTATCTG GATAATGAGA AAAGGTTGAG TTTGGCGAGA AAGTTTGTAG AAGGAGCAAT CCATAACTTC AAACGAAACA TCGAAAAAAG AGGATTCGAT ATTGTCAGTA AGATATCCGA ATATCAGGAA AGAATAAAGC ACGTAGCGAC TATCCCGGAG CTCATGAGTT GTGAAGCGCA CGCCAGAAAG CTCTATTACT CTACCTGGGA AGATATAACA GACTGGCCGT TTGAAGAAAG AAGCATGCAA CCGCCTTTGA ATGAACTGAA TGCTCTTATT TCCTTTGGAA ACTCTCTTAC CTATTCCGTT GTTCTGAAGG AACTTTATCA TACACATTTG AATCCTACCG TTAGTTATCT TCATGAACCG GGTACCAAAA GGTTTTCGCT TGCTTTGGAC ATATCGGAGA TATTTAAACC TATTTTTGTT GATAGGATAA TATTCAAACT CATAAATCTT GGCAAGATTA AACGTGAAAA TCATTTCCTT CAGGAATCCA ATGGAGTGTT TTTAAACGAT GAAGGACGAA GAATATTTGT GGAGGAATTC GAAAATATGC TTCAACAAAC AGTTCTTCAT AGAAAGCTGA AGAGAAAGGT TAAATATCAG TCTTTCATAA GATTGGAGGC TTATAAAATA ATCAAACATC TGCTCGGAGA AGATGAGTAC AAGCCTTTCA AGGTTTGGTG GTAG
|
Protein sequence | MGRNYYVFSS GRIRRRENSI LIEYQDRDGK QQKRFIPVEN VDQIFFLGEV DLNSKFLDFA AKNNIVLHFF NYYGYYTGSF YPREKFLSGE LLVRQVEHYL DNEKRLSLAR KFVEGAIHNF KRNIEKRGFD IVSKISEYQE RIKHVATIPE LMSCEAHARK LYYSTWEDIT DWPFEERSMQ PPLNELNALI SFGNSLTYSV VLKELYHTHL NPTVSYLHEP GTKRFSLALD ISEIFKPIFV DRIIFKLINL GKIKRENHFL QESNGVFLND EGRRIFVEEF ENMLQQTVLH RKLKRKVKYQ SFIRLEAYKI IKHLLGEDEY KPFKVWW
|
| |