Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2297 |
Symbol | |
ID | 4809886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2741525 |
End bp | 2742517 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640107703 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001038692 |
Protein GI | 125974782 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.150876 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA GTGCTTTTAT TTTTAGTGAC GGAGAGTTGA AAAGGAAGGA CAGTACAGTA CTTTTCGAGA GCGAAGATTC AAAAAATTAT CTTCCGATTG AGGATATCAG CGATATTTAT ATTTTCGGTG AGGTTACCGT TACAAAGAAG TTTCTGGAGC TGGCAACTCA GAAGGAAATA CTTCTTCATT TTTATAACTA CAATGAATAT TATGTTGGAA CTTATTATCC CAGAGAGCAT TATAATTCCG GTTTTATGAT ATTAAAACAG GCGGAGCATT ATCTTGATGA GGAGAAGAGA ATGGCAATTG CTAAAAAGTT CATACATGGC AGTGTGAAAA ACATGCTTGC GGTTTTAAAA TACTACAATA ACCGGGAAAA AGATTTGGAC AGACAGATAA CGGCTATTAG TGATTTGGCA GAAAAAATAG ATGAGATGGA CGAAATCAAC AAATTAATGG CTATTGAGGG AAATATAAGA GAGATTTACT ATGGTTCCTT CGACATTATA GTAGATGATG AGTATTTTGA GTTTGGTAAA AGAACAAAAC AACCACCGAA AAATCGAATG AATTCTTTGA TCAGCTTTGG AAACAGCATA CTTTATACGA CTGTTTTAAG CGAGATATAT AAAACTCATT TGGATCCGAG GATAGGATAC CTTCATTCTA CAAACCACAG AAGATTCACT TTAAATCTTG ATGTTGCTGA AATTTTTAAG CCAATCATTG TTGACAGAGT CATTTTTACG TTGATTGGCA AAAAAATGCT GGGAGAAAAG CATTTTGAAG AAAAAGCCGG TGGAATTGTC TTAAACGACA AGGGGCGAAA GCAGTTTGTA GCGCAAATGC TTGAGAAGTT AAATGCAACT TTAATGTATA AGCCCTTGGG GAGAGAAGTA TCTTACAGGA GACTTATCAG GTTGGAATTG TATAAGCTTG AGAAACATCT TATGGGAGAG CAGGAATATA AACCATACGT TGCTTCATGG TAA
|
Protein sequence | MKKSAFIFSD GELKRKDSTV LFESEDSKNY LPIEDISDIY IFGEVTVTKK FLELATQKEI LLHFYNYNEY YVGTYYPREH YNSGFMILKQ AEHYLDEEKR MAIAKKFIHG SVKNMLAVLK YYNNREKDLD RQITAISDLA EKIDEMDEIN KLMAIEGNIR EIYYGSFDII VDDEYFEFGK RTKQPPKNRM NSLISFGNSI LYTTVLSEIY KTHLDPRIGY LHSTNHRRFT LNLDVAEIFK PIIVDRVIFT LIGKKMLGEK HFEEKAGGIV LNDKGRKQFV AQMLEKLNAT LMYKPLGREV SYRRLIRLEL YKLEKHLMGE QEYKPYVASW
|
| |