Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0127 |
Symbol | |
ID | 7408489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 149984 |
End bp | 150964 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643714535 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002572058 |
Protein GI | 222528176 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAAAA CACTTTACAT TACCTCAAAC GGAAGACTCA GAAGAAAAAA CAATACCTTG TACTTTGAGA CAGAGACTGA AAAAAGGTCA ATTGATATCG AAAACATTGA ACAAATTCAC ATCTTTGGTG AAGTCGATTT GAATACCAAG ACTTTGAATT ATATCTCTCA ATACGGCATA GTTCTTCACT TTTACAACTA TTACGGATTT TATTTGGGGA GTTTTCTGCC TCGAAAGAAA AACATTTCAG GAGATGTTGT TGTTCGGCAA GCACTTCATT ATCTTGATAG GGAAAAGAGA ATCTTCTTGG CATACTGTTT TGTCGAATCA GCGGTTTATC ATATGATGAG AAATTTAAGA GAAAGAAAAA AAACAGAGGC TTTTTTGAAT GCAATTGAAG ATGAATGGGA AAATGGCAGG TTTAATATTT CAAGCATATC AGAGCTTATG GGGCTTGAGG GAAGAGTGAG GAATATTTAC TATTCGTCTT TCAATCAGTT TTTGCCAGAA GACTTTTACA TGGAAAAGCG TGAAAAAAGA CCACCGACAA ATCCGATAAA TGCTTTGATT TCATTAGGAA ATAGCCTAAT TTATAGCACA GTTTTAACAG AGATTTACCA TACTCAGCTT GACCCAAGCA TAAGTTTTTT GCATGAACCA AGTGAAAAGA GGTTTTCATT AAGTCTTGAT ATATCTGAAA TTTTCAAACC TCTGATTGTG GATAGTGTAA TTTTCAAGCT TTTGAATAAT CATCAGCTTA CACTTGAACA CTTCGATGAG GATTTGAATT ATTGCTACTT GAATCAAGAT GGAAAAAAGA TTTTTATCAA TGAACTAAAA AACAAGCTTG AGACAACAGT TCGTCACAGA CAGCTAAATA GAAATGTTTC CTATAAGGGA TTTATAAGAC TTGAGTGTTA CAAGCTGATA AAACATTTTA TAGGCGATCA GGTTTACTCG CCACTTAAGG CGTGGTGGTA A
|
Protein sequence | MQKTLYITSN GRLRRKNNTL YFETETEKRS IDIENIEQIH IFGEVDLNTK TLNYISQYGI VLHFYNYYGF YLGSFLPRKK NISGDVVVRQ ALHYLDREKR IFLAYCFVES AVYHMMRNLR ERKKTEAFLN AIEDEWENGR FNISSISELM GLEGRVRNIY YSSFNQFLPE DFYMEKREKR PPTNPINALI SLGNSLIYST VLTEIYHTQL DPSISFLHEP SEKRFSLSLD ISEIFKPLIV DSVIFKLLNN HQLTLEHFDE DLNYCYLNQD GKKIFINELK NKLETTVRHR QLNRNVSYKG FIRLECYKLI KHFIGDQVYS PLKAWW
|
| |