Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2659 |
Symbol | |
ID | 7407023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2795649 |
End bp | 2796641 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643717025 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002574494 |
Protein GI | 222530612 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAAAG ACCTTTATGT TTTCAATTCT GGGTTTTTAA GAAGGAAAGA TAACACCATC ATGTTTGAAA CAGATGAAGG CAAAAAGTAT TTTCCGGTAG AAGAGATAGA ATCAGTTTTT ATATTTGGAG AGGTTGACAT AAATAAAAGA TTTTTGGAGT TCATGACAGA AAAGAATATA TGTGTCCATT TTTTTAACAG GTATGAGTAC TACGTTGGCA CATACTACCC GCGCGAACAT TACAACTCAG GCATTGTGAT ACTCAAGCAG GTAGAATTTT ACAACGACTA CAACAAAAGA ATGACCATTG CAAGGTCAAT TGTTGAAGGA GCTGTTCTAA ATATGCTTGT GGTTCTGAGG TACTACAATT CCCGTGGAAA TATGCTAAAA GATGAGATAG AGACAATCGA AAGAATGCGC CATAACATAA ACTCCTGTGA TGATGTAAAT ACGCTTATGG CTTTGGAGGG GAATATAAGA GAAATCTACT ACAGGTGTTT TAACAAGATA CTGGATAATG AAAATTTTAC ATTTGTCCGC AGGAGCAAAA ATCCGCCTCT TGACAGGATA AATGCTCTGA TTAGCTTTGG GAATTCTCTT TTGTATGCTA CAACTCTTGG AGAGATTTAC CAAACACAGC TTGACCCGCG CATAGGATAT CTGCATTCCA CAAACCAGCG CAAGTTTTCA TTGAACTTAG ATATTTCCGA AATCTTCAAA CCCATAATTG TTGACAGGGT AATTTTTTCT CTTGTTAACA AGAAAGTGCT GAGCGAAAAA CATTTTGAAA AAGAACTAAA CGGCATTATA TTGAACGACC AGGGCAAAAA ATTGTTTGTC TCTGAGTATA ACCAGAAGCT CTACTCTACT ATAATGCACC CGAAACTGAA TACTCAAGTA AGTTACAAAA GGCTCATCCG AATGGAAGCA TACAAGCTCC AAAAATTGTT TTTGGAAAAT ATAGAATACA AACCTTTTGT TGCAAGGTGG TAG
|
Protein sequence | MKKDLYVFNS GFLRRKDNTI MFETDEGKKY FPVEEIESVF IFGEVDINKR FLEFMTEKNI CVHFFNRYEY YVGTYYPREH YNSGIVILKQ VEFYNDYNKR MTIARSIVEG AVLNMLVVLR YYNSRGNMLK DEIETIERMR HNINSCDDVN TLMALEGNIR EIYYRCFNKI LDNENFTFVR RSKNPPLDRI NALISFGNSL LYATTLGEIY QTQLDPRIGY LHSTNQRKFS LNLDISEIFK PIIVDRVIFS LVNKKVLSEK HFEKELNGII LNDQGKKLFV SEYNQKLYST IMHPKLNTQV SYKRLIRMEA YKLQKLFLEN IEYKPFVARW
|
| |