Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CCV52592_1287 |
Symbol | cas |
ID | 5406787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Campylobacter curvus 525.92 |
Kingdom | Bacteria |
Replicon accession | NC_009715 |
Strand | - |
Start bp | 879066 |
End bp | 880064 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640872337 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_001408159 |
Protein GI | 154174048 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0188131 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAAAA GCGACAGGAC GCACTTTATA TTAAGCAGCG GGCGGCTTAG GCGACAGGAT AACAACATCT ATTTTGATAA ATTTGACGAG ACAGGCGGCG TGACGGCGAG TAAAATTTTA CCGATCAACG CCATAGATGA AATTTACATC CTCACCCGTG TGGAGCTCGA CACATACACG CTCGCGTTTT TGGCGGATAA TAACATACTT TTGCACGTTT TTAGCCCGTT TCAGAGCTTT CGGGGGAATT TTTACCCGAG CACCTCAAAC TCGGTCAATA AAAGCGGCTT CGCGCTGCTG TCTCAACTGC GGGCGTTTGA CGACCCCGTA AAGCGCGTCT ATATCGCTCG CGAAATCACC CGCGCGCACA TGCTAAACGA CGCGGCAAAC TGCAAAAAGC ACGGCGTGAA ATTTGACCCC GCGCCGCACA TCGCAGCACT TGACGCCGCC GCAGACGTAG GACAGATAAT GGCGGCGGAA GGGGCATTTC AAAAGCTTTA TTATGAAAAA TGGAACGAAA TTATCGCTGA TCAGCGAAGC TTTAAATTTA CCGTCCGCTC CAAGCGCCCG CCCGCCGATA AGATAAATAG CTTCATCAGC TACGTAAATA CGCGCATTTA TAACGTCTGC CTGAGCGAAA TCTACAAGAC CGAGCTTGAT CCGCGCATCG GCTTTTTGCA CGAGCCAAAC TACCGCGCGC TTAGCCTGCA CCTTGATCTA GCAGAGATAT TTAAGCCGAT TTTGGGCGAT ACGCTGATAT TTGCGATGCT AAATAAAAAG GAGATCACGG CAAAGGACTT TCAAACGGAC GCCGGACGGA TAAAATTTAG TAATGACGCC ATCCAAAAGA TCGAGATGAA GATGATCTCT CGCCTAAGCG AAACGATCGC GCTAAACGGG CAAAACCTCA CGTGGCGGCA AGTCATCAGA CGCGAGGCAA ACCAGCTCAA AAAATGTATC TGTGAGGATA TGCCTTACGT GGGGTTTGTG TGGGGATGA
|
Protein sequence | MQKSDRTHFI LSSGRLRRQD NNIYFDKFDE TGGVTASKIL PINAIDEIYI LTRVELDTYT LAFLADNNIL LHVFSPFQSF RGNFYPSTSN SVNKSGFALL SQLRAFDDPV KRVYIAREIT RAHMLNDAAN CKKHGVKFDP APHIAALDAA ADVGQIMAAE GAFQKLYYEK WNEIIADQRS FKFTVRSKRP PADKINSFIS YVNTRIYNVC LSEIYKTELD PRIGFLHEPN YRALSLHLDL AEIFKPILGD TLIFAMLNKK EITAKDFQTD AGRIKFSNDA IQKIEMKMIS RLSETIALNG QNLTWRQVIR REANQLKKCI CEDMPYVGFV WG
|
| |