Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_0567 |
Symbol | |
ID | 7267800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 701815 |
End bp | 702828 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643565430 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002461942 |
Protein GI | 219847509 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.424472 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.165251 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACTCA TAGTAGATGA ACGCGGTAGT TTCATTGGCA AGCATCAAGG TCGGTTGCGG GTAACCAAAG ATAACGAGCG GTTACGCGAA GTACCGATCA TGCATCTCCG GCAAGTGATC ATCTGTGGTA GTGGGGTTGC CATCAGCAGC GACGCCGTGC GAGCGTGCAG TGAAGAAGGC ATCCCAATCC ACTTCATCAG CACCAACGGC ACCCCACAGG CCAGCCTCTA CAGCGCCGGC CTCACCGGCA CCGTTCTTAC ACGGCGCGCC CAATTACGCG CCTACGACGG GCCGGCCGGC GTCACCCTTG CCCGTGCGTT TACGCTGGGT AAGCTTGGCA ACCAAGCCAA CCTACTGCGC TACGCGGCAA AAAATCGTAA GGAGACGGCG CCTGACATAT ACGAACAACT GATGACCGCA GCCGGTGAAG TGGTTGACTA CCAAATCGCA GTCGAACGGC TCAAAGGCGA AACGGTTGAC GAGATTCGTG ACGAATTGAT GGGGATCGAA GGCCGCTACG CAGCGCGCTA CTGGAAAGCC ATCGGCGCGT TGGTCCCGTC TGAACTCAAT TGGCCGGGGC GCGAGACACG TGGGGCAACC GATCCGTTCA ATCAAGTACT CAACTATGGT TACGGTGTAT TGTATGGTCA AGTCGAACAC GCCATCGTGC TTGCCGGTCT CGATCCGTAT GCCGGCCTAC TCCATGCCGA TCGACCGGGC AAACCGAGTC TGGTTTTAGA TTTAATCGAA GAGTTTCGTC AAGCTGTGGT TGATCGACCG CTGCTCGGCC AACTCACCCG TGGTTGGCAA ATTGGGCGGG AAGAAGATGG TCGGCTTGAT CAACCTACAC GTGAGCGCAT TGTGACCAAA GTGCTCGAAC GGCTTGAATC AACCGAACCG TATGAGGGGA AGCGGCAACC GTTACGTCAC ATTCTCCAGT GTCAAGCACG CCACATTGCT ACCTTCGTGC GTGGTGAGCG TGAAAACTAC ACCCCATTCG TGATGGGCTG GTGA
|
Protein sequence | MELIVDERGS FIGKHQGRLR VTKDNERLRE VPIMHLRQVI ICGSGVAISS DAVRACSEEG IPIHFISTNG TPQASLYSAG LTGTVLTRRA QLRAYDGPAG VTLARAFTLG KLGNQANLLR YAAKNRKETA PDIYEQLMTA AGEVVDYQIA VERLKGETVD EIRDELMGIE GRYAARYWKA IGALVPSELN WPGRETRGAT DPFNQVLNYG YGVLYGQVEH AIVLAGLDPY AGLLHADRPG KPSLVLDLIE EFRQAVVDRP LLGQLTRGWQ IGREEDGRLD QPTRERIVTK VLERLESTEP YEGKRQPLRH ILQCQARHIA TFVRGERENY TPFVMGW
|
| |