Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_3451 |
Symbol | |
ID | 7269676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 4193268 |
End bp | 4194287 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643568261 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002464729 |
Protein GI | 219850296 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.124605 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.143241 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACAC TCTACGTGAT TGAACAAGGC GCTGAAATCG GTTGCGATGG CGAACGGATC GAAGTGCGCC GGGGGGCCGA CATTATCGGC AGCGTACCGT TGGTCAAACT CGACGACATC GTCATCTTTG GCAACGTCGG CATCAGTACG CCGGCGATGA AACGACTGCT CGACCGCGGC ATTGAAGTTA CCTTCATGAC GGTTGATGGG CGTTATCAGG GTCGCCTCAT TGGGCAGGTC ACGGCGCATG TCGCCCTTCG CCATGCGCAA TACGCCTGTG CCGCCGATCC GGCGCGGGCG CTTGCCCTTG CCCAACGGTT CGTCGAGGGC AAACTACGCA ACCAACGCGC ACTACTACAA CGGTTCAGCC GCAACCGGGC CGAACCACCA CCGGAAGCGC AGGCGGCGGC AGACGATCTT GAGGCCTATA TCAAGCGGGT AAAACGCACA ACGCAACTCA GCTCACTACT GGGGGTAGAA GGCAGTGCAA CCGCACGCTA CTTTGCCGGT CTACGCAGCC TGATCGGGCC GGAATGGTCA TTCAGCGGAC GCCAACGACG CCCACCACCC GACCCGGTCA ATCTGCTCCT CTCGCTCGGC TACACCCTCT TGGCGCACAA AGTGCTAGGG GCGGTACAGG CAGCCGGGTT CGACCCATAT CTCGGCTTCT TACACAGCCT TGACTATGGG CGGCCTTCGC TTGCGCTCGA CATAATGGAA GAGTTTCGTC CAATACTCAT CGACTCGCTG GTCGTGCGCA TCTGCAACGA CGGGCGCATC CGACCCGAAC ACTTCCGGCC GGGTGAAGGT GAGCGACCGA TCATCATCAC CGACGAGGGC AAACGGGCAT TTCTCACCGC GTTTGAAGAA CGCATGCGAA CCGAAGCCAC CCATCCCGAA GGCGCGGACA GTGGGCCGGG CAAAGTACCG TACACGCGCT GCATCGCGTT ACAGGCCAGA CGACTAGCGC GGGTGGTGCG TCAACGCACC GACGACTACG AGCCATTTGC CGTTCGATAA
|
Protein sequence | MATLYVIEQG AEIGCDGERI EVRRGADIIG SVPLVKLDDI VIFGNVGIST PAMKRLLDRG IEVTFMTVDG RYQGRLIGQV TAHVALRHAQ YACAADPARA LALAQRFVEG KLRNQRALLQ RFSRNRAEPP PEAQAAADDL EAYIKRVKRT TQLSSLLGVE GSATARYFAG LRSLIGPEWS FSGRQRRPPP DPVNLLLSLG YTLLAHKVLG AVQAAGFDPY LGFLHSLDYG RPSLALDIME EFRPILIDSL VVRICNDGRI RPEHFRPGEG ERPIIITDEG KRAFLTAFEE RMRTEATHPE GADSGPGKVP YTRCIALQAR RLARVVRQRT DDYEPFAVR
|
| |