Gene Information |
Locus tag | Aaci_2651 |
Symbol | |
ID | 8426192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alicyclobacillus acidocaldarius subsp. acidocaldarius DSM 446 |
Kingdom | Bacteria |
Replicon accession | NC_013205 |
Strand | - |
Start bp | 2731195 |
End bp | 2732235 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 645028778 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_003186044 |
Protein GI | 258512610 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
|
|
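The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the convention under which 2731195..2732235 spans 1041 bp) and that a gene on the "-" strand is read as the reverse complement of the replicon slice. The helper names `span_length`, `reverse_complement`, and `extract_gene` are hypothetical and are not part of any database API.

```python
# Hypothetical helpers; coordinates are ASSUMED to be 1-based and inclusive.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def span_length(start, end):
    """Number of bases covered by a 1-based, inclusive interval."""
    return end - start + 1


def reverse_complement(seq):
    """Complement every base, then reverse the string."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_gene(replicon, start, end, strand):
    """Slice a gene out of a replicon string; flip it for the minus strand."""
    piece = replicon[start - 1:end]  # convert 1-based inclusive to Python slice
    return reverse_complement(piece) if strand == "-" else piece


# Values taken from the record above.
gene_len = span_length(2731195, 2732235)  # 1041 bp
codon_count = gene_len // 3               # 347 codons, including the stop codon
protein_len = codon_count - 1             # 346 aa once the stop codon is dropped

# Toy replicon (not real data) showing minus-strand extraction.
toy_gene = extract_gene("GGTTAGGGCATCC", 3, 11, "-")  # "ATGCCCTAA"
```

The one-codon difference between 1041 / 3 = 347 and the listed 346 aa is accounted for by the stop codon, which is part of the gene but not of the protein.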
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000206162 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCGCCCG TAATCCTGAA AACGCTTTTC GTCCAGCGCG AAGGTGCCAT CGTCCGCGTG CATCAGGACA CGGTAGTCGT CACGTTGGAG AATGAAACGC TTCTACGCGT GCCCATGCAC ATGGTGGATT CCATCGTCGG CATCGGCCGC GTGTCGTTCA CGAGTCCGCT CCTGGAGCGT TGCGCCGCCG AGGGGCGGTC CGTCGTTCGC ATGACGCGGG GAGGACGTTT CTTGTATCGC ATTGAGGGTC GGATGTCGGG CAATGTGCTC CTGCGCACCG CGCAGCATGA GGCGGCCCGC TCTCCCGAAC GATCGTTAAC GATCATGCGC GCGATTGTCG CGGGTAAAGT CCACAACCAG CGTCAACTCG TGCTAAAGGC CGCGCGCGAT CTCACTGCTC CTGCAGACCG TTCTTTCGTC CGCGAGGTGG CGGGCGATCT CGGACGTGAG CTTCGCAAGC TCCCGTCGGC CTCACATCCC GACGAAATCC GCGGTGTCGA GGGTGCGAGC GCACGCCGGT ACTTCATGGC ATTGCGACAT CTGATTGCTC CGGCCATTCG CGACGCACTG TCGTTTGACG GCCGCAATCG ACGGCCTCCG CGTGATCCCG TCAATGCTGT GTTGTCCTTC CTCTACGCGC TCATCACGCG AGACGCAGAG AGCGCGCTGC TTGGCGTGGG ACTCGATCCG CAGATTGGCT TTCTCCACAC GCTGCGCCCG GGCCGCCCGT CGCTCGCGCT CGATCTGGTT GAGGAAATGC GCCCAATTTT GGCCGATCGC GTGATGCTGT CGCTCTTCAA TCGCCGCCAA CTGCAGCCTT CCGATTTCGA GGTTCTGCCA GGAGGGGCGG TGGAACTCAC CGATTCGGGA AGGAGGACCC TTTTCGCCGA GTGGGACAGG CGCAAGCAGG TCGAGATCGA GCATCCCCTA CTGAAACAGC CGGTCGCGTA CGGACGGCTG CTTGACGTGC AGGCGAGGCT ACTCGCTCGC GCCATCCGCT CACCAGCACT CGGCTACACG CCGTTTCTCT ACCGAGGCTG A
|
Protein sequence | MPPVILKTLF VQREGAIVRV HQDTVVVTLE NETLLRVPMH MVDSIVGIGR VSFTSPLLER CAAEGRSVVR MTRGGRFLYR IEGRMSGNVL LRTAQHEAAR SPERSLTIMR AIVAGKVHNQ RQLVLKAARD LTAPADRSFV REVAGDLGRE LRKLPSASHP DEIRGVEGAS ARRYFMALRH LIAPAIRDAL SFDGRNRRPP RDPVNAVLSF LYALITRDAE SALLGVGLDP QIGFLHTLRP GRPSLALDLV EEMRPILADR VMLSLFNRRQ LQPSDFEVLP GGAVELTDSG RRTLFAEWDR RKQVEIEHPL LKQPVAYGRL LDVQARLLAR AIRSPALGYT PFLYRG
|
| |
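The gene sequence starts with GTG while the protein starts with M, which is expected under translation table 11: alternative start codons such as GTG are read as methionine when they initiate translation. The following is a minimal sketch of that translation and of a GC-fraction calculation, not the pipeline that produced this record. The names `translate_cds` and `gc_fraction` are hypothetical, and `START_CODONS` lists only a subset of the start codons that table 11 permits.

```python
from itertools import product

# Codon-to-amino-acid mapping in the usual T, C, A, G ordering.
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# ASSUMPTION: a subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a coding sequence; whitespace in the input is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":            # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in a sequence; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
head = "GTGCCGCCCG TAATCCTGAA AACGCTTTTC"
print(translate_cds(head))  # MPPVILKTLF, the first ten residues of the protein
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` gives a way to compare the result against the listed protein sequence and the 63% GC content. The space-separated 10-base blocks can be pasted as they are, since whitespace is stripped before processing.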