Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3904 |
Symbol | |
ID | 8335257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4428420 |
End bp | 4429391 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644957030 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_003114633 |
Protein GI | 256393069 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.242948 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0868629 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGGAC CCGCCCAACG CGGCTCGTCA ACTCCCTACC AACTGCCCCG CATCGCCGAC CGGGTCAGCT TCGTCTATCT GGAACGGTGT ACCGTCCATC GCGACGCCAA CGCCATCACC GCCCAAGACG CCGACGGGAT CACCCACATC CCGTCGGCGA CCATCGGCAC CCTCCTGCTC GGCCCCGGAA CCCGCATCAC CCACCAAGCC ATGGCAGTAC TCGGCGACTG CGGAGCAAAC GTGGCATGGG TCGGCGAACA CGGCGCCCGC TTCTACGCAG CCGCACGCTC CCTCAACCGA TCATCCGCAC TCGTGGAAGC ACAAGCAACC CTCTGGGCGA ACCGCCGCAC ACGCCTGGAC ATCGCACGCG CCATGTACCG CATGCGATTC CCCGACGAGG ACCCCTCCGG CTTCATGCGC CAACAACTAC TCGGCATGGA AGGCCGCCGA CTCAAAGACT GCTACCGACA ACAGTCACAA CGCACCGGCG TGCCCTGGCA CGGCCGCCAA TACACACCGG GCAACTTCAA CGCCGGCGAC GCCATCAACC AAGCAATCAC CGCCGCCGCA CAATGCATGT ACGGCGTCGC CCACACCATC ATCACCGCGC TGGGTTGCTC TCCCGGACTC GGATTCATCC ACTCCGGGCA CGAACTGTCC TTCGTCATGG ACATCGCAGA CCTGTACAAG ACCGAGATCG GAATCCCCGT CGCCTTCGAC ACAGCCGCCG AAGACAGCAC CGACATCGGC CCACGAACCC GCCGCGCCCT ACGCGAGCAG ATCCGCACCA CGCGACTGCT CGAACGGTGC GTCGACGACG TCAAAGCGCT ACTCACCACA CCGAACAACG AACCCGGCAG CAGCGACGCC ACCGACACAG GCTTCTTCGA CCAGGTACAA CTCCAGTCCG ACCGCGAAAC CGAAGTCGAA GGCGGAAGAA ACTACGCCGA CGAACTCGGC ACAACCTGGT AA
|
Protein sequence | MAGPAQRGSS TPYQLPRIAD RVSFVYLERC TVHRDANAIT AQDADGITHI PSATIGTLLL GPGTRITHQA MAVLGDCGAN VAWVGEHGAR FYAAARSLNR SSALVEAQAT LWANRRTRLD IARAMYRMRF PDEDPSGFMR QQLLGMEGRR LKDCYRQQSQ RTGVPWHGRQ YTPGNFNAGD AINQAITAAA QCMYGVAHTI ITALGCSPGL GFIHSGHELS FVMDIADLYK TEIGIPVAFD TAAEDSTDIG PRTRRALREQ IRTTRLLERC VDDVKALLTT PNNEPGSSDA TDTGFFDQVQ LQSDRETEVE GGRNYADELG TTW
|
| |