Gene Information |
Locus tag | Sare_2578 |
Symbol | |
ID | 5706150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2934586 |
End bp | 2935554 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641272041 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001537411 |
Protein GI | 159038158 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
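
The coordinate and length fields in this record agree with one another, and a short sketch can show how. It assumes the start and end positions are 1-based and inclusive (which the 969 bp figure suggests) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2934586, 2935554)
print(length, protein_length(length))
```

With the values from this record, the two results match the Gene Length and Protein Length fields.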
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0729404 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTCC CTGGTGTCCC ACCGGCCGAA CTGCCGGATC TCAACCGTGC GCAGGACCGG ATCAGCTTCG TCTACCTCGA ACGCTGCGTC ATCCACCGAG ACAGCAATGC GATCACCGCC ACCGACGAGA AGGGGATCGT GCACATTCCG GCGGCCACCC TCGGTGTCCT GATGCTCGGG CCGGGCACCA GCATCACGCA GCAGGCGATG ATGCTCATCG CCGACAACGG GGCCACCGTC GTCTGGATCG GAGAGCATGG CGTCCGGTAC TACGCGCACG GCCGTCCCCT CGCCCGGTCG AGCCGTCTCC TCGTCGCCCA AGCCGCCGCC GTGTCCCACC GCGACCGACG GCTGCGGGTC GCACGAGCGA TGTACCGCAT GCGATTTCCC GGCGAGGACA CCACCAACCT CACCATGCAA CAGCTGCGCG GCAAGGAGGG TGCCAGGGTG CGCCGCTGCT ACCGGGAGAA CGCCCAGCGA ACCGGCGTCT CATGGAACAG CCGCGAGTAC GATCCGGACG ACTTCACCGG CAGCGACCCG GTCAACCAGG CGCTGTCAGC GGCCCACGCC TGCCTCTACG GAATCGTGCA CGCGGTTGTC GTCGCCGTCG GTGCCTCACC CGGACTCGGT TTCGTCCACA CCGGCCACGA TCGGTCATTC GTCTACGACA TCGCCGACCT CTACAAGGCC GACGTCACCA TCCCGGTCGC CTTCGACATC GCCGCCGCAG AGTCCACCGA CATCGGCGCC GACACACGCC GGGCCGTCCG CGACCGGGTG CACAATGGTG CACTCCTCGG CCGCTGTGTG CAAGACATCC GACGTCTGCT GCTCACCGAC AGTGCTGCCG GGCCGATCAA CGAGGAGGAG TTCGACGAGG AAGCCGACAA CGACGCCGTA CGTCTCTGGG ACGAAGGCGG TCTCGAGTTG GCCGGCGGCC GGAACTACGG CGGAGACGTG GACTTCTGA
|
Protein sequence | MKVPGVPPAE LPDLNRAQDR ISFVYLERCV IHRDSNAITA TDEKGIVHIP AATLGVLMLG PGTSITQQAM MLIADNGATV VWIGEHGVRY YAHGRPLARS SRLLVAQAAA VSHRDRRLRV ARAMYRMRFP GEDTTNLTMQ QLRGKEGARV RRCYRENAQR TGVSWNSREY DPDDFTGSDP VNQALSAAHA CLYGIVHAVV VAVGASPGLG FVHTGHDRSF VYDIADLYKA DVTIPVAFDI AAAESTDIGA DTRRAVRDRV HNGALLGRCV QDIRRLLLTD SAAGPINEEE FDEEADNDAV RLWDEGGLEL AGGRNYGGDV DF
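
The sequence fields are shown in space-separated blocks of ten residues, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup and of a GC percentage calculation. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(field: str) -> str:
    """Join the space-separated blocks of a sequence field into one string."""
    return "".join(field.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First two blocks of the gene sequence above, used only as a small example.
example = clean_sequence("ATGAAGGTCC CTGGTGTCCC")
print(len(example), gc_percent(example))
```

Applying the same two functions to the full Gene sequence field would allow a comparison against the Gene Length and GC content fields reported in the record.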