Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1977 |
Symbol | |
ID | 5707880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2275965 |
End bp | 2276927 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641271482 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001536853 |
Protein GI | 159037600 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.147845 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCA GCGCCCAGCG GCGACTCGCC GCACCGACCC TGGCCATGCT GCCCCGCGTG GCGGACTCGC TTAGCTTCCT CTACGCCGAC ATCGTTCGGA TCGTCCAAGA CGACACCGGA GTCCTCGCCC AGGTCGATAC CCCGAAAGGG GCCGAACGCG TCTATCTGCC CACAGCCGCG CTCAGCTGCC TTCTCCTCGG ACCCGGCACC TCGATCACCC ACCACGCCCT GTCCACCCTC GCCCGCCACG GCACTACCGT TGTCTGCGTC GGCTCCGGCG TCGTCCGCTG CTACGCCGGC ATCACCCCCA CCTCTCTGAC CACCAACTGG CTGGAAAAGC AGGCCCGCTG CTGGGCCGAC GACAACACCC GACTGCACGT AGCGATACGC ATGTACGAGC AACGCTTCGG CGAAGCCGTC CCCGAAGGCA CCACGCTGGC CCAGCTCCGC GGTATGGAAG GCCAGCGTAT GAAAACGCTC TACCGCCTGT TGGCCCAGAA GTATCGAACC GGCAAATTCC GCCGCAACTA CGACCCCAAC AAGTGGGACA CCCAGGACCC GGTCAACCTT GCGCTATCAG CGGCCAGCGC CTGCCTATAC GGAGTGGTCC ACGCCGTCAT CCTCGCTTTG GGCTGCTCAC CGGCGCTCGG CTTCGTACAC AACGGCACCC AACACGCCTT CGTCTACGAC ATCGCCGACC TCTACAAGGC CAAGGTCACC GTGCCGCTCG CCTTCTCCAT GAGCACCTCC GCCCAACCGG AACGCGACGT ACGCCGCAAG CTGCGCGACG GGTTCCGCCT GCTCAAGCTG ATGCCGACGA TCGTTACCGA CATCCAACAT CTACTCGACC CCGACAGCAC ACCTAAACAG CGGCAACCCG CCGCCGAAAT CACCTCACTC TGGGATCCAG AGATGGGAGC CATGCCGTCC GGAGTCAACT ACAGCTCAGA CCCCTGGGAC TAG
|
Protein sequence | MSTSAQRRLA APTLAMLPRV ADSLSFLYAD IVRIVQDDTG VLAQVDTPKG AERVYLPTAA LSCLLLGPGT SITHHALSTL ARHGTTVVCV GSGVVRCYAG ITPTSLTTNW LEKQARCWAD DNTRLHVAIR MYEQRFGEAV PEGTTLAQLR GMEGQRMKTL YRLLAQKYRT GKFRRNYDPN KWDTQDPVNL ALSAASACLY GVVHAVILAL GCSPALGFVH NGTQHAFVYD IADLYKAKVT VPLAFSMSTS AQPERDVRRK LRDGFRLLKL MPTIVTDIQH LLDPDSTPKQ RQPAAEITSL WDPEMGAMPS GVNYSSDPWD
|
| |