Gene Information |
Locus tag | Strop_0999 |
Symbol | |
ID | 5057445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 1118388 |
End bp | 1119368 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640473269 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001157852 |
Protein GI | 145593555 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
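The coordinate and length fields in the Gene Information table are related by simple arithmetic. The Python sketch below shows one way to check them. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length counts the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(1118388, 1119368)
print(length, protein_length_aa(length))  # 981 326
```

The printed values match the Gene Length (981 bp) and Protein Length (326 aa) rows.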
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.614939 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAGTGCCA GCGGGCGCAG CTATTGGCTG ACCGAACCGT GCCGAATCCG ACGCGAAGAC AACAGCATCC GGATCGAACG CGCCGATGGA CAACCTGTTC GCATTCCGAT CACCGACATT CGCGACCTTG TGCTCTTCGA CAACGCCGAC ATCAACACCG CCGCGGTATC GCTACTCAGC CGGCACGGAG TCACCGTGCA CCTACTTGAC CACTACGGCA ACTATGCTGG CGCGCTGACT CCAGCCGACG ACATGTCCTC CGCACACGTC GTCCGCGCCC AGGTGGCCCT GACAGGCAAC CCTCAGGCCC GACTCGCTGT CGCGCAGGCC CTCGTCCGGG CGACCGCGGT CAACGTAGCC TGGGCCCTGG GCACGGACCT GCTCGATGGG CCACTCGAAC GACTTCCCGC CCAAATCGGT GCCAGCACCT CATCCGGAGA CCTGATGGGA GTCGAAGGTA ACTTCCGGCG AACCGCGTGG GGAGTGCTCG ATACCCTGCT ACCGCCCTGG CTCCGGCTTG ACGGACGCAC CCGTCGCCCA CCCAGTAATG CCGGCAACGC GTTCATCAGC TACCTCAATG CCATCACCTA CGCTCGGGTT CTCACCGCGA TTCGCTGTAC GCCGCTGCAC CCGGCGATCG GCTTCCTGCA CGCCGACACC GACCGGCGCC GAAACACCCT CGCCCTTGAC CTCGCCGAAC CGTTCAAGCC GCTGCTCGCC GAACGACTGC TCCGCCGAGC AGCCGCGCAG CGAACCCTGA CCGCTGCAGA CTTCGTCAGC GACGTCCGTA GCGCGTCCCT CAGCCAGGCC GGACGGAAAA AGATTGCTGT CATGGTCCGC GAAGAACTGG CCACCACCGT CCAGCATCGG CAACTCCGGC GAAAGGTGTC CTACGAGGAG TTGATCCACC TGGAGGCCCT CAAGCTCGTA CGACTATGCC TCGAAGGCAC GACCTACAAG CCCTTCCGGC CCTGGTGGTA G
|
Protein sequence | MSASGRSYWL TEPCRIRRED NSIRIERADG QPVRIPITDI RDLVLFDNAD INTAAVSLLS RHGVTVHLLD HYGNYAGALT PADDMSSAHV VRAQVALTGN PQARLAVAQA LVRATAVNVA WALGTDLLDG PLERLPAQIG ASTSSGDLMG VEGNFRRTAW GVLDTLLPPW LRLDGRTRRP PSNAGNAFIS YLNAITYARV LTAIRCTPLH PAIGFLHADT DRRRNTLALD LAEPFKPLLA ERLLRRAAAQ RTLTAADFVS DVRSASLSQA GRKKIAVMVR EELATTVQHR QLRRKVSYEE LIHLEALKLV RLCLEGTTYK PFRPWW
|
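The sequence fields are printed in space-separated blocks of 10 letters. The Python sketch below shows one way to strip that blocking, compute a GC fraction, and check that a coding sequence begins and ends with codons permitted by translation table 11. The codon sets are the initiation and stop codons of NCBI translation table 11 as commonly listed, and the function names are hypothetical. The example call uses only the first two blocks of the gene sequence, so the printed fraction describes that 20-base prefix and not the whole gene.

```python
# Initiation and stop codons of NCBI translation table 11
# (assumed here from the standard NCBI listing, not from this record).
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE_11_STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that splits a sequence into 10-letter blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_valid_ends(cds: str) -> bool:
    """True if the CDS is a whole number of codons with a table 11 start and stop."""
    return (
        len(cds) >= 6
        and len(cds) % 3 == 0
        and cds[:3] in TABLE_11_STARTS
        and cds[-3:] in TABLE_11_STOPS
    )


# First two blocks of the Gene sequence field, for illustration only.
prefix = clean_sequence("GTGAGTGCCA GCGGGCGCAG")
print(len(prefix), gc_fraction(prefix))  # 20 0.75
```

Passing the full Gene sequence value through `clean_sequence` and `gc_fraction` gives a figure that can be compared with the GC content row. The gene sequence in this record starts with GTG and ends with TAG, both of which are in the sets above.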