Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC7424_5519 |
Symbol | |
ID | 7112734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7424 |
Kingdom | Bacteria |
Replicon accession | NC_011737 |
Strand | - |
Start bp | 172956 |
End bp | 174632 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 643483897 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002380906 |
Protein GI | 218442585 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1468] RecB family exonuclease [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR00372] CRISPR-associated protein Cas4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.28105 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACTA CTGATCTAGA CCTAGAAACT TTGGCAGAAC CGATGGGGGA TACCATGAGG GTTTCGGCTC TCCATGCCTT TGCTTACTGC CCTCGTCTTT TTTACCTTGA GGAAGTAGAG GAACTCTATA CTCAAGATGC GGCGGTCTTT GCCGGTCGCC GGCTTCATGA AGAGATCGAT AAAAAAGAAG ATGAGGAATG GGAGGATCTC TACCTCGAAG ATGAGCATTT GGGCTTAAGG GGACGGGTGG ACGCTTTGCG GACTCGTGAC GGCCAAACCA TCCCCTACGA GCATAAACGG GGGCGCTGCT ACCGAGATGA AAAGAAGCAG CCTCAAGCTT GGGACAGCGA TCGCCTTCAA ATTTTAGCTT ATTGTTGTCT TATTGAGGCA GCATTAGGAA TTACTGTCCC AGAGGGAAGA ATTCGCTATC ATGCGGATAA CGTGCTAGTT CATGTCCCCT TCGATGAGAC AGGAAGACAA TGGGTAAAAG ACTCTATCCG GCAAGCACGA GAGTTAAGAA AATCCCCTTA TCGTCCTGCG GTGATTGACA ATGAACACTT GTGTTCTCGT TGTTCTTTGT CTCCTGTCTG TTTACCGGAA GAGGCTCGGT TAGCTCATAA TAAAGAGTGG CATCCCGTCC GGTTATTTCC TGTTGATGAT GAACGAGAAG TCATTCACGT TCTCGAACCC GGAACGAGAG TAGGACGCAC TGGGGAACAA CTGAAAATAA GTCGCCCTAA TCAACCCGAT GAGAAAATAG CGATTGGGCA AGTCTCTCAG GTGGTTCTCC ATAGTTTTTC CCAGATTTCT ACTCAAGCCG TTCATTTCCT GGCTTATAAA GAAGTTGGGA TTCACTTTGT TTCTGGGGGA GGGCGCTATA TAGGAAGTAT TGATGCTCGC TCCCGGAGTA TTCAACGGCG TGTCCGCCAG TATCAAGCGT TAAGTCAACC GGATTTTTGT CTAGAACTAG CCCGAAAGTT GGTTGCTTGT AGGGGAGAGG GGCAGCGTAA GTTTTTAATG CGGGGAAAAC GTAATAAAAA AGGTGATTCT CTTGCATTAG AGAAAACGAT CGCCCAAATG AAAGCGGTAC TCAAGCAAGT ACCACAAATT CAGTCCCTTG ATTCATTATT GGGAATTGAG GGCAATTTAG CTGCTCTTTA TTTTGGAGCT TTATCTAACC TATTGGCTGA AAATGCTCCA GAATCACTCT TATTTTCGGG TCGTAATCGT CGCCCTCCTA AAGATCGCTT TAATGCACTG TTGAGCTTTG GTTATTCTCT GCTCATCAAA GATGTAATGA ATGCTATTCT TGCTGTTGGG TTAGAGCCAG CATTAGGATT CTATCATCAG CCGAGAACCC AAGCTCCTCC TCTGGCCTTG GACTTAATGG AAATTTTCCG CGTTCCTTTG GTAGATATGC CCGTTGTCAC TTCTATCAAC CGAAGTCAGT GGGATATACA AGCGGACTTT GATGTACGGG GGCAGCAAGT TTGGCTTAGT GACAGTGGGC GACGCAAATT CATCAATCTT TACGAGCAAC GTAAAGCAGA AACCTGGAAA CACCCAGTTA CGGGCTATTC CTTGACCTAT CGCCGTTTAT TGGAGCTAGA AGTCCGATTA CTGGAAAAAG AATGGTCAGG AGAAGGCGGA TTATTTGGTC AGTTGATTGT ACGGTGA
|
Protein sequence | MQTTDLDLET LAEPMGDTMR VSALHAFAYC PRLFYLEEVE ELYTQDAAVF AGRRLHEEID KKEDEEWEDL YLEDEHLGLR GRVDALRTRD GQTIPYEHKR GRCYRDEKKQ PQAWDSDRLQ ILAYCCLIEA ALGITVPEGR IRYHADNVLV HVPFDETGRQ WVKDSIRQAR ELRKSPYRPA VIDNEHLCSR CSLSPVCLPE EARLAHNKEW HPVRLFPVDD EREVIHVLEP GTRVGRTGEQ LKISRPNQPD EKIAIGQVSQ VVLHSFSQIS TQAVHFLAYK EVGIHFVSGG GRYIGSIDAR SRSIQRRVRQ YQALSQPDFC LELARKLVAC RGEGQRKFLM RGKRNKKGDS LALEKTIAQM KAVLKQVPQI QSLDSLLGIE GNLAALYFGA LSNLLAENAP ESLLFSGRNR RPPKDRFNAL LSFGYSLLIK DVMNAILAVG LEPALGFYHQ PRTQAPPLAL DLMEIFRVPL VDMPVVTSIN RSQWDIQADF DVRGQQVWLS DSGRRKFINL YEQRKAETWK HPVTGYSLTY RRLLELEVRL LEKEWSGEGG LFGQLIVR
|
| |