Gene PCC7424_5519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC7424_5519 
Symbol 
ID7112734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 7424 
KingdomBacteria 
Replicon accessionNC_011737 
Strand
Start bp172956 
End bp174632 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content46% 
IMG OID643483897 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_002380906 
Protein GI218442585 
COG category[L] Replication, recombination and repair 
COG ID[COG1468] RecB family exonuclease
[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR00372] CRISPR-associated protein Cas4 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.28105 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACTA CTGATCTAGA CCTAGAAACT TTGGCAGAAC CGATGGGGGA TACCATGAGG 
GTTTCGGCTC TCCATGCCTT TGCTTACTGC CCTCGTCTTT TTTACCTTGA GGAAGTAGAG
GAACTCTATA CTCAAGATGC GGCGGTCTTT GCCGGTCGCC GGCTTCATGA AGAGATCGAT
AAAAAAGAAG ATGAGGAATG GGAGGATCTC TACCTCGAAG ATGAGCATTT GGGCTTAAGG
GGACGGGTGG ACGCTTTGCG GACTCGTGAC GGCCAAACCA TCCCCTACGA GCATAAACGG
GGGCGCTGCT ACCGAGATGA AAAGAAGCAG CCTCAAGCTT GGGACAGCGA TCGCCTTCAA
ATTTTAGCTT ATTGTTGTCT TATTGAGGCA GCATTAGGAA TTACTGTCCC AGAGGGAAGA
ATTCGCTATC ATGCGGATAA CGTGCTAGTT CATGTCCCCT TCGATGAGAC AGGAAGACAA
TGGGTAAAAG ACTCTATCCG GCAAGCACGA GAGTTAAGAA AATCCCCTTA TCGTCCTGCG
GTGATTGACA ATGAACACTT GTGTTCTCGT TGTTCTTTGT CTCCTGTCTG TTTACCGGAA
GAGGCTCGGT TAGCTCATAA TAAAGAGTGG CATCCCGTCC GGTTATTTCC TGTTGATGAT
GAACGAGAAG TCATTCACGT TCTCGAACCC GGAACGAGAG TAGGACGCAC TGGGGAACAA
CTGAAAATAA GTCGCCCTAA TCAACCCGAT GAGAAAATAG CGATTGGGCA AGTCTCTCAG
GTGGTTCTCC ATAGTTTTTC CCAGATTTCT ACTCAAGCCG TTCATTTCCT GGCTTATAAA
GAAGTTGGGA TTCACTTTGT TTCTGGGGGA GGGCGCTATA TAGGAAGTAT TGATGCTCGC
TCCCGGAGTA TTCAACGGCG TGTCCGCCAG TATCAAGCGT TAAGTCAACC GGATTTTTGT
CTAGAACTAG CCCGAAAGTT GGTTGCTTGT AGGGGAGAGG GGCAGCGTAA GTTTTTAATG
CGGGGAAAAC GTAATAAAAA AGGTGATTCT CTTGCATTAG AGAAAACGAT CGCCCAAATG
AAAGCGGTAC TCAAGCAAGT ACCACAAATT CAGTCCCTTG ATTCATTATT GGGAATTGAG
GGCAATTTAG CTGCTCTTTA TTTTGGAGCT TTATCTAACC TATTGGCTGA AAATGCTCCA
GAATCACTCT TATTTTCGGG TCGTAATCGT CGCCCTCCTA AAGATCGCTT TAATGCACTG
TTGAGCTTTG GTTATTCTCT GCTCATCAAA GATGTAATGA ATGCTATTCT TGCTGTTGGG
TTAGAGCCAG CATTAGGATT CTATCATCAG CCGAGAACCC AAGCTCCTCC TCTGGCCTTG
GACTTAATGG AAATTTTCCG CGTTCCTTTG GTAGATATGC CCGTTGTCAC TTCTATCAAC
CGAAGTCAGT GGGATATACA AGCGGACTTT GATGTACGGG GGCAGCAAGT TTGGCTTAGT
GACAGTGGGC GACGCAAATT CATCAATCTT TACGAGCAAC GTAAAGCAGA AACCTGGAAA
CACCCAGTTA CGGGCTATTC CTTGACCTAT CGCCGTTTAT TGGAGCTAGA AGTCCGATTA
CTGGAAAAAG AATGGTCAGG AGAAGGCGGA TTATTTGGTC AGTTGATTGT ACGGTGA
 
Protein sequence
MQTTDLDLET LAEPMGDTMR VSALHAFAYC PRLFYLEEVE ELYTQDAAVF AGRRLHEEID 
KKEDEEWEDL YLEDEHLGLR GRVDALRTRD GQTIPYEHKR GRCYRDEKKQ PQAWDSDRLQ
ILAYCCLIEA ALGITVPEGR IRYHADNVLV HVPFDETGRQ WVKDSIRQAR ELRKSPYRPA
VIDNEHLCSR CSLSPVCLPE EARLAHNKEW HPVRLFPVDD EREVIHVLEP GTRVGRTGEQ
LKISRPNQPD EKIAIGQVSQ VVLHSFSQIS TQAVHFLAYK EVGIHFVSGG GRYIGSIDAR
SRSIQRRVRQ YQALSQPDFC LELARKLVAC RGEGQRKFLM RGKRNKKGDS LALEKTIAQM
KAVLKQVPQI QSLDSLLGIE GNLAALYFGA LSNLLAENAP ESLLFSGRNR RPPKDRFNAL
LSFGYSLLIK DVMNAILAVG LEPALGFYHQ PRTQAPPLAL DLMEIFRVPL VDMPVVTSIN
RSQWDIQADF DVRGQQVWLS DSGRRKFINL YEQRKAETWK HPVTGYSLTY RRLLELEVRL
LEKEWSGEGG LFGQLIVR