Gene Cyan7425_5340 details


Gene Information

Locus tag: Cyan7425_5340
Symbol:
ID: 7280301
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 7425
Kingdom: Bacteria
Replicon accession: NC_011880
Strand:
Start bp: 46774
End bp: 47763
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 52%
IMG OID: 643580485
Product: CRISPR-associated protein Cas1
Protein accession: YP_002478298
Protein GI: 219883136
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
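
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 46774
end_bp = 47763

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon,
# which contributes no amino acid to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the results agree with the listed Gene Length (990 bp) and Protein Length (329 aa).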


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.52671
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGATCGC TCTACATCAG TCAGCAGGGT TGCTATGTTT CTCTGCGCCA GGAAATGCTA 
CTGGTCAAAC AGGGAACCAC AGTGCTGGAA TCGGTACAAC TTCCCCTCCT GGATCAAATT
CTCGTTTTTG GTACAGCTCA GCTCACCACC CAAGCCATGC GGGCTTGCCT CAGCCGTCAT
ATTCCGATCG CTTTCCTATC GCGGATGGGT TACTGCTATG GTCGCTTACT GCCTCTGGAG
CGGGGTTATC GTCGCTTAGC CCGTTTTCAA CAAATGCTGC ACACGGGTGA ACGGCTGGTG
ATTGCCCGAC AGATGGTCTG GGGAAAGCTA AAAAATTCGC GGGTGCTGCT AATGCGCCAA
CAACAACGAC GCGACTCTCC TGATCTGACT ACAGCCATAC AGGCCCTGGA CCATTTTGCT
GATAATGTTC GCCTGGCCAA TTCGCACGAA CAACTCCTGG GGTTGGAAGG GGCAGGGGCA
GCCACCTATT TCAAGGCCCT GGGAAGCTGC ATCCTGCGGG AGGGGTTTAC GTTTGCAGGC
AGAACCCGCC GGCCACCCAC CAATCCGGTC AATGCTCTCC TCAGTTTTGG CTATCAGGTA
CTCTGGAATC ACTTGCTCGC CTTGATCGAA TTACAGGATC TGGATCCCTA TGAGTCCTGT
TTGCACCAGA GTGGCGATCG ACATCCTGCC CTCGCTTCTG ACCTGTTGGA AGAATTCCGT
GCCCCCATTG TTGACTCTCT CACCTTGTAT TTGGTCAACC GTGGAATTGT GGATGCGGAT
AAGGATTTTG AATACCGGGA TGGCGGTTGT TTGCTAAATA ATAGCGGTCG CAAGAAGTAC
CTTTCAGCTT TTTTACAACG GATGGAAGAG CAGTTACATA CTGCTAATGG GCTACAACCC
CGCTGGGAAC TCCTCACTCA ACAGGTGCGT GCTTATAAGG CTTTTGTTTA TGCACCTGCC
CACGGTTATC AACCTTATCT AACACGCTAG
 
Protein sequence
MRSLYISQQG CYVSLRQEML LVKQGTTVLE SVQLPLLDQI LVFGTAQLTT QAMRACLSRH 
IPIAFLSRMG YCYGRLLPLE RGYRRLARFQ QMLHTGERLV IARQMVWGKL KNSRVLLMRQ
QQRRDSPDLT TAIQALDHFA DNVRLANSHE QLLGLEGAGA ATYFKALGSC ILREGFTFAG
RTRRPPTNPV NALLSFGYQV LWNHLLALIE LQDLDPYESC LHQSGDRHPA LASDLLEEFR
APIVDSLTLY LVNRGIVDAD KDFEYRDGGC LLNNSGRKKY LSAFLQRMEE QLHTANGLQP
RWELLTQQVR AYKAFVYAPA HGYQPYLTR
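
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It builds the codon table from the NCBI base ordering (T, C, A, G). Translation table 11 assigns the same amino acids as the standard code and differs only in which codons may start translation, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codons enumerated with bases in T, C, A, G order (NCBI genetic code layout).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGCGATCGC TCTACATCAG TCAGCAGGGT TGCTATGTTT CTCTGCGCCA GGAAATGCTA"
last_line = "CACGGTTATC AACCTTATCT AACACGCTAG"

print(translate(first_line))  # MRSLYISQQGCYVSLRQEML
print(translate(last_line))   # HGYQPYLTR
```

The two outputs correspond to the first 20 and the last 9 residues of the protein sequence listed above. The last line of the gene stays in frame because the preceding 960 bases are a multiple of three.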