Gene PCC7424_5841 details

Gene Information

Locus tag: PCC7424_5841
Symbol:
ID: 7112828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 7424
Kingdom: Bacteria
Replicon accession: NC_011738
Strand:
Start bp: 265618
End bp: 266622
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 34%
IMG OID: 643484120
Product: CRISPR-associated protein Cas1
Protein accession: YP_002381129
Protein GI: 218442809
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
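
The start, end, gene length and protein length fields are tied together by simple arithmetic. The sketch below checks that relationship for this record. It assumes the coordinates are 1-based and inclusive, which is consistent with the lengths listed but is not stated in the record itself, and it assumes the final codon is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 265618
end_bp = 266622

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A non-spliced CDS is read in whole codons. Assumption: the last
# codon is a stop codon and contributes no residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1005 0 334
```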


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCAACAC TTTATTTAAT GGAACAAGGA ACATGGGTTC AGAAAGAACA AGAACGATTG 
ATTATTCAAG TTTCTAAAAC TCAAAAGATG GAAGTTTTGA TGCGAGAAGT AGAGAGAATT
ATGATTTTTG GCAATGTTCA ATTAAGTACG CCAGCCATTA ATGCTTGTTT AAAACATAAT
ATTTTAGTCT TGTTTTTGAA TCAGGCAGGA CAATATAATG GTCATTTATG GAGTTTAGGC
TCTATTCATC TTAATAATGA AATGGTTCAG ATTAAACGTC ATCAAGAGCA TGAGTTTCAA
GTGAAGATAT CTAAAGCGAT CGTTTATGGA AAGCTGATGA ATTCTAAGCG ACTTTTAATG
CGACTAAATC GCAAGCGTCA AGTCCCAGAT ATGGACAAGG TAATTGAGGG AATTAATTCA
GATATTCTCA GTTTAGAGTC AGTGGATAAT CTTGACCAAT TGAGAGGTTA TGAGGGGATA
GCTGCTGCCC GCTATTTTCC GGCTTTTGGT CAATTAATTA CTAATGCGGC TTTTAGTTTT
TCTCTGAGAA ATCGTCAGCC TCCTACTGAT CCGGTTAATT CTTTATTAAG TTTTGGCTAT
ACTTTGTTAT TTAATAATGT TTTAAGTTTA ATTATTAGTG AAGGGCTTTC TCCTTATTTT
GGGAATTTTC ATTATGGAGA ACGGGATAAA CCTTATTTGG CTTTTGATTT GATGGAAGAG
TTTCGCGCTA TAATTGTGGA TGGTATGGTT TTAAGGGTGA TTAATAATGG TTTATTGACC
CTTAAAGATT TTGAACCGGT TGCGAGTAAT GGAGGAGTTT ATTTAACGGA TAAAGGAAGG
AGAATTTTTC TTAAGGAGTT TGAATCTCGA ATTAATAAAT TGATTTCTCA CCCCGATATT
CAATCGCCAG TTTCTTATCG ACAGACTATT CAGTTACAAA TTCGTCGGTA TAAACAAAGT
TTGTTATCCG ATGTGAGTTA TCAATCTTTT GTGAGGGATA TCTAA
 
Protein sequence
MATLYLMEQG TWVQKEQERL IIQVSKTQKM EVLMREVERI MIFGNVQLST PAINACLKHN 
ILVLFLNQAG QYNGHLWSLG SIHLNNEMVQ IKRHQEHEFQ VKISKAIVYG KLMNSKRLLM
RLNRKRQVPD MDKVIEGINS DILSLESVDN LDQLRGYEGI AAARYFPAFG QLITNAAFSF
SLRNRQPPTD PVNSLLSFGY TLLFNNVLSL IISEGLSPYF GNFHYGERDK PYLAFDLMEE
FRAIIVDGMV LRVINNGLLT LKDFEPVASN GGVYLTDKGR RIFLKEFESR INKLISHPDI
QSPVSYRQTI QLQIRRYKQS LLSDVSYQSF VRDI
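
The gene and protein sequences above are displayed in groups of ten characters, and the record lists translation table 11. As a rough consistency check, the sketch below strips the grouping from the first display line of the gene sequence, translates it, and compares the result with the first twenty residues of the protein sequence. The helper names `ungroup` and `translate` are hypothetical, not part of any database tool. The codon table uses the amino acid assignments that translation table 11 shares with the standard code. Alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
# NCBI codon ordering: bases T, C, A, G at each codon position.
BASES = "TCAG"
# Amino acid assignments shared by translation tables 1 and 11 ("*" = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def ungroup(block: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in groups."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First display line of the gene sequence and the first two groups
# of the protein sequence, copied from the record.
gene_prefix = ungroup(
    "ATGGCAACAC TTTATTTAAT GGAACAAGGA ACATGGGTTC AGAAAGAACA AGAACGATTG"
)
protein_prefix = ungroup("MATLYLMEQG TWVQKEQERL")

print(translate(gene_prefix) == protein_prefix)  # True
```

The same two helpers can be applied to the full gene sequence block. The comparison is limited to a prefix here to keep the example short.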