Gene Synpcc7942_0656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_0656 
Symbol 
ID3775639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp651274 
End bp652659 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content58% 
IMG OID637799068 
Productphotosystem II 44 kDa subunit reaction center protein 
Protein accessionYP_399675 
Protein GI81299467 
COG category 
COG ID 
TIGRFAM ID[TIGR01153] photosystem II 44 kDa subunit reaction center protein (also called P6 protein, CP43), bacterial and chloroplast
[TIGR03041] chlorophyll a/b binding light-harvesting protein 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTAACGC TCTCTAGTCC TTCCGTGATC GCAGGCGGCC GGGATATTGA CTCCACCGGT 
TACGCTTGGT GGTCCGGCAA TGCCCGTTTG ATCAACCTGT CCGGTAAGCT GCTGGGCGCT
CACGTCGCTC ATGCTGGCTT GATCGTCTTC TGGGCTGGTG CGATGACGCT GTTTGAAGTC
GCGCACTTTG TCCCCGAAAA ACCGATGTAC GAGCAAGGCA TCATCCTGCT CTCGCACTTG
GCGACCCTCG GCTGGGGCGT TGGCCCTGGT GGCGAAGTCG TCGATACCTT CCCCTACTTT
GTGGTTGGGG TTCTGCACCT CATTTCTTCC GCCGTTCTGG GTTTGGGTGG GATCTACCAC
GCCCTGCGCG GCCCTGAGTC GCTGGAAGAG TACAGCACCT TCTTCAGCCA AGACTGGAAA
GACAAGAATC AGATGACCAA CATCATTGGT TATCACCTGA TTCTGCTGGG CTTAGGTGCC
TTCTTGCTGG TCTTTAAGGC CATGTTCTTC GGCGGTGTCT ATGACACCTG GGCGCCGGGT
GGTGGCGATG TCCGCATCAT CTCCAACCCA ACCCTCAACC CGGCTGTGAT CTTCGGCTAC
CTGCTGAAAT CACCCTTTGG TGGCGACGGC TGGATTGTCA GCGTCGACAA CCTTGAAGAC
GTGATTGGCG GCCATATCTG GATTGGTCTG ATCTGCATTT CGGGTGGTAT CTGGCACATC
CTGACCAAGC CTTTTGGCTG GGTGCGTCGC GCCTTCATCT GGAATGGCGA AGCTTACCTC
TCCTACAGCT TGGGTGCCCT GTCGTTGATG GGCTTCATTG CCTCGACGAT GGTTTGGTAC
AACAACACCG TCTATCCTTC CGAGTTCTTT GGCCCGACCG CTGCTGAAGC TTCGCAATCG
CAAGCCTTCA CCTTCTTGGT GCGTGACCAA CGCCTCGGTG CCAACATCGG TTCAGCTCAA
GGCCCGACCG GTCTGGGTAA ATACCTGATG CGCTCTCCTA CCGGCGAGAT CATCTTCGGT
GGCGAAACCA TGCGCTTCTG GGACTTCCGT GGCCCTTGGC TGGAGCCCCT GCGTGGACCG
AATGGTCTGG ATCTCGACAA GCTGACCAAT GACATTCAGC CTTGGCAAGC CCGTCGTGCG
GCTGAGTACA TGACCCACGC ACCGCTGGGT TCGCTGAACT CTGTGGGTGG TGTGGCAACG
GAAATCAACT CGGTGAACTT CGTGTCTCCC CGTGCTTGGT TGGCGACCAG CCACTTCGTC
TTGGCCTTCT TCTTCTTGGT CGGTCACCTC TGGCATGCAG GCCGCGCTCG TGCAGCTGCT
GCAGGCTTTG AGAAAGGTAT CGATCGCGCG ACCGAACCCG TGCTCGCAAT GAGAGACCTC
GACTAA
 
Protein sequence
MVTLSSPSVI AGGRDIDSTG YAWWSGNARL INLSGKLLGA HVAHAGLIVF WAGAMTLFEV 
AHFVPEKPMY EQGIILLSHL ATLGWGVGPG GEVVDTFPYF VVGVLHLISS AVLGLGGIYH
ALRGPESLEE YSTFFSQDWK DKNQMTNIIG YHLILLGLGA FLLVFKAMFF GGVYDTWAPG
GGDVRIISNP TLNPAVIFGY LLKSPFGGDG WIVSVDNLED VIGGHIWIGL ICISGGIWHI
LTKPFGWVRR AFIWNGEAYL SYSLGALSLM GFIASTMVWY NNTVYPSEFF GPTAAEASQS
QAFTFLVRDQ RLGANIGSAQ GPTGLGKYLM RSPTGEIIFG GETMRFWDFR GPWLEPLRGP
NGLDLDKLTN DIQPWQARRA AEYMTHAPLG SLNSVGGVAT EINSVNFVSP RAWLATSHFV
LAFFFLVGHL WHAGRARAAA AGFEKGIDRA TEPVLAMRDL D