Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_0656 |
Symbol | |
ID | 3775639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 651274 |
End bp | 652659 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637799068 |
Product | photosystem II 44 kDa subunit reaction center protein |
Protein accession | YP_399675 |
Protein GI | 81299467 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01153] photosystem II 44 kDa subunit reaction center protein (also called P6 protein, CP43), bacterial and chloroplast [TIGR03041] chlorophyll a/b binding light-harvesting protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTAACGC TCTCTAGTCC TTCCGTGATC GCAGGCGGCC GGGATATTGA CTCCACCGGT TACGCTTGGT GGTCCGGCAA TGCCCGTTTG ATCAACCTGT CCGGTAAGCT GCTGGGCGCT CACGTCGCTC ATGCTGGCTT GATCGTCTTC TGGGCTGGTG CGATGACGCT GTTTGAAGTC GCGCACTTTG TCCCCGAAAA ACCGATGTAC GAGCAAGGCA TCATCCTGCT CTCGCACTTG GCGACCCTCG GCTGGGGCGT TGGCCCTGGT GGCGAAGTCG TCGATACCTT CCCCTACTTT GTGGTTGGGG TTCTGCACCT CATTTCTTCC GCCGTTCTGG GTTTGGGTGG GATCTACCAC GCCCTGCGCG GCCCTGAGTC GCTGGAAGAG TACAGCACCT TCTTCAGCCA AGACTGGAAA GACAAGAATC AGATGACCAA CATCATTGGT TATCACCTGA TTCTGCTGGG CTTAGGTGCC TTCTTGCTGG TCTTTAAGGC CATGTTCTTC GGCGGTGTCT ATGACACCTG GGCGCCGGGT GGTGGCGATG TCCGCATCAT CTCCAACCCA ACCCTCAACC CGGCTGTGAT CTTCGGCTAC CTGCTGAAAT CACCCTTTGG TGGCGACGGC TGGATTGTCA GCGTCGACAA CCTTGAAGAC GTGATTGGCG GCCATATCTG GATTGGTCTG ATCTGCATTT CGGGTGGTAT CTGGCACATC CTGACCAAGC CTTTTGGCTG GGTGCGTCGC GCCTTCATCT GGAATGGCGA AGCTTACCTC TCCTACAGCT TGGGTGCCCT GTCGTTGATG GGCTTCATTG CCTCGACGAT GGTTTGGTAC AACAACACCG TCTATCCTTC CGAGTTCTTT GGCCCGACCG CTGCTGAAGC TTCGCAATCG CAAGCCTTCA CCTTCTTGGT GCGTGACCAA CGCCTCGGTG CCAACATCGG TTCAGCTCAA GGCCCGACCG GTCTGGGTAA ATACCTGATG CGCTCTCCTA CCGGCGAGAT CATCTTCGGT GGCGAAACCA TGCGCTTCTG GGACTTCCGT GGCCCTTGGC TGGAGCCCCT GCGTGGACCG AATGGTCTGG ATCTCGACAA GCTGACCAAT GACATTCAGC CTTGGCAAGC CCGTCGTGCG GCTGAGTACA TGACCCACGC ACCGCTGGGT TCGCTGAACT CTGTGGGTGG TGTGGCAACG GAAATCAACT CGGTGAACTT CGTGTCTCCC CGTGCTTGGT TGGCGACCAG CCACTTCGTC TTGGCCTTCT TCTTCTTGGT CGGTCACCTC TGGCATGCAG GCCGCGCTCG TGCAGCTGCT GCAGGCTTTG AGAAAGGTAT CGATCGCGCG ACCGAACCCG TGCTCGCAAT GAGAGACCTC GACTAA
|
Protein sequence | MVTLSSPSVI AGGRDIDSTG YAWWSGNARL INLSGKLLGA HVAHAGLIVF WAGAMTLFEV AHFVPEKPMY EQGIILLSHL ATLGWGVGPG GEVVDTFPYF VVGVLHLISS AVLGLGGIYH ALRGPESLEE YSTFFSQDWK DKNQMTNIIG YHLILLGLGA FLLVFKAMFF GGVYDTWAPG GGDVRIISNP TLNPAVIFGY LLKSPFGGDG WIVSVDNLED VIGGHIWIGL ICISGGIWHI LTKPFGWVRR AFIWNGEAYL SYSLGALSLM GFIASTMVWY NNTVYPSEFF GPTAAEASQS QAFTFLVRDQ RLGANIGSAQ GPTGLGKYLM RSPTGEIIFG GETMRFWDFR GPWLEPLRGP NGLDLDKLTN DIQPWQARRA AEYMTHAPLG SLNSVGGVAT EINSVNFVSP RAWLATSHFV LAFFFLVGHL WHAGRARAAA AGFEKGIDRA TEPVLAMRDL D
|
| |