Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Syncc9605_1993 |
Symbol | |
ID | 3737441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus sp. CC9605 |
Kingdom | Bacteria |
Replicon accession | NC_007516 |
Strand | + |
Start bp | 1810834 |
End bp | 1812222 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637776579 |
Product | photosystem II 44 kDa subunit reaction center protein |
Protein accession | YP_382289 |
Protein GI | 78213510 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01153] photosystem II 44 kDa subunit reaction center protein (also called P6 protein, CP43), bacterial and chloroplast |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.296438 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.146878 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTAACGC TCTCTAATCC CGGTCTTGGC GCCACTGGCG GCAAAGACCT TCCCTCCACT GGGTATGCCT GGTGGTCCGG CAACGCCCGC TTGATCAACC TGTCCGGCCG TCTGCTTGGT GCCCACGTGG CCCACGCTGG TCTGATGGTG TTCTGGGCCG GCGCAATGAT GCTGTTCGAG GTGAGCCACT TCACCTTCGA TAAGCCCATG TACGAACAGG GCTTCATCTG CATGCCTCAC GTCGCCACCC TTGGCTACGG CGTGGGCCCC GGCGGTGAAG TCACTGATCT CTTCCCCTTC TTCGTGGTCG GTGTTCTGCA CCTGATCAGC TCCGCCGTGC TCGGCCTCGG CGGCCTCTAT CACGCCCTGC GTGGTCCTGA GATCCTGGAG AACTACTCCT CCTTCTTCTC CCAGGACTGG CGTGACAAGA ACCAGATGAC CAACATCATT GGTTATCACC TGATTCTCCT GGGCGTCGGC TGCCTGCTGC TGGTCTTCAA GGCCATGTTC TTCGGCGGCG TTTACGACAC CTGGGCCCCC GGCGGCGGTG ACGTCCGCAT GATCACTAAC CCGACCCTCG ATCCGGGCGT GATCTTCGGC TACCTGTTCC GCGCTCCCTT CGGCGGCGAG GGCTGGATTA TCGGTGTGAA CTCCATGGAG GACATCATCG GTGGCCACAT CTGGCTGGGT CTGACCCTGA TTTTCGGTGG CATCTGGCAT GCCATCACCA AGCCTTTCGG TTGGGTGCGT CGCGCCTTCA TCTGGAACGG TGAGGCCTAC CTGAGCTACA GCCTTGGCGC TCTGAGCTTC ATGAGCTTCA TCGCTTCGGC CTACATCTGG TTCAACAACA CCGCCTATCC CTCCGAGTTC TGGGGCCCCA CCAACGCTGA GGCATCCCAG GCTCAGAGCT TCACCTTCCT GGTGCGTGAC CAGCGTCTCG GCGCCAACAT CGGTTCCGCC ATGGGCCCCA CCGGCCTTGG TAAGTACTTG ATGCGATCAC CTACCGGTGA AATCATCTTC GGGGGTGAAA CCATGCGCTT CTGGGACTTC CGTGGTCCCT GGCTTGAGCC CCTGCGTGGT CCCAACGGCC TCAGCCTCGA CAAGCTGCAG AACGACATTC AGCCCTGGCA AGTGCGCCGT GCGGCTGAGT ACATGACCCA CGCTCCGAAC GCCTCGATCA ACTCCGTGGG CGGCATTATC ACCGAGCCCA ACTCGGTGAA CTACGTGAAC CTCCGCCAGT GGCTGGGTGC AACGCAGTTC GTGCTTGCCT TCTTCTTCCT GGTTGGTCAC CTCTGGCACG CCGGCCGCGC CCGCGCTGCT GCTGCTGGCT TCGAGAAAGG CATCGACCGC AAGGCTGAGC CTGTGCTCGG CATGCCTGAT CTCGACTGA
|
Protein sequence | MVTLSNPGLG ATGGKDLPST GYAWWSGNAR LINLSGRLLG AHVAHAGLMV FWAGAMMLFE VSHFTFDKPM YEQGFICMPH VATLGYGVGP GGEVTDLFPF FVVGVLHLIS SAVLGLGGLY HALRGPEILE NYSSFFSQDW RDKNQMTNII GYHLILLGVG CLLLVFKAMF FGGVYDTWAP GGGDVRMITN PTLDPGVIFG YLFRAPFGGE GWIIGVNSME DIIGGHIWLG LTLIFGGIWH AITKPFGWVR RAFIWNGEAY LSYSLGALSF MSFIASAYIW FNNTAYPSEF WGPTNAEASQ AQSFTFLVRD QRLGANIGSA MGPTGLGKYL MRSPTGEIIF GGETMRFWDF RGPWLEPLRG PNGLSLDKLQ NDIQPWQVRR AAEYMTHAPN ASINSVGGII TEPNSVNYVN LRQWLGATQF VLAFFFLVGH LWHAGRARAA AAGFEKGIDR KAEPVLGMPD LD
|
| |