Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_13351 |
Symbol | psbC |
ID | 4718054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 1109686 |
End bp | 1111068 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640079054 |
Product | photosystem II PsbC protein (CP43) |
Protein accession | YP_001009726 |
Protein GI | 123968868 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01153] photosystem II 44 kDa subunit reaction center protein (also called P6 protein, CP43), bacterial and chloroplast |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAAACGC CCTTTAATAA TTTATTAAGA GCTCCAAACC AAAGTATTGA GGAAACTGGT TATGCCTGGT ATGTAGGCAA CGCTAGATTA ATCAATTTAT CTGGACGTTT ATTGGGAGCT CACATTGCTC ACTCTGGACT AATAGTCTTT TGGGCGGGAG CAATGATGCT CTTTGAGGTT AATCATTTCA CTTTTGATAA ACCAATGTGG GAGCAAGGTT TAATCTGTAT GCCACACGTT GCAATGTTTG GCTACGGGAT AGGCCCAGGT GGTGAAGTTA CTGATATCAT GCCTTTCTTC CAAGCAGGCG TGGTTCATTT GATAGCTTCT GCTGTTCTTG GTTTTGGTGG TATTTACCAT TCATTAGCAG GCCCAGAAAA ACTTGAAGAA GATTTTCCAT TTTTCTCCAC CGATTGGAGA GATAAAAATC AAATGACAAA TATTCTTGGA TATCATTTGA TTGTATTAGG TGTAGGTGCA TTAGCATGGT CAGTTAACTG GTGTTTTATT GGCGGTGCAT ATGACACATG GGCACCTGGT GGTGGTGAAG TCAGACTTGT TAATCCAACT TTAGATCCGA GAGTTATTCT TGGTTATCTA TTTAGATCTC CATGGGGAGG AGCTGGTTCA ATAATCGGTG TCAACTCCAT CGAAGATATT GTTGGTGGAC ACGTTTACGT GGGTATTACT GCAATTATTG GAGGAATATT CCATATCTTC ACAAAACCCT TTGGATGGGC AAGAAGAGCA TTCATCTGGA ATGGTGAAGG ACTATTAAGT TATGCTCTTG GTGGAATTTG TGTAGCAAGT TTCATTGCTT CAACATTCAT TTGGTTTAAC AATACTGCTT ACCCTTCAGA ATTTTATGGC CCAACAAATG CTGAAGCTTC ACAGGCTCAA AGCTTTACTT TCCTAGTGAG AGACCAAAGA ATTGGAGCTA ACGTAGGTTC AACAATGGGA CCAACAGGTC TAGGTAAGTA TCTCATGAGA TCTCCTACTG GTGAAATTAT ATTTGGTGGT GAAACAATGA GATTTTGGGA TTTCAGAGGT CCATGGTTAG AGCCTTTAAG AGGTCCTAAT GGTTTGAGCC TTGAGAAAAT TCAAAATGAT ATTCAGCCTT GGCAGGTAAG AAGAGCAGCT GAATATATGA CTCATGCTCC TAACGCTTCT ATCAACTCTG TTGGTGGAAT CATTACAGAG CCAAATGCTG TTAACTTCGT TAACCTAAGA CAATGGTTAG CTGCAGCTCA ATTCTTCCTA GGATGGTTTA CATTTATCGG TCATCTTTGG CATGCTGGAC GTGCTAGAGC TGCCGCTGCT GGTTTCGAGA AAGGAATCGA CAGAAAGAGT GAACCAGCTC TAGAAATGCC TGATCTAGAT TAA
|
Protein sequence | METPFNNLLR APNQSIEETG YAWYVGNARL INLSGRLLGA HIAHSGLIVF WAGAMMLFEV NHFTFDKPMW EQGLICMPHV AMFGYGIGPG GEVTDIMPFF QAGVVHLIAS AVLGFGGIYH SLAGPEKLEE DFPFFSTDWR DKNQMTNILG YHLIVLGVGA LAWSVNWCFI GGAYDTWAPG GGEVRLVNPT LDPRVILGYL FRSPWGGAGS IIGVNSIEDI VGGHVYVGIT AIIGGIFHIF TKPFGWARRA FIWNGEGLLS YALGGICVAS FIASTFIWFN NTAYPSEFYG PTNAEASQAQ SFTFLVRDQR IGANVGSTMG PTGLGKYLMR SPTGEIIFGG ETMRFWDFRG PWLEPLRGPN GLSLEKIQND IQPWQVRRAA EYMTHAPNAS INSVGGIITE PNAVNFVNLR QWLAAAQFFL GWFTFIGHLW HAGRARAAAA GFEKGIDRKS EPALEMPDLD
|
| |