Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_13501 |
Symbol | psbC |
ID | 4912436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 1127422 |
End bp | 1128804 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640160939 |
Product | photosystem II PsbC protein (CP43) |
Protein accession | YP_001091574 |
Protein GI | 126696688 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01153] photosystem II 44 kDa subunit reaction center protein (also called P6 protein, CP43), bacterial and chloroplast |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.151398 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAAACGC CCTTTAATAA TTTATTAAGA GCTCCAAACC AAAGTATTGA GGAAACTGGT TATGCCTGGT ATGTAGGCAA CGCTAGATTA ATCAATTTAT CTGGACGTTT ATTAGGAGCT CACATTGCTC ACTCTGGACT AATAGTCTTT TGGGCGGGAG CAATGATGCT CTTTGAGGTT AATCACTTTA CTTTTGATAA ACCAATGTGG GAGCAAGGTT TAATCTGTAT GCCACACGTC GCGATGTTTG GCTATGGAAT AGGCCCTGGT GGTGAAGTTA CTGATATCAT GCCTTTCTTC CAGGCAGGCG TGGTTCACTT AATAGCTTCT GCTGTTCTTG GTTTTGGTGG TATTTATCAC TCATTAGCAG GTCCAGAAAA ACTCGAAGAA GATTTTCCAT TTTTCTCCAC CGACTGGAGA GACAAAAACC AAATGACAAA TATCCTTGGA TATCATTTGA TTGTTTTAGG TGTAGGTGCG TTAGCATGGT CAGTTAACTG GTGTTTTATT GGCGGTGCAT ATGATACATG GGCACCAGGC GGTGGTGAAG TCAGACTTGT TAACCCAACT TTAGATCCAA GAGTTATTCT TGGTTATTTA TTCAGATCTC CATGGGGAGG AGCTGGATCA ATAATCGGCG TTAACTCCAT AGAAGATATT GTTGGTGGTC ACGTTTATGT TGGAATTACT GCAATTATCG GTGGAATATT CCATATCTTT ACCAAACCTT TTGGATGGGC AAGAAGAGCA TTTATCTGGA ATGGTGAAGG CTTATTAAGT TATGCACTTG GTGGAATTTG TGTAGCAAGT TTTATTGCTT CAACATTCAT CTGGTTTAAC AACACTGCTT ACCCTTCAGA ATTCTACGGT CCAACTAATG CTGAAGCATC ACAGGCTCAA AGTTTTACTT TCCTTGTGAG AGATCAAAGA ATTGGAGCTA ATGTAGGTTC AACAATGGGA CCAACAGGTC TAGGTAAGTA TCTCATGAGA TCTCCTACTG GTGAAATTAT ATTTGGTGGT GAAACAATGA GATTTTGGGA TTTCAGAGGT CCATGGTTAG AGCCTTTAAG AGGTCCTAAC GGATTGAGCC TTGAGAAAAT CCAAAATGAT ATTCAGCCTT GGCAGGTAAG AAGAGCCGCT GAATATATGA CTCATGCTCC TAACGCTTCT ATCAACTCTG TTGGTGGAAT TATTACAGAG CCAAATGCTG TTAACTTCGT TAACCTTAGA CAGTGGTTAG CTGCAGCTCA ATTCTTCCTA GGATGGTTTA CATTCATTGG TCACCTTTGG CATGCTGGAC GTGCTAGAGC AGCCGCTGCT GGTTTCGAAA AAGGAATCGA CAGAAAGAGT GAACCGGCTC TAGAAATGCC TGATTTAGAT TAA
|
Protein sequence | METPFNNLLR APNQSIEETG YAWYVGNARL INLSGRLLGA HIAHSGLIVF WAGAMMLFEV NHFTFDKPMW EQGLICMPHV AMFGYGIGPG GEVTDIMPFF QAGVVHLIAS AVLGFGGIYH SLAGPEKLEE DFPFFSTDWR DKNQMTNILG YHLIVLGVGA LAWSVNWCFI GGAYDTWAPG GGEVRLVNPT LDPRVILGYL FRSPWGGAGS IIGVNSIEDI VGGHVYVGIT AIIGGIFHIF TKPFGWARRA FIWNGEGLLS YALGGICVAS FIASTFIWFN NTAYPSEFYG PTNAEASQAQ SFTFLVRDQR IGANVGSTMG PTGLGKYLMR SPTGEIIFGG ETMRFWDFRG PWLEPLRGPN GLSLEKIQND IQPWQVRRAA EYMTHAPNAS INSVGGIITE PNAVNFVNLR QWLAAAQFFL GWFTFIGHLW HAGRARAAAA GFEKGIDRKS EPALEMPDLD
|
| |