Gene Information |
Locus tag | P9211_12341 |
Symbol | psbC |
ID | 5731837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1109667 |
End bp | 1111049 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641285602 |
Product | photosystem II PsbC protein (CP43) |
Protein accession | YP_001551119 |
Protein GI | 159903775 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01153] photosystem II 44 kDa subunit reaction center protein (also called P6 protein, CP43), bacterial and chloroplast |
|
|
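The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record or of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1109667, 1111049)
print(length, protein_length_aa(length))
```

With the values in this record the check reproduces the listed Gene Length (1383 bp) and Protein Length (460 aa).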
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.606702 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.098859 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAAACGC CCTTTAATAA TCTTCTAAAA GCTCCCAATC AAAGTATTGA AGAGACGGGT TATGCCTGGT ATGTAGGTAA TGCTCGTCTA ATCAATCTTT CAGGAAGATT GCTTGGAGCT CACATAGCCC ATACAGGACT AATGGTCTTT TGGGCAGGAG CCATGATGCT CTATGAAGTT AGTCACTTCA CCTTCGACAA ACCAATGTGG GAGCAAGGTT TAATCCTTTT GCCTCATGTA GCCATGTTTG GTTATGGAAT TGGACCGGGT GGAGAAGTTG TTGATGTAAT GCCATATTTC CAGGCAGGTG TTGTTCACCT TGTAGCTTCA GCAATCCTCG GATTTGGTGG TATTTACCAT GCCCTTGCAG GTCCTGAAAA ATTAGAAGAG GAGTTCCCAT TCTTTTCAAC TGACTGGAGA GATAAGGACC AAATGACAAC CATTCTTGGC CGTCATTTAT GTGTTCTTGG CTTAGGAGCA ATAGCTTTTG CCGCTAACTG GCAATTCCTT GGTGGTCTCT ATGACACATG GGCTCCAGGC GGTGGTGAGG TGAGATTAAT CACACCTACA ACTGACCCTG GTATCATCTT CGGCTACCTT TTCCAGACCC CTTGGGGAGG AGGAGGAAAT ATGGTGGGAG TTAATTCAGT TGAAGATATT GTTGGTGGCC ATTACTACCT AGGAATCATC GAATTAATAG GTGGTCATTT TCACATGCAA ACCAAACCTT TTGGATGGGC GAGAAGAGCC TTCATTTGGA ATGGCGAAGC TTTACTTAGC TATGCCCTAG GTGGGTTATG TGTAGCAAGC TTCTACGCAT CTGTATTTGT TTGGTTCAAC AACACTGCCT ATCCTTCAGA GTTTTACGGA CCAACAAACG CTGAAGCCTC TCAAGCGCAA AGCTTCACAT TCCTTGTCAG AGACCAGCGG ATAGGTGCAA ACGTTGGAAC CACAATGGGT CCTACTGGTT TAGGTAAGTA TTTAATGCGT TCTCCAACTG GTGAAATCAT CTTTGGCGGA GAAACAATGC GTTTTTGGGA CTTCCGTGGT CCTTGGCTTG AACCATTAAG GGGTGAAAAC GGTCTAAGTT TAGATAAAGT TCAAAATGAT ATACAGCCTT GGCAAGTTCG TCGAGCTGCT GAATACATGA CGCACGCTCC TAACGCTTCT ATCAACTCAG TTGGTGGAAT CATTACTGAG CCTAATGCAG TTAACTTTGT AAATCTCCGT CAGTGGCTTG CATCAGCCCA CTTCTTCTTA GGATGGTTTA CATTTGTTGG CCACCTATGG CATGCAGGAA GAGCCAGAGC TGCTGCGGGC GGCTTTGAAA AAGGTATCGA TCGCAAGAAC GAGCCTGCCC TTTCAATGCC TGATTTGGAC TAA
|
Protein sequence | METPFNNLLK APNQSIEETG YAWYVGNARL INLSGRLLGA HIAHTGLMVF WAGAMMLYEV SHFTFDKPMW EQGLILLPHV AMFGYGIGPG GEVVDVMPYF QAGVVHLVAS AILGFGGIYH ALAGPEKLEE EFPFFSTDWR DKDQMTTILG RHLCVLGLGA IAFAANWQFL GGLYDTWAPG GGEVRLITPT TDPGIIFGYL FQTPWGGGGN MVGVNSVEDI VGGHYYLGII ELIGGHFHMQ TKPFGWARRA FIWNGEALLS YALGGLCVAS FYASVFVWFN NTAYPSEFYG PTNAEASQAQ SFTFLVRDQR IGANVGTTMG PTGLGKYLMR SPTGEIIFGG ETMRFWDFRG PWLEPLRGEN GLSLDKVQND IQPWQVRRAA EYMTHAPNAS INSVGGIITE PNAVNFVNLR QWLASAHFFL GWFTFVGHLW HAGRARAAAG GFEKGIDRKN EPALSMPDLD
|
| |
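The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates the idea on the first 30 bases of the gene sequence. It assumes the standard codon assignments in TCAG order and treats only ATG, GTG and TTG as start codons, which is a simplification; the names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognised start codon as M and dropping the stop."""
    dna = dna.replace(" ", "").upper()
    codons = [dna[i:i + 3] for i in range(0, len(dna) - 2, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # initiator codon is translated as methionine
    return "".join(residues).rstrip("*")


# First three 10-base groups of the gene sequence in this record.
prefix = "GTGGAAACGC CCTTTAATAA TCTTCTAAAA"
print(translate_cds(prefix))  # METPFNNLLK
```

The result matches the first ten residues of the protein sequence listed above.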