Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_08421 |
Symbol | psbC |
ID | 4778629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 764428 |
End bp | 765822 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640086351 |
Product | photosystem II PsbC protein (CP43) |
Protein accession | YP_001016858 |
Protein GI | 124022551 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01153] photosystem II 44 kDa subunit reaction center protein (also called P6 protein, CP43), bacterial and chloroplast |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAAACGC CCTTTAAATC TTCCATACTG CAGAATGCCG GGGGATACAG CCTTGAATCC ACCGGTTACG CATGGTGGGC AGGTAATGCC CGTTTCATCA ACCTTTCTGG ACGCCTACTT GGCGCCCACG TTGCCCATGC AGGTCTGATG ACTTTCTGGG CAGGAGCCAT GCTCCTGTTC GAAGTCAGTC ACTTCACCTT CGACAAGCCA ATTTTCGAGC AAGGCCTGAT CTTGATGCCA CATGTGGCAG CGCTTGGTTA TGGCGTTGGC ACAGGTGGCG AAATCGTAGA CATCTACCCG TACTTCCATT GCGGGGTGAT GCACTTAATC ATCTCTGCTG TGTTCGGCTT AGGTGGGGTC TATCACGCCT TGGTCGGCCC TGAAAAGCTT CAGGATTACA GCTCTCCATT TTTCAGGCTT GATTGGAAAG ACAAAAATCA GATGACCAAC ATTCTTGGTT ACAACCTGAT TTTTCTAGGC TGGGGAGCTC TTGCTCTGAT CCTCAAGGCC TGCTTCTTTG GCGGCATCTA TGACACCTGG GCTCCAGGTG GTGGCGACGT GAGATTAATC ACCAGCCCCA CGCTTGATCC AGGCGTGATC TTTGGATACG TGTTCAGTTC TCCCTGGGGA GGTACCGGTT GGATCACTGG TGTCAACTCA ATGGAAGACC TCATTGGCGG CCACATTTAC GTTGCTGCTC TTCTATTCGT TGGCGGCCAT TTTCACATTG CCACCAAGCC ATGGGGATGG GTTCGTAGAG CCTTCATCTG GAACGGAGAG GCCTACCTCA GCTACGCCCT TGCTGGCCTG AGCTGTTGTG GTTTCATTGC CACGGCTTAC ATTTGGTTCA ACGTCACCGC CTATCCATCA GAGTTCTACG GTCCATCGAA CGCCGAAGCC TCTCAAGCCC AGAGCTTCAC CTTCCTTGTT CGTGACCAAC GCCTTGGAGC AAATATTGGT ACTGCCATGG GGCCAACAGG CCTTGGTAAG TACCTAATGC GTTCTCCTAC TGGTGAGATC ATCTTTGGTG GAGAAACGAT GCGCTTCTGG GACTTCCGTG GTCCTTGGTT AGAACCACTG CGTGGCCCCA ATGGCCTGAG CCTTGACAAG CTTCAGAACG ATGTTCAGCC ATGGCAAGTG CGCCGTGCTG CTGAATACAT GACCCACGCC CCCAACGCCT CCATCAACTC AGTAGGTGGA ATCATCACCG AGCCCAACTC GGTTAACTTC GTGAACCTTC GCCAGTGGTT AGCTGGTCAT GCGTTCTTCC TAGCTTGGTT CACCATCGTG GGGCACTGGT TCCACGCTGG TCGCGCCAGG GCTGTTGCAG CCGGGTTTGA AAAAGGCATC GATCGTAAGA CCGAACCAGC CTTGTCAATG CCTGATCTCG ACTGA
|
Protein sequence | METPFKSSIL QNAGGYSLES TGYAWWAGNA RFINLSGRLL GAHVAHAGLM TFWAGAMLLF EVSHFTFDKP IFEQGLILMP HVAALGYGVG TGGEIVDIYP YFHCGVMHLI ISAVFGLGGV YHALVGPEKL QDYSSPFFRL DWKDKNQMTN ILGYNLIFLG WGALALILKA CFFGGIYDTW APGGGDVRLI TSPTLDPGVI FGYVFSSPWG GTGWITGVNS MEDLIGGHIY VAALLFVGGH FHIATKPWGW VRRAFIWNGE AYLSYALAGL SCCGFIATAY IWFNVTAYPS EFYGPSNAEA SQAQSFTFLV RDQRLGANIG TAMGPTGLGK YLMRSPTGEI IFGGETMRFW DFRGPWLEPL RGPNGLSLDK LQNDVQPWQV RRAAEYMTHA PNASINSVGG IITEPNSVNF VNLRQWLAGH AFFLAWFTIV GHWFHAGRAR AVAAGFEKGI DRKTEPALSM PDLD
|
| |