Gene P9303_08421 details


Gene Information

Locus tag: P9303_08421
Symbol: psbC
ID: 4778629
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 764428
End bp: 765822
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 53%
IMG OID: 640086351
Product: photosystem II PsbC protein (CP43)
Protein accession: YP_001016858
Protein GI: 124022551
COG category:
COG ID:
TIGRFAM ID: [TIGR01153] photosystem II 44 kDa subunit reaction center protein (also called P6 protein, CP43), bacterial and chloroplast
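
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes inclusive coordinates and a gene length that counts one 3-bp stop codon; both are assumptions about how this record counts, not something the record states. The function name `check_record_lengths` is illustrative only.

```python
def check_record_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Return True if coordinates, gene length and protein length agree.

    Assumes inclusive coordinates and a gene length that includes the
    stop codon (assumptions, not stated in the record).
    """
    span = end_bp - start_bp + 1        # inclusive span on the replicon
    coding = (protein_len_aa + 1) * 3   # one codon per residue plus one stop
    return span == gene_len_bp == coding


# Values copied from the Gene Information fields above.
print(check_record_lengths(764428, 765822, 1395, 464))
```

Under these assumptions the span is 765822 - 764428 + 1 = 1395 bp and the coding length is (464 + 1) x 3 = 1395 bp, so the fields agree.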


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAAACGC CCTTTAAATC TTCCATACTG CAGAATGCCG GGGGATACAG CCTTGAATCC 
ACCGGTTACG CATGGTGGGC AGGTAATGCC CGTTTCATCA ACCTTTCTGG ACGCCTACTT
GGCGCCCACG TTGCCCATGC AGGTCTGATG ACTTTCTGGG CAGGAGCCAT GCTCCTGTTC
GAAGTCAGTC ACTTCACCTT CGACAAGCCA ATTTTCGAGC AAGGCCTGAT CTTGATGCCA
CATGTGGCAG CGCTTGGTTA TGGCGTTGGC ACAGGTGGCG AAATCGTAGA CATCTACCCG
TACTTCCATT GCGGGGTGAT GCACTTAATC ATCTCTGCTG TGTTCGGCTT AGGTGGGGTC
TATCACGCCT TGGTCGGCCC TGAAAAGCTT CAGGATTACA GCTCTCCATT TTTCAGGCTT
GATTGGAAAG ACAAAAATCA GATGACCAAC ATTCTTGGTT ACAACCTGAT TTTTCTAGGC
TGGGGAGCTC TTGCTCTGAT CCTCAAGGCC TGCTTCTTTG GCGGCATCTA TGACACCTGG
GCTCCAGGTG GTGGCGACGT GAGATTAATC ACCAGCCCCA CGCTTGATCC AGGCGTGATC
TTTGGATACG TGTTCAGTTC TCCCTGGGGA GGTACCGGTT GGATCACTGG TGTCAACTCA
ATGGAAGACC TCATTGGCGG CCACATTTAC GTTGCTGCTC TTCTATTCGT TGGCGGCCAT
TTTCACATTG CCACCAAGCC ATGGGGATGG GTTCGTAGAG CCTTCATCTG GAACGGAGAG
GCCTACCTCA GCTACGCCCT TGCTGGCCTG AGCTGTTGTG GTTTCATTGC CACGGCTTAC
ATTTGGTTCA ACGTCACCGC CTATCCATCA GAGTTCTACG GTCCATCGAA CGCCGAAGCC
TCTCAAGCCC AGAGCTTCAC CTTCCTTGTT CGTGACCAAC GCCTTGGAGC AAATATTGGT
ACTGCCATGG GGCCAACAGG CCTTGGTAAG TACCTAATGC GTTCTCCTAC TGGTGAGATC
ATCTTTGGTG GAGAAACGAT GCGCTTCTGG GACTTCCGTG GTCCTTGGTT AGAACCACTG
CGTGGCCCCA ATGGCCTGAG CCTTGACAAG CTTCAGAACG ATGTTCAGCC ATGGCAAGTG
CGCCGTGCTG CTGAATACAT GACCCACGCC CCCAACGCCT CCATCAACTC AGTAGGTGGA
ATCATCACCG AGCCCAACTC GGTTAACTTC GTGAACCTTC GCCAGTGGTT AGCTGGTCAT
GCGTTCTTCC TAGCTTGGTT CACCATCGTG GGGCACTGGT TCCACGCTGG TCGCGCCAGG
GCTGTTGCAG CCGGGTTTGA AAAAGGCATC GATCGTAAGA CCGAACCAGC CTTGTCAATG
CCTGATCTCG ACTGA
 
Protein sequence
METPFKSSIL QNAGGYSLES TGYAWWAGNA RFINLSGRLL GAHVAHAGLM TFWAGAMLLF 
EVSHFTFDKP IFEQGLILMP HVAALGYGVG TGGEIVDIYP YFHCGVMHLI ISAVFGLGGV
YHALVGPEKL QDYSSPFFRL DWKDKNQMTN ILGYNLIFLG WGALALILKA CFFGGIYDTW
APGGGDVRLI TSPTLDPGVI FGYVFSSPWG GTGWITGVNS MEDLIGGHIY VAALLFVGGH
FHIATKPWGW VRRAFIWNGE AYLSYALAGL SCCGFIATAY IWFNVTAYPS EFYGPSNAEA
SQAQSFTFLV RDQRLGANIG TAMGPTGLGK YLMRSPTGEI IFGGETMRFW DFRGPWLEPL
RGPNGLSLDK LQNDVQPWQV RRAAEYMTHA PNASINSVGG IITEPNSVNF VNLRQWLAGH
AFFLAWFTIV GHWFHAGRAR AVAAGFEKGI DRKTEPALSM PDLD
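
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any calculation. The following is a minimal sketch of that clean-up together with a GC-content calculation that could be compared against the GC content field; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tooling. Note that the gene starts with GTG while the protein starts with M, which is consistent with GTG serving as an alternative initiation codon under translation table 11.

```python
def clean_sequence(text):
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short demonstration;
# paste the full sequence to compare with the record's GC content field.
first_line = "GTGGAAACGC CCTTTAAATC TTCCATACTG CAGAATGCCG GGGGATACAG CCTTGAATCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

The demonstration uses only the first 60 bases, so its percentage is not expected to match the whole-gene value.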