Gene P9211_12341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_12341 
SymbolpsbC 
ID5731837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1109667 
End bp1111049 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content45% 
IMG OID641285602 
Productphotosystem II PsbC protein (CP43) 
Protein accessionYP_001551119 
Protein GI159903775 
COG category 
COG ID 
TIGRFAM ID[TIGR01153] photosystem II 44 kDa subunit reaction center protein (also called P6 protein, CP43), bacterial and chloroplast 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.606702 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.098859 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAACGC CCTTTAATAA TCTTCTAAAA GCTCCCAATC AAAGTATTGA AGAGACGGGT 
TATGCCTGGT ATGTAGGTAA TGCTCGTCTA ATCAATCTTT CAGGAAGATT GCTTGGAGCT
CACATAGCCC ATACAGGACT AATGGTCTTT TGGGCAGGAG CCATGATGCT CTATGAAGTT
AGTCACTTCA CCTTCGACAA ACCAATGTGG GAGCAAGGTT TAATCCTTTT GCCTCATGTA
GCCATGTTTG GTTATGGAAT TGGACCGGGT GGAGAAGTTG TTGATGTAAT GCCATATTTC
CAGGCAGGTG TTGTTCACCT TGTAGCTTCA GCAATCCTCG GATTTGGTGG TATTTACCAT
GCCCTTGCAG GTCCTGAAAA ATTAGAAGAG GAGTTCCCAT TCTTTTCAAC TGACTGGAGA
GATAAGGACC AAATGACAAC CATTCTTGGC CGTCATTTAT GTGTTCTTGG CTTAGGAGCA
ATAGCTTTTG CCGCTAACTG GCAATTCCTT GGTGGTCTCT ATGACACATG GGCTCCAGGC
GGTGGTGAGG TGAGATTAAT CACACCTACA ACTGACCCTG GTATCATCTT CGGCTACCTT
TTCCAGACCC CTTGGGGAGG AGGAGGAAAT ATGGTGGGAG TTAATTCAGT TGAAGATATT
GTTGGTGGCC ATTACTACCT AGGAATCATC GAATTAATAG GTGGTCATTT TCACATGCAA
ACCAAACCTT TTGGATGGGC GAGAAGAGCC TTCATTTGGA ATGGCGAAGC TTTACTTAGC
TATGCCCTAG GTGGGTTATG TGTAGCAAGC TTCTACGCAT CTGTATTTGT TTGGTTCAAC
AACACTGCCT ATCCTTCAGA GTTTTACGGA CCAACAAACG CTGAAGCCTC TCAAGCGCAA
AGCTTCACAT TCCTTGTCAG AGACCAGCGG ATAGGTGCAA ACGTTGGAAC CACAATGGGT
CCTACTGGTT TAGGTAAGTA TTTAATGCGT TCTCCAACTG GTGAAATCAT CTTTGGCGGA
GAAACAATGC GTTTTTGGGA CTTCCGTGGT CCTTGGCTTG AACCATTAAG GGGTGAAAAC
GGTCTAAGTT TAGATAAAGT TCAAAATGAT ATACAGCCTT GGCAAGTTCG TCGAGCTGCT
GAATACATGA CGCACGCTCC TAACGCTTCT ATCAACTCAG TTGGTGGAAT CATTACTGAG
CCTAATGCAG TTAACTTTGT AAATCTCCGT CAGTGGCTTG CATCAGCCCA CTTCTTCTTA
GGATGGTTTA CATTTGTTGG CCACCTATGG CATGCAGGAA GAGCCAGAGC TGCTGCGGGC
GGCTTTGAAA AAGGTATCGA TCGCAAGAAC GAGCCTGCCC TTTCAATGCC TGATTTGGAC
TAA
 
Protein sequence
METPFNNLLK APNQSIEETG YAWYVGNARL INLSGRLLGA HIAHTGLMVF WAGAMMLYEV 
SHFTFDKPMW EQGLILLPHV AMFGYGIGPG GEVVDVMPYF QAGVVHLVAS AILGFGGIYH
ALAGPEKLEE EFPFFSTDWR DKDQMTTILG RHLCVLGLGA IAFAANWQFL GGLYDTWAPG
GGEVRLITPT TDPGIIFGYL FQTPWGGGGN MVGVNSVEDI VGGHYYLGII ELIGGHFHMQ
TKPFGWARRA FIWNGEALLS YALGGLCVAS FYASVFVWFN NTAYPSEFYG PTNAEASQAQ
SFTFLVRDQR IGANVGTTMG PTGLGKYLMR SPTGEIIFGG ETMRFWDFRG PWLEPLRGEN
GLSLDKVQND IQPWQVRRAA EYMTHAPNAS INSVGGIITE PNAVNFVNLR QWLASAHFFL
GWFTFVGHLW HAGRARAAAG GFEKGIDRKN EPALSMPDLD