Gene P9301_03401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_03401 
SymbolpsbB 
ID4912483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp310621 
End bp312144 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content42% 
IMG OID640159910 
Productphotosystem II PsbB protein (CP47) 
Protein accessionYP_001090564 
Protein GI126695678 
COG category 
COG ID 
TIGRFAM ID[TIGR03039] photosystem II chlorophyll-binding protein CP47 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATTGC CTTGGTATCG AGTTCACACA GTAGTTATTA ATGACCCAGG TCGACTACTT 
GCTGTGCATC TTATGCATAC TGCATTATTA GCCGGCTGGG CCGGTTCAAT GGCTCTTTAC
GAATTAGCCA TTTTTGATCC TTCTGATGCT GTTCTCAATC CAATGTGGAG ACAGGGGATG
TACGTTATGC CTTTTATGGC AAGACTAGGT ATCACAAGTA GTTGGAACGG ATGGGATATT
ACCGGTGCTA CTGGAGTTGA TCCTGGATTC TGGAGTTTCG AAGGGGTTGC CGCAGCTCAC
ATAGTATTTA GTGGTCTATT AATGTTGGCC TCTATTTGGC ACTGGACATA CTGGGACTTA
GATTTGTGGG AAGATTCAAG AACTGGTGAA CCTGCTCTTG ACTTGCCAAG AATTTTCGGG
ATTCACCTCC TTCTAGCAGG ACTAACCTGT TTTGGTTTTG GAGCTTTTCA TTGTGCAAAC
GTTGGGATTT GGGTTTCTGA CCCTTATGGC TTAACTGGTC ACGTAGAACC TGTGGCTCCA
TCCTGGGGAG TAGAAGGATT TAATCCTTTT AATCCTGGAG GTATAGTGGC GAACCATATT
GCAGCAGGAC TTATGGGTAT TATTGGAGGT ATTTTTCATA TCACCAATAG ACCTGGAGAA
AGACTTTATA GAGCACTAAA ACTTGGAAGT CTCGAGGGAG TTCTAGCTAG TGCTTTGGCT
GCTGTATTAT TTGTTTCTTT CGTTGTTTCC GGAACAATGT GGTACGGTTC AGCAACAACT
CCGGTAGAGC TTTTTGGTCC TACCAGATAT CAATGGGATT CAGGCTATTT CAAAACTGAA
ATCAATAGAA GAGTGCAAGC TGCTATAGAT GATGGTGCCA CTAAATCAGA GGCATATGCA
TCGATTCCAG AAAAATTAGC CTTCTACGAT TACGTTGGAA ATAGTCCAGC TAAAGGAGGA
CTATTTAGAG TTGGAGCTCT TGTTAATGGT GATGGATTAC CAACTGGTTG GCAAGGTCAC
ATTGCTTTTC AAGATAAGGA AGGTAACGAA TTAGAAGTTA GAAGAATTCC TAATTTCTTT
GAAAACTTCC CTGTCATTCT TGAAGACAAA GAAGGTAATG TAAGAGCAGA TATCCCATTT
AGAAGAGCTG AAGCAAAGTA TTCATTCGAA CAGACTGGTA TAACTGCAAC TATCTATGGA
GGAGACCTAG ATGGACAAAC ATTTACAGAC CCTGCAGTAG TTAAAAGGTT AGCTAGAAAA
GCTCAACTTG GAGAAGCATT CAAGTTTGAC AGAGAAACAT ATAAATCTGA TGGCGTATTC
CGAAGTTCTC CAAGAGCCTG GTTTACATAT GCACATTTAT GTTTCGGATT GCTATTCTTA
TTTGGTCACT GGTGGCATGC TTCAAGAACT CTTTACAGAA ATTCCTTTGC TGGTATTGAT
GCTGAGATTG GAGACCAAGT TGAATTTGGT TTATTCAAGA AACTTGGTGA CGAAACCACA
AGAAGAATCC CAGGAAGGGT TTAA
 
Protein sequence
MGLPWYRVHT VVINDPGRLL AVHLMHTALL AGWAGSMALY ELAIFDPSDA VLNPMWRQGM 
YVMPFMARLG ITSSWNGWDI TGATGVDPGF WSFEGVAAAH IVFSGLLMLA SIWHWTYWDL
DLWEDSRTGE PALDLPRIFG IHLLLAGLTC FGFGAFHCAN VGIWVSDPYG LTGHVEPVAP
SWGVEGFNPF NPGGIVANHI AAGLMGIIGG IFHITNRPGE RLYRALKLGS LEGVLASALA
AVLFVSFVVS GTMWYGSATT PVELFGPTRY QWDSGYFKTE INRRVQAAID DGATKSEAYA
SIPEKLAFYD YVGNSPAKGG LFRVGALVNG DGLPTGWQGH IAFQDKEGNE LEVRRIPNFF
ENFPVILEDK EGNVRADIPF RRAEAKYSFE QTGITATIYG GDLDGQTFTD PAVVKRLARK
AQLGEAFKFD RETYKSDGVF RSSPRAWFTY AHLCFGLLFL FGHWWHASRT LYRNSFAGID
AEIGDQVEFG LFKKLGDETT RRIPGRV