Gene P9515_03471 details


Gene Information

Locus tag: P9515_03471
Symbol: psbB
ID: 4718706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9515
Kingdom: Bacteria
Replicon accession: NC_008817
Strand:
Start bp: 320428
End bp: 321951
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 42%
IMG OID: 640080014
Product: photosystem II PsbB protein (CP47)
Protein accession: YP_001010663
Protein GI: 123965582
COG category:
COG ID:
TIGRFAM ID: [TIGR03039] photosystem II chlorophyll-binding protein CP47
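
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates (the record does not state its convention) and a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming exactly one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(320428, 321951)
print(length_bp)                  # 1524
print(protein_length(length_bp))  # 507
```

Under these assumptions both values agree with the Gene Length and Protein Length fields listed in the record.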


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGATTGC CTTGGTATCG AGTTCACACA GTAGTTATCA ACGACCCCGG TCGACTACTA 
GCTGTGCATC TCATGCATAC TGCATTATTA GCCGGCTGGG CCGGCTCAAT GGCTCTATAT
GAATTAGCCA TTTTTGATCC TTCAGACGCT GTTCTCAATC CAATGTGGAG ACAGGGGATG
TATGTCATGC CATTCATGGC AAGATTAGGA ATCACTAGTA GTTGGAACGG ATGGGATATT
ACTGGTGCTA CAGGAGTTGA CCCAGGATTC TGGAGCTTTG AAGGTGTTGC TGCAGCCCAC
ATCGTTTTTA GTGGACTGCT AATGCTTGCT TCGATCTGGC ATTGGACATA TTGGGATTTA
GATCTATGGG AAGACGAAAG AACAGGAGAA CCTGCTCTTG ATCTTCCTAG AATATTTGGT
ATTCACCTTC TTTTAGCTGG AATTACTTGT TTTGGATTTG GAGCTTTTCA CTGTGCAAAT
GTAGGCATTT GGGTTTCTGA CCCATATGGT TTAACAGGTC ATGTAGAACC CGTTGCACCA
TCATGGGGAG CCGATGGATT TAATCCGTTC AATCCTGGTG GTATAGTTGC AAATCACATT
GCAGCCGGGC TTCTTGGAAT TATAGGTGGA ATTTTCCATA TTACTAATAG ACCAGGTGAG
AGGCTCTACA AAGCATTAAG GTTAGGAAGC TTAGAGGGTG TTTTAGCAAG CGCTTTAGCT
GCTGTTCTTT TTGTATCTTT TGTTGTCGCT GGAACAATGT GGTATGGATC TGCAACAACG
CCTGTAGAAT TATTTGGCCC TACAAGATAC CAATGGGACT CTGGTTACTT CAAAACTGAA
ATTAATAGAA GAGTACAAGC AGCTATTGAT GACGGAGCGA CTAGGGAAGA AGCATATGCT
GCAATTCCTG AAAAGTTAGC TTTCTACGAC TATGTAGGAA ATAGTCCTGC AAAAGGAGGA
TTGTTTAGGG TTGGAGCTCT TGTAAATGGA GATGGATTAC CCACAGGATG GCAGGGGCAT
ACTGTATTCA CAGATAAAGA AGGCAATGAC TTAGAAGTCA GAAGAATTCC TAATTTCTTT
GAAAACTTCC CCGTTATTCT CGAAGATAAA CAGGGTAATG TAAGAGCTGA TATCCCATTT
AGAAGAGCTG AAGCAAAGTA TTCTTTTGAA CAAACTGGCA TCACAGCTAC AATTTATGGT
GGTGATCTTA ATGGTCAAAC ATTTACAGAC CCAGCTGTGG TTAAAAGACT AGCCCGTAAA
GCTCAGTTAG GAGAGGCATT CAAGTTTGAT AGAGAAACCT ATAAATCTGA TGGTGTTTTC
CGTAGTTCTC CAAGAGCTTG GTTTACATAT GCACATTTAT GTTTCGGATT ACTATTCCTA
TTTGGACACT GGTGGCACGC CTCTAGAACC CTCTATAAAG ATAGATTCGC TGGTATTGAC
GCTGAGATAG GAGATCAAGT TGAGTTTGGT CTCTTTAAGA AACTTGGTGA CGAAACCACC
AGAAGAATCC CAGGAAGGGT TTAA
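
The gene sequence is laid out in blocks of 10 bases, so spaces and line breaks have to be removed before it can be analysed. The Python sketch below shows one way to do that and to recompute a GC fraction. The function names are hypothetical, and only the first line of the sequence is used as sample input, so the printed value need not equal the whole-gene GC content given in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGGATTGC CTTGGTATCG AGTTCACACA GTAGTTATCA ACGACCCCGG TCGACTACTA"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.1%}")  # GC fraction of this excerpt only
```

Passing the full gene sequence instead of the excerpt would give a value comparable to the GC content field.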
 
Protein sequence
MGLPWYRVHT VVINDPGRLL AVHLMHTALL AGWAGSMALY ELAIFDPSDA VLNPMWRQGM 
YVMPFMARLG ITSSWNGWDI TGATGVDPGF WSFEGVAAAH IVFSGLLMLA SIWHWTYWDL
DLWEDERTGE PALDLPRIFG IHLLLAGITC FGFGAFHCAN VGIWVSDPYG LTGHVEPVAP
SWGADGFNPF NPGGIVANHI AAGLLGIIGG IFHITNRPGE RLYKALRLGS LEGVLASALA
AVLFVSFVVA GTMWYGSATT PVELFGPTRY QWDSGYFKTE INRRVQAAID DGATREEAYA
AIPEKLAFYD YVGNSPAKGG LFRVGALVNG DGLPTGWQGH TVFTDKEGND LEVRRIPNFF
ENFPVILEDK QGNVRADIPF RRAEAKYSFE QTGITATIYG GDLNGQTFTD PAVVKRLARK
AQLGEAFKFD RETYKSDGVF RSSPRAWFTY AHLCFGLLFL FGHWWHASRT LYKDRFAGID
AEIGDQVEFG LFKKLGDETT RRIPGRV
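
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal Python example that assumes translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted). It translates only the first 60 bases of the gene, and the helper names are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
gene_start = "ATGGGATTGC CTTGGTATCG AGTTCACACA GTAGTTATCA ACGACCCCGG TCGACTACTA"
print(translate(gene_start))  # MGLPWYRVHTVVINDPGRLL
```

The output matches the first 20 residues of the protein sequence listed above.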