Gene P9303_22121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_22121 
SymbolpsbB 
ID4778041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1963389 
End bp1964921 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content54% 
IMG OID640087728 
Productphotosystem II PsbB protein (CP47) 
Protein accessionYP_001018212 
Protein GI124023905 
COG category 
COG ID 
TIGRFAM ID[TIGR03039] photosystem II chlorophyll-binding protein CP47 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATTGC CCTGGTATCG GGTGCACACG GTCGTTATCA ACGACCCCGG CCGACTTCTG 
GCCGTGCACC TCATGCACAC AGCCCTGTTA GCCGGCTGGG CCGGCTCCAT GGCCCTCTAC
GAATTAGCCA TCTTCGATCC TTCGGATCCC GTCCTCAACC CCATGTGGCG GCAAGGCATG
TATGTCATGC CGTTCATGAC CCGCTGCGGA ATTACTGGCA GCTGGGGGGG CTGGAGCATC
ACCGGTGAAA CCGGTGTTGA TCCAGGCCTC TGGAGCTTCG AAGGTGTTGC TGCCGCTCAC
ATCATCTTCA GTGGCCTATT AATGCTGGCC GCAATCTGGC ATTGGACCTA CTGGGATCTT
GATCTTTGGC TAGATCCACG CACTCAAGAG CCTGCTTTAG ACCTTCCAAA AATCTTCGGC
ATCCACCTGA CACTTGCAGG TGCGGTTTGC TTCGGCTTTG GAGCCTTCCA TCTATCTGGA
CTTTATGGTC CAGGGATGTG GGTCTCTGAT TCCTATGGAC TAACAGGTCA TATGGAAAAC
GTCGCTCCTG AATGGGGAGC GGCTGGCTTC AATCCATTTA GTCCAGGCGG CATCGTGGCG
AACCACATTG CTGCAGGAAT CTTTGGATTC CTTGGTGGCC ATTTCCAAAT GCTCAACCGC
CCACCCGAAC GGCTTTACAA AGCCCTTCGA ATGGGCAATA TCGAAACAGT CTTAGCCAGT
GGCTGCGCGG CTGTTTGCGC TATCGCCTTC ATCGTTTCAG GAACGATGTG GTATGGATCT
GCTGCCACAC CTGTTGAACT GTTTGGACCT ACCCGTTATC AGTGGGACCA AAATTTCTAC
AGAACTGAAA TCAACCGTCG TGTCCAATCA GCCATGGATG ACGGCGCTAC TCAAGAGGCG
GCTTATGCCG CTATCCCTGA GAAGCTTGCC TTCTACGACT ATGTAGGTAA CAGCCCTGCA
AAGGGTGGTC TATTTAGAGT TGGCGCCATG GTGAATGGCG ACGGCCTTGC TACTGGCTGG
CTCGGTCACA TCTCTTTCCA AGACAGGGCA GGCAATGATC TCCAGGTCCG CCGGATCCCG
AATTTCTTCG AGAACTTCCC TGTATTACTT GAAGATCAGA ACGGCGTCGT TCGTGCTGAC
ATCCCCTTCC GCCGTGCTGA AGCAAAGAAT TCCTTTGAGC AGCAAGGCGT GACGGCAACC
ATCTATGGCG GTTCCATGGA TGGGAAAACC TTCACTGATA CCGCTGATGT GAAGCGTCTG
GCTCGCAAGG CTCAACTTGG TGAAGCCTTT ACTTTCGACC GCGAGACCTA TGCTTCTGAT
GGTGTTTTCC GAAGCTCACC TCGAGGCTGG TTCACCTTTG CTCACGTCAA CTTTGCGCTC
CTGTTCCTGT TCGGCCACTG GTGGCATGCA GCTAGGACCT TGTACCGCGA TGTGTTTGCC
GGTATCGATC CTGACCTTGG CGACCAAGTT GAATTCGGTG TTTTCCAAAA GTTGGGCGAC
GCTTCTACTC GTCGCGTTCC AGGGCAAACT TAA
 
Protein sequence
MGLPWYRVHT VVINDPGRLL AVHLMHTALL AGWAGSMALY ELAIFDPSDP VLNPMWRQGM 
YVMPFMTRCG ITGSWGGWSI TGETGVDPGL WSFEGVAAAH IIFSGLLMLA AIWHWTYWDL
DLWLDPRTQE PALDLPKIFG IHLTLAGAVC FGFGAFHLSG LYGPGMWVSD SYGLTGHMEN
VAPEWGAAGF NPFSPGGIVA NHIAAGIFGF LGGHFQMLNR PPERLYKALR MGNIETVLAS
GCAAVCAIAF IVSGTMWYGS AATPVELFGP TRYQWDQNFY RTEINRRVQS AMDDGATQEA
AYAAIPEKLA FYDYVGNSPA KGGLFRVGAM VNGDGLATGW LGHISFQDRA GNDLQVRRIP
NFFENFPVLL EDQNGVVRAD IPFRRAEAKN SFEQQGVTAT IYGGSMDGKT FTDTADVKRL
ARKAQLGEAF TFDRETYASD GVFRSSPRGW FTFAHVNFAL LFLFGHWWHA ARTLYRDVFA
GIDPDLGDQV EFGVFQKLGD ASTRRVPGQT