Gene Information |
Locus tag | P9303_22121 |
Symbol | psbB |
ID | 4778041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1963389 |
End bp | 1964921 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640087728 |
Product | photosystem II PsbB protein (CP47) |
Protein accession | YP_001018212 |
Protein GI | 124023905 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03039] photosystem II chlorophyll-binding protein CP47 |
|
|
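The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the reported protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop else codons


# Values taken from the record above.
start_bp, end_bp = 1963389, 1964921
length = gene_length(start_bp, end_bp)
print(length)                  # 1533
print(protein_length(length))  # 510
```

Both results agree with the Gene Length (1533 bp) and Protein Length (510 aa) fields of the record.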
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATTGC CCTGGTATCG GGTGCACACG GTCGTTATCA ACGACCCCGG CCGACTTCTG GCCGTGCACC TCATGCACAC AGCCCTGTTA GCCGGCTGGG CCGGCTCCAT GGCCCTCTAC GAATTAGCCA TCTTCGATCC TTCGGATCCC GTCCTCAACC CCATGTGGCG GCAAGGCATG TATGTCATGC CGTTCATGAC CCGCTGCGGA ATTACTGGCA GCTGGGGGGG CTGGAGCATC ACCGGTGAAA CCGGTGTTGA TCCAGGCCTC TGGAGCTTCG AAGGTGTTGC TGCCGCTCAC ATCATCTTCA GTGGCCTATT AATGCTGGCC GCAATCTGGC ATTGGACCTA CTGGGATCTT GATCTTTGGC TAGATCCACG CACTCAAGAG CCTGCTTTAG ACCTTCCAAA AATCTTCGGC ATCCACCTGA CACTTGCAGG TGCGGTTTGC TTCGGCTTTG GAGCCTTCCA TCTATCTGGA CTTTATGGTC CAGGGATGTG GGTCTCTGAT TCCTATGGAC TAACAGGTCA TATGGAAAAC GTCGCTCCTG AATGGGGAGC GGCTGGCTTC AATCCATTTA GTCCAGGCGG CATCGTGGCG AACCACATTG CTGCAGGAAT CTTTGGATTC CTTGGTGGCC ATTTCCAAAT GCTCAACCGC CCACCCGAAC GGCTTTACAA AGCCCTTCGA ATGGGCAATA TCGAAACAGT CTTAGCCAGT GGCTGCGCGG CTGTTTGCGC TATCGCCTTC ATCGTTTCAG GAACGATGTG GTATGGATCT GCTGCCACAC CTGTTGAACT GTTTGGACCT ACCCGTTATC AGTGGGACCA AAATTTCTAC AGAACTGAAA TCAACCGTCG TGTCCAATCA GCCATGGATG ACGGCGCTAC TCAAGAGGCG GCTTATGCCG CTATCCCTGA GAAGCTTGCC TTCTACGACT ATGTAGGTAA CAGCCCTGCA AAGGGTGGTC TATTTAGAGT TGGCGCCATG GTGAATGGCG ACGGCCTTGC TACTGGCTGG CTCGGTCACA TCTCTTTCCA AGACAGGGCA GGCAATGATC TCCAGGTCCG CCGGATCCCG AATTTCTTCG AGAACTTCCC TGTATTACTT GAAGATCAGA ACGGCGTCGT TCGTGCTGAC ATCCCCTTCC GCCGTGCTGA AGCAAAGAAT TCCTTTGAGC AGCAAGGCGT GACGGCAACC ATCTATGGCG GTTCCATGGA TGGGAAAACC TTCACTGATA CCGCTGATGT GAAGCGTCTG GCTCGCAAGG CTCAACTTGG TGAAGCCTTT ACTTTCGACC GCGAGACCTA TGCTTCTGAT GGTGTTTTCC GAAGCTCACC TCGAGGCTGG TTCACCTTTG CTCACGTCAA CTTTGCGCTC CTGTTCCTGT TCGGCCACTG GTGGCATGCA GCTAGGACCT TGTACCGCGA TGTGTTTGCC GGTATCGATC CTGACCTTGG CGACCAAGTT GAATTCGGTG TTTTCCAAAA GTTGGGCGAC GCTTCTACTC GTCGCGTTCC AGGGCAAACT TAA
|
Protein sequence | MGLPWYRVHT VVINDPGRLL AVHLMHTALL AGWAGSMALY ELAIFDPSDP VLNPMWRQGM YVMPFMTRCG ITGSWGGWSI TGETGVDPGL WSFEGVAAAH IIFSGLLMLA AIWHWTYWDL DLWLDPRTQE PALDLPKIFG IHLTLAGAVC FGFGAFHLSG LYGPGMWVSD SYGLTGHMEN VAPEWGAAGF NPFSPGGIVA NHIAAGIFGF LGGHFQMLNR PPERLYKALR MGNIETVLAS GCAAVCAIAF IVSGTMWYGS AATPVELFGP TRYQWDQNFY RTEINRRVQS AMDDGATQEA AYAAIPEKLA FYDYVGNSPA KGGLFRVGAM VNGDGLATGW LGHISFQDRA GNDLQVRRIP NFFENFPVLL EDQNGVVRAD IPFRRAEAKN SFEQQGVTAT IYGGSMDGKT FTDTADVKRL ARKAQLGEAF TFDRETYASD GVFRSSPRGW FTFAHVNFAL LFLFGHWWHA ARTLYRDVFA GIDPDLGDQV EFGVFQKLGD ASTRRVPGQT
|
| |
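The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by computing its GC content. The following is a minimal sketch, not the database's own method: `clean`, `translate` and `gc_percent` are hypothetical helper names, and the codon table is built from the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs only in its permitted start codons). The spaces that split the sequence into blocks of ten are stripped before processing.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling through T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from the record above.
prefix = "ATGGGATTGC CCTGGTATCG GGTGCACACG"
print(translate(prefix))  # MGLPWYRVHT
```

The translated prefix matches the first block of the protein sequence in the record. Passing the complete Gene sequence field to `translate` and `gc_percent` in the same way allows the full Protein sequence and the GC content field to be compared against the record.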