Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_03391 |
Symbol | psbB |
ID | 4717027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 311035 |
End bp | 312558 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640078042 |
Product | photosystem II PsbB protein (CP47) |
Protein accession | YP_001008734 |
Protein GI | 123967876 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03039] photosystem II chlorophyll-binding protein CP47 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0452672 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGATTGC CTTGGTATCG AGTTCACACA GTAGTTATTA ATGACCCAGG TCGACTACTT GCTGTGCATC TTATGCATAC TGCATTATTA GCCGGCTGGG CCGGTTCAAT GGCTCTTTAT GAATTAGCCA TTTTTGATCC TTCTGATGCT GTTCTCAATC CAATGTGGAG ACAGGGGATG TACGTTATGC CTTTCATGGC AAGACTAGGT ATCACAAGTA GTTGGAATGG ATGGGATATT ACTGGTGCTA CTGGAGTTGA TCCTGGATTC TGGAGTTTCG AAGGGGTTGC GGCAGCACAC ATAGTATTTA GTGGCCTATT GATGTTGGCA TCTATTTGGC ACTGGACATA TTGGGACCTA GATTTATGGG AAGATTCAAG AACTGGTGAG CCTGCTCTTG ATTTACCAAG AATTTTTGGA ATTCACCTTC TTTTAGCAGG CCTTACATGT TTTGGTTTTG GAGCTTTTCA TTGCGCAAAC GTTGGGATTT GGGTTTCTGA TCCATATGGT TTAAGTGGTC ATGTAGAACC TGTAGCACCT TCCTGGGGAG TTGAAGGATT TAATCCTTTT AATCCAGGAG GTATTGTGGC TAATCATATT GCAGCTGGAC TTATGGGTAT TATTGGAGGT ATTTTTCACA TTACCAACAG GCCTGGAGAA AGACTTTACA GAGCATTAAA ACTTGGAAGT CTCGAAGGAG TTCTTGCTAG TGCTTTAGCT GCTGTATTGT TTGTTTCTTT CGTTGTTTCC GGAACAATGT GGTACGGTTC AGCAACCACC CCAATAGAGC TTTTTGGTCC TACTAGATAT CAATGGGATT CAGGCTATTT CAAAACTGAA ATTAACAGAA GAGTGCAAGC TGCTATTGAT GATGGTGCAA CTAAATCAGA GGCATATGCA TCTATACCAG AAAAATTAGC CTTCTACGAT TATGTCGGTA ATAGTCCAGC TAAAGGGGGA CTATTCAGAG TTGGAGCTCT TGTTAATGGT GATGGCTTAC CAACTGGTTG GCAAGGTCAC ATTGCTTTCC AAGATAAGGA AGGTAACGAA TTAGAAGTTA GAAGAATTCC TAACTTCTTT GAAAACTTCC CTGTCATTCT TGAAGATAAA GAAGGTAATG TAAGGGCAGA TATCCCATTT AGAAGAGCTG AAGCAAAGTA CTCATTTGAG CAAACTGGTA TTACTGCAAC AATTTATGGA GGAGACCTAG ATGGACAAAC ATTTACAGAT CCTGCAGTAG TGAAAAGATT AGCAAGAAAG GCTCAACTTG GAGAAGCATT CAAGTTTGAC AGAGAGACAT ATAAATCTGA CGGTGTATTC CGAAGTTCTC CAAGAGCATG GTTTACATAT GCACATTTAT GTTTCGGATT GCTATTCTTG TTTGGGCACT GGTGGCACGC TTCAAGAACT CTTTACAGAA ATTCCTTTGC TGGTATTGAT GCTGAAATTG GCGACCAAGT TGAATTTGGT TTATTCAAAA AGCTTGGAGA CGAAACCACA AGAAGAATCC CTGGAAGGGT TTAA
|
Protein sequence | MGLPWYRVHT VVINDPGRLL AVHLMHTALL AGWAGSMALY ELAIFDPSDA VLNPMWRQGM YVMPFMARLG ITSSWNGWDI TGATGVDPGF WSFEGVAAAH IVFSGLLMLA SIWHWTYWDL DLWEDSRTGE PALDLPRIFG IHLLLAGLTC FGFGAFHCAN VGIWVSDPYG LSGHVEPVAP SWGVEGFNPF NPGGIVANHI AAGLMGIIGG IFHITNRPGE RLYRALKLGS LEGVLASALA AVLFVSFVVS GTMWYGSATT PIELFGPTRY QWDSGYFKTE INRRVQAAID DGATKSEAYA SIPEKLAFYD YVGNSPAKGG LFRVGALVNG DGLPTGWQGH IAFQDKEGNE LEVRRIPNFF ENFPVILEDK EGNVRADIPF RRAEAKYSFE QTGITATIYG GDLDGQTFTD PAVVKRLARK AQLGEAFKFD RETYKSDGVF RSSPRAWFTY AHLCFGLLFL FGHWWHASRT LYRNSFAGID AEIGDQVEFG LFKKLGDETT RRIPGRV
|
| |