Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_12331 |
Symbol | psbD |
ID | 5730366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1108607 |
End bp | 1109683 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641285601 |
Product | photosystem II PsbD protein (D2) |
Protein accession | YP_001551118 |
Protein GI | 159903774 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01151] photosystem II, DI subunit (also called Q(B)) [TIGR01152] Photosystem II, DII subunit (also called Q(A)) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.153207 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATAG CTGTTGGTGG CGCCCAAGAA AGAGGATGGT TTGACGTCCT TGATGACTGG CTTAAGCGCG ACCGATTCGT ATTTGTTGGT TGGTCTGGTC TTCTACTTTT TCCTACTGCC TATTTGGCAA TAGGGGGGTG GTTCTTAGGC ACAACCTTTG TTACTTCCTG GTACACACAT GGTGTAGCCA GCTCTTACCT AGAAGGTTGT AATTTTCTTA CAGCTGCAGT TAGCACCCCT GGTGATGCCA TGGGTCATAG CCTTCTATTC CTTTGGGGAC CTGAGGCACA AGGCAGTCTT GTTCGTTGGT TACAACTTGG CGGTCTATGG AACTTTGTAG TTCTCCATGG AATATTTAGC CTTATAGGCT TCATGCTTCG TCAATTTGAA ATCGCAAGAC TTGTTGGAAT TCGTCCCTAC AACGCTCTTG CATTTTCTGC TGTAATTGCC GTTTATACAG CTTGCTTCCT TATATACCCA CTAGGTCAGC ACAGCTTTTT CTTTGCTCCT TCTTTTGGGG TAGCAGCAAT TTTCCGCTTC ATCCTCTTTA TTCAAGGTTT CCATAACATC ACTCTTAACC CATTTCACAT GATGGGAGTT GCGGGAATTC TTGGAGGTGC GCTTCTTTGT GCAATTCATG GAGCCACTGT GCAGAACACT CTGTATGAGG ACACGAGTAT TTATACGGGT GGCAAAGCTC AAAGTACTAC TTTTAGAGGT TTTGACCCAA CTCAAGAAGA AGAGACCTAC TCCATGGTTA CTGCTAATCG CTTTTGGAGT CAGATCTTTG GAATTGCCTT CTCAAACAAG CGCTTCCTTC ATTTCTTAAT GCTCTTCGTA CCTGTAATGG GTATGTGGTG CGCTGCTATT GGCATCGTTG GTTTAGCCCT AAACCTAAGG GCCTATGACT TTGTTAGCCA AGAAATTCGC GCTGCTGAAG ACCCTGAGTT TGAAACGTTC TATACCAAAA ACATTCTTTT AAATGAAGGT ATGAGAGCCT GGATGTCTTC TGTGGATCAG CCACACGAAA ACTTTGTATT CCCTGAGGAG GTACTCCCAC GTGGAAACGC CCTTTAA
|
Protein sequence | MTIAVGGAQE RGWFDVLDDW LKRDRFVFVG WSGLLLFPTA YLAIGGWFLG TTFVTSWYTH GVASSYLEGC NFLTAAVSTP GDAMGHSLLF LWGPEAQGSL VRWLQLGGLW NFVVLHGIFS LIGFMLRQFE IARLVGIRPY NALAFSAVIA VYTACFLIYP LGQHSFFFAP SFGVAAIFRF ILFIQGFHNI TLNPFHMMGV AGILGGALLC AIHGATVQNT LYEDTSIYTG GKAQSTTFRG FDPTQEEETY SMVTANRFWS QIFGIAFSNK RFLHFLMLFV PVMGMWCAAI GIVGLALNLR AYDFVSQEIR AAEDPEFETF YTKNILLNEG MRAWMSSVDQ PHENFVFPEE VLPRGNAL
|
| |