Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_04051 |
Symbol | psbB |
ID | 4781127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 373317 |
End bp | 374840 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640083674 |
Product | photosystem II PsbB protein (CP47) |
Protein accession | YP_001014234 |
Protein GI | 124025118 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03039] photosystem II chlorophyll-binding protein CP47 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.644071 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATTGC CCTGGTATCG GGTGCACACA GTCGTTATTA ACGACCCTGG CCGTCTCTTG GCCGTGCACC TCATGCATAC TGCTTTGTTA GCCGGCTGGG CCGGCTCAAT GGCCTTATAT GAATTGGCCA TTTTTGATCC TTCTGATCCA GTACTTAATC CAATGTGGCG CCAAGGCATG TATGTCATGC CCTTTATGGC CCGCCTAGGA ATTACAGGAA GCTGGAACGG ATGGGATATA ACTGGAGCAA CCGGAGTTGA CCCCGGTTTC TGGAGCTTCG AAGGTGTTGC TGCAGCACAT ATTGTTTTCA GTGGTCTCCT TATGCTCGCT GCCGTATGGC ACTGGACATA TTGGGACCTT GAATTGTGGG AAGATTCCCG CACTGGAGAG CCAGCTTTAG ATCTTCCAAG AATATTTGGA ATTCATCTTT TCTTAGCCGG CCTGACATGC TTTGGCTTTG GCGCATTTCA TTGTTCATCA GTAGGCATAT GGGTTTCTGA TCCATATGGA TTAACTGGGC ATGTAGAAAA AGTTGCTCCA GTCTGGGGTG CAGAAGGATT TAATCCATTC AATGCTGGAG GAATTGTCGC CAATCATGTT GGGGCAGGAC TTTTAGGAAT AATTGGTGGA ATATTCCATA TCACCAATAG ACCTGGAGAA AGGCTCTATA GGGCTCTAAA GCTAGGAAGT CTTGAAGGCG TTCTAGCAAG TGCTTTGGCA GCTGTTCTTT TTGTATCCTT CGTAGTTGCT GGAACAATGT GGTATGGATC AGCAACAACT CCAGTTGAAT TATTTGGTCC TACTAGATAT CAATGGGATT CTGGTTATTT CAAAACTGAA ATCAACCGCA GAGTTCAAGA GTCTATGAAT GATGGTGCAT CGAAAGCTGA GGCTTATGCA TCGATTCCTG AGCAACTTGC TTTCTACGAC TATGTAGGTA ATAGTCCCGC CAAAGGAGGA CTATTTAGAG TTGGTGCAAT GGTTAATGGA GATGGAGTTC CGAGTGGATG GCAAGGTCAC ATCTCATTTA CTGATAGCGA TGGCAATGAC TTAGAAGTAA GAAGAATTCC AAATTTCTTT GAGAACTTCC CTGTCATTCT TGAAGACAAG GATGGGAACG TTAAAGCTGA CATTCCTTTC CGTCGAGCCG AAGCTAAGTA CTCCATTGAA CAAACTGGAG TCACAGCTAC TGTTTATGGT GGTGAATTAA ATGGACAAAC ATTTACTGAT CCAGTAATAG TTAAACGTTT AGCTAGAAAA TCTCAGCTTG GAGAGGCCTT CAAGTTTGAC AGAGATAGAT ACAAGTCAGA TGGAGTATTC AGAAGCTCTC CAAGAGCCTG GTTCACTTAT GCTCACTTGT GTTTTGGCTT ACTATTCCTC TTCGGGCACT GGTGGCACGC AGCTAGGACT CTCTTTGGCG ACAGATTCTC AGGTATCGAT CCAGAGCTTG GGGATCAGGT TGAGTTTGGT CTCTTCAAAA AACTTGGAGA CGAATCTACC CGCCGAGTTC CTGGTCGAGC CTAA
|
Protein sequence | MGLPWYRVHT VVINDPGRLL AVHLMHTALL AGWAGSMALY ELAIFDPSDP VLNPMWRQGM YVMPFMARLG ITGSWNGWDI TGATGVDPGF WSFEGVAAAH IVFSGLLMLA AVWHWTYWDL ELWEDSRTGE PALDLPRIFG IHLFLAGLTC FGFGAFHCSS VGIWVSDPYG LTGHVEKVAP VWGAEGFNPF NAGGIVANHV GAGLLGIIGG IFHITNRPGE RLYRALKLGS LEGVLASALA AVLFVSFVVA GTMWYGSATT PVELFGPTRY QWDSGYFKTE INRRVQESMN DGASKAEAYA SIPEQLAFYD YVGNSPAKGG LFRVGAMVNG DGVPSGWQGH ISFTDSDGND LEVRRIPNFF ENFPVILEDK DGNVKADIPF RRAEAKYSIE QTGVTATVYG GELNGQTFTD PVIVKRLARK SQLGEAFKFD RDRYKSDGVF RSSPRAWFTY AHLCFGLLFL FGHWWHAART LFGDRFSGID PELGDQVEFG LFKKLGDEST RRVPGRA
|
| |