Gene Information |
Locus tag | P9211_03501 |
Symbol | psbB |
ID | 5730768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 329417 |
End bp | 330973 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641284698 |
Product | photosystem II PsbB protein (CP47) |
Protein accession | YP_001550235 |
Protein GI | 159902891 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03039] photosystem II chlorophyll-binding protein CP47 |
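
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal consistency check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. Neither convention is stated in the record, although both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 329417
END_BP = 330973
GENE_LENGTH_BP = 1557
PROTEIN_LENGTH_AA = 518


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The strand field does not affect these checks, because the length depends only on the span between the two coordinates.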
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.234957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATTGC CCTGGTATCG GGTGCACACT GTCGTTATTA ACGATCCTGG CCGACTCTTG GCCGTGCACC TCATGCACAC TGCTTTGCTA GCCGGCTGGG CCGGCTCCAT GGCCCTATAT GAATTGGCCA TATTTGATCC TTCTGATCCA GTCTTGAACC CTATGTGGCG CCAAGGCATG TATGTCATGC CGTTCATGGC CCGCTTAGGT GTCACAAGCA GTTGGAAGGG TTGGGACATC ACTGGAGGTG TCGGCTCATT CGACTTTGAT TCATTAGGGT TCTGGGGAAA AGCTCTACCT TATTCAACTT TTGAAGGGGT CGCTGTAGCA CATATCCTCT TCAGTGGCCT ATTAATGCTT GCTGCCATCT GGCATTGGAC CTATTGGGAT CTAGAGCTCT GGGAAGACTC CCGAACAGGA GAGCCAGCCT TAGATCTACC AAGAATTTTC GGAATCCATT TACTTCTTGC AGGGCTTACT TGCTTTGGTT TTGGCGCATT CCATCTATCA GCCGTTGGCA TGTGGGTCTC AGACTCATAT GGCCTAGGAG GTCACGTAGA AAAAGTCGCT CCTGTTTGGG GTGCAGACGG CTTCAACCCC TTTAGTGCTG GAGGAATTGT CGCTAACCAT ATTGGAGCTG GTCTTTTAGG AATTATTGGT GGAGTTTTCC ATATCACCAA CCGTCCTGGA GAAAGGCTCT ATAGGAATTT AAGGATGGGA AGCCTAGAAG GTGTTCTCGC AAGTGCGCTT GCTGCAGTCC TATTTGTCTC GTTTGTGGTT GCTGGAACTA TGTGGTATGG ATCTGCTACC ACACCAATTG AGCTATTTGG TCCTACCAGA TACCAATGGG ATTCTGGGTA CTTCAAAACT GAAATAAATA GAAGGGTGCA AGCCTCTATA AATGAGGGTG CTTCTAAAGA AGAAGCTTAT GCAGCAATCC CTGAGAAGCT AGCTTTCTAT GACTATGTAG GAAATAGCCC TGCAAAAGGA GGATTATTTA GAGCTGGTGC TCTAGTTAAT GGCGATGGTG TCCCAACTGG CTGGCAAGGT CACGTTTCAT TCTCAGATAA AGAAGGAAAT GAGCTTGAAG TCAGAAGAAT GCCAAACTTC TTTGAGAACT TCCCAGTAAT TCTTGAAGAC AAAGATGGAA ATGTCAGAGC TGACATTCCA TTCCGTAGAG CAGAAGCCAA GTATTCCTTT GAACAAACAG GCATTACTGC AACAGTTTAT GGTGGTGAAC TAAGTGGGCA AACCTTTAGT GACCCTGTAG TTGTTAAGCG TCTTGCTCGC AAAGCACAAC TTGGTGAGTC CTTTAAGTTC GATAGAGATC GCTACAAATC AGATGGGGTC TTCCGAAGTG GCCCAAGAGC ATGGTTTACT TATGCTCATG CTTGCTTTGG GTTGCTCTAC TTATTTGGGC ACTGGTGGCA TGCTGCCAGA ACTCTATATC GAGATACCTT TGCTGGAATT GATCCAGACC TTGGCGACCA GGTCGAGTTT GGTCTCTTCA AGAAACTTGG AGATGAATCC ACACGACGCG TCCCAGGGCG TGCTTAA
|
Protein sequence | MGLPWYRVHT VVINDPGRLL AVHLMHTALL AGWAGSMALY ELAIFDPSDP VLNPMWRQGM YVMPFMARLG VTSSWKGWDI TGGVGSFDFD SLGFWGKALP YSTFEGVAVA HILFSGLLML AAIWHWTYWD LELWEDSRTG EPALDLPRIF GIHLLLAGLT CFGFGAFHLS AVGMWVSDSY GLGGHVEKVA PVWGADGFNP FSAGGIVANH IGAGLLGIIG GVFHITNRPG ERLYRNLRMG SLEGVLASAL AAVLFVSFVV AGTMWYGSAT TPIELFGPTR YQWDSGYFKT EINRRVQASI NEGASKEEAY AAIPEKLAFY DYVGNSPAKG GLFRAGALVN GDGVPTGWQG HVSFSDKEGN ELEVRRMPNF FENFPVILED KDGNVRADIP FRRAEAKYSF EQTGITATVY GGELSGQTFS DPVVVKRLAR KAQLGESFKF DRDRYKSDGV FRSGPRAWFT YAHACFGLLY LFGHWWHAAR TLYRDTFAGI DPDLGDQVEF GLFKKLGDES TRRVPGRA
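
The gene sequence is shown in blocks of 10 bases and is already in coding orientation: it begins with ATG and ends with TAA, although the gene lies on the minus strand. The sketch below is one way to strip the block spacing, compute a GC fraction, and translate the result. It is an illustration under stated assumptions, not the database's own pipeline. It uses the standard codon-to-amino-acid mapping, which translation table 11 shares with table 1 (the two tables differ only in which codons may act as start codons). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
# Build the standard codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean_sequence(text: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
prefix = clean_sequence("ATGGGATTGC CCTGGTATCG GGTGCACACT")
assert translate(prefix) == "MGLPWYRVHT"  # first block of the protein sequence
```

Passing the full gene sequence through `clean_sequence` and then `translate` should reproduce the protein sequence once its block spacing is removed in the same way. `gc_fraction` on the cleaned gene sequence can be compared with the GC content field.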