Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_04091 |
Symbol | psbA |
ID | 4777705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 406385 |
End bp | 407461 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640085912 |
Product | photosystem II PsbA protein (D1) |
Protein accession | YP_001016426 |
Protein GI | 124022119 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01151] photosystem II, DI subunit (also called Q(B)) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCA CCATTCGCAG TGGTCGCCTC AGTAGCTGGG AATCCTTCTG CAATTGGGTC ACCTCCACCA ATAACCGCAT TTATGTGGGT TGGTTCGGTG TCCTGATGGT TCCCACGCTG TTGGCAGCTG CCATCTGCTT CACCATTGCC TTTATTGCGG CACCGCCGGT TGATATCGAC GGTATCCGTG AGCCTGTTGC TGGCTCTTTC CTCTACGGCA ATAACATCAT TTCTGGTGCT GTTGTTCCTT CCAGCAACGC CATCGGCCTG CACTTTTATC CCATTTGGGA AGCTGCTTCT GTTGATGAGT GGCTTTACAA CGGCGGTCCA TACCAGCTTG TTGTGTTCCA CTTCCTGATC GGCATCTGTT GCTGGTTGGG TCGCCAATGG GAACTCTCCT ATCGCTTGGG CATGCGCCCT TGGATCTGTG TGGCCTACAG CGCACCGCTG TCGGCAGCCT TTGCTGTGTT CCTGATCTAT CCAGTAGGTC AGGGTTCCTT CTCGGATGGC ATGCCTTTAG GCATCTCTGG AACCTTCAAC TTCATGTTGG TGTTCCAGGC AGAGCACAAC ATCCTCATGC ATCCCTTCCA CATGATTGGT GTGGCGGGCA TGTTTGGCGG CAGTTTGTTC TCCGCCATGC ACGGTTCACT GGTGACCTCA TCGTTGATTC GTGAAACCAC CGAAACCGAG TCTCAGAACT ACGGCTATAA GTTCGGCCAA GAGGAAGAGA CCTACAACAT CGTGGCTGCC CACGGTTATT TCGGTCGTCT GATCTTCCAG TACGCCAGCT TCAATAACAG CCGCAGTCTG CACTTCTTCC TGGCTGCCTG GCCGGTGATC TGCATCTGGA TCACCTCACT GGGTATTAGC ACCATGGCCT TCAACTTGAA CGGCTTTAAC TTCAACCAGT CGGTTCTCGA TGCCCAAGGC AGAGTTGTAC CAACCTGGGC TGATGTGCTC AACCGCAGCA ACCTCGGTAT GGAAGTGATG CACGAGCGTA ATGCTCATAA CTTCCCTCTC GACCTGGCAG CTGCTGAGTC CACACCTGTG GCTCTGATTG CTCCTGCGAT TGGCTGA
|
Protein sequence | MTTTIRSGRL SSWESFCNWV TSTNNRIYVG WFGVLMVPTL LAAAICFTIA FIAAPPVDID GIREPVAGSF LYGNNIISGA VVPSSNAIGL HFYPIWEAAS VDEWLYNGGP YQLVVFHFLI GICCWLGRQW ELSYRLGMRP WICVAYSAPL SAAFAVFLIY PVGQGSFSDG MPLGISGTFN FMLVFQAEHN ILMHPFHMIG VAGMFGGSLF SAMHGSLVTS SLIRETTETE SQNYGYKFGQ EEETYNIVAA HGYFGRLIFQ YASFNNSRSL HFFLAAWPVI CIWITSLGIS TMAFNLNGFN FNQSVLDAQG RVVPTWADVL NRSNLGMEVM HERNAHNFPL DLAAAESTPV ALIAPAIG
|
| |