Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_16401 |
Symbol | psaB |
ID | 5730000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1479419 |
End bp | 1481647 |
Gene Length | 2229 bp |
Protein Length | 742 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641286019 |
Product | photosystem I P700 chlorophyll a apoprotein A2 |
Protein accession | YP_001551525 |
Protein GI | 159904181 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01336] photosystem I core protein PsaB |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACGA AATTTCCATC GTTTAACCAG GGTCTGGCAC AGGACCCAAC AACCCGACGT ATTTGGTACG GAATCGCCAC GGCTCACGAC TTCGAGAGCC ATGACGGAAT GACCGAAGAA CAGCTTTATC AAAAGCTGTT TGCTACACAC TTTGGTCACC TAGCCATTAT TGGCTTATGG GTTGCTGGGA ACCTGTTCCA CATCGCCTGG CAGGGCAACT ATGAACAGTG GGTTACCGAC CCACTTCATA TTCGCCCAAT AGCACACGCA ATCTGGGACC CTCACTTTGG TCAAGGTCTA ACAGACGCCT TAACTCAAGC AGGGGCAACT TCCCCAGTAA ATATTGCTTA TTCCGGCCTA TACCACTGGT GGTACACAAT AGGAATGAGA ACAAATGAGC AGCTATTTCA AGGTGCAATT TTCTTAAATA TCCTTGTTTG CTGGCTCCTA TTCGCAGGTT GGTTGCATCT TCAGCCAAAG TTCAGGCCTT CTTTGGCATG GTTCAAAAAT GCTGAAGCTC AACTCAACCA TCATTTAGCA GTTTTATTTG GCTTTAGCAG CATTGCCTGG ACTGGTCACC TTGTTCATGT GGCAATTCCT GAGTCCAGAG GCCAACATAT TGGCTGGGAT AACTGGTTAA CAGTTCTTCC TCACCCAGAA GGTTTGGCTC CATTCTTTTC ATTGAATTGG GGTGCTTATG CACAAAACCC AGACTCCTTG GATGCAGTTT TTGGAACTTC GCAAGGTGCA GGTACCGCAA TATTTACGTT CTTAGGTGGA CTTCACCCTC AAAGTGAATC ACTATGGCTC ACTGATATTG CACATCACCA TCTAGCAATA GGTGTGATGT TTATCATTGC TGGCCATATG TATAGAAATA CTTTTGGAAT TGGTCACACC CTTAAAGAAA TTACAGAGGC TCATAACACT GGCCATCCAA ATGATCCTCA TAAAGGTCAT TTCGGAATAA ATCACAATGG AATTTATGAG ACAGTTAACA ACTCTCTTCA TTTCCAACTT GGTCTGGCTC TAGCTTCATT AGGTGCTGCA TGTAGTTTGG TTGCTCAGCA CATGGGTGCA CTTCCTTCTT ATGCATTTAT AGCCAGGGAC TACACAACCC AATCTGCTCT TTATACACAC CATCAATACA TAGCAATGTT CTTGATGGTT GGTGCTTTCT CTCATGGAGC AATCTTTTTT GTGAGGGATT ATGATCCCGA ACTCAATAAA GATAACGTTC TGGCAAGAGT TCTAGAGACT AAGGAAGCAT TAATTAGTCA CCTCAGCTGG GTAACTATGC TTCTTGGGTT CCATACTCTT GGTCTTTATG TTCATAACGA CGTTGTAGTT GCTTTTGGCA ATCCAGAAAA GCAAATTCTC ATTGAACCAG TTTTTGCTCA GGCTATTCAA GCCTTTAGTG GAAAAGTCAT GTATGGCATT GATGCGTTAT TAGCTAATGC AAATAGTTCT GCAACCTTGG CTGCAAACAG CATGCCTGGT AACCATTATT GGATGGATCT TATTAATCGT CAGGACGCTT TAACTAACTT CCTCCCAATT GGGCCAGCTG ACTTCTTGGT TCACCATGCA ATTGCTTTAG GACTACACAC AACTGCCTTA ATACTGATTA AAGGTGCTCT TGATGCTAGA GGCACTAAAT TAATCCCTGA CAAGAAAGAC TTTGGCTATG CATTCCCTTG TGATGGCCCT GGTAGAGGTG GTACATGTGA TAGTTCGGCT TGGGATGCTA CTTATTTAGC AATGTTCTGG GCTCTAAATA CAATCGCTTG GATAACTTTC TATTGGCACT GGAAGCATCT TGCTATTTGG CAAGGCAATG TAGCCCAGTT TAATGAGTCA GGTACTTACT TGATGGGATG GTTCAGAGAT TATTTATGGC TCAATAGTTC TCAGCTGATC AATGGATATA ACCCCTTTGG AGTTAATGCA TTATCTCCGT GGGCATGGAT GTTCCTCTTC GGTCACCTGA TTTGGGCTAC AGGTTTCATG TTCCTGATTT CCTGGAGAGG ATACTGGCAG GAGCTCATAG AAACTTTAGT TTGGGCTCAC CAGCGTACTC CTATTGCCAA CCTTGTAGGT TGGAGAGATA AGCCAGTTGC CTTATCAATT GTTCAAGCAC GTCTAGTTGG ACTTACACAT TTCACCGTAG GAAACTTTGT TACCTTTGGT GCCTTTGTGA TAGCTTCTAC TTCGGGCAAG TTTGGATAG
|
Protein sequence | MATKFPSFNQ GLAQDPTTRR IWYGIATAHD FESHDGMTEE QLYQKLFATH FGHLAIIGLW VAGNLFHIAW QGNYEQWVTD PLHIRPIAHA IWDPHFGQGL TDALTQAGAT SPVNIAYSGL YHWWYTIGMR TNEQLFQGAI FLNILVCWLL FAGWLHLQPK FRPSLAWFKN AEAQLNHHLA VLFGFSSIAW TGHLVHVAIP ESRGQHIGWD NWLTVLPHPE GLAPFFSLNW GAYAQNPDSL DAVFGTSQGA GTAIFTFLGG LHPQSESLWL TDIAHHHLAI GVMFIIAGHM YRNTFGIGHT LKEITEAHNT GHPNDPHKGH FGINHNGIYE TVNNSLHFQL GLALASLGAA CSLVAQHMGA LPSYAFIARD YTTQSALYTH HQYIAMFLMV GAFSHGAIFF VRDYDPELNK DNVLARVLET KEALISHLSW VTMLLGFHTL GLYVHNDVVV AFGNPEKQIL IEPVFAQAIQ AFSGKVMYGI DALLANANSS ATLAANSMPG NHYWMDLINR QDALTNFLPI GPADFLVHHA IALGLHTTAL ILIKGALDAR GTKLIPDKKD FGYAFPCDGP GRGGTCDSSA WDATYLAMFW ALNTIAWITF YWHWKHLAIW QGNVAQFNES GTYLMGWFRD YLWLNSSQLI NGYNPFGVNA LSPWAWMFLF GHLIWATGFM FLISWRGYWQ ELIETLVWAH QRTPIANLVG WRDKPVALSI VQARLVGLTH FTVGNFVTFG AFVIASTSGK FG
|
| |