Gene P9211_16401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_16401 
SymbolpsaB 
ID5730000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1479419 
End bp1481647 
Gene Length2229 bp 
Protein Length742 aa 
Translation table11 
GC content44% 
IMG OID641286019 
Productphotosystem I P700 chlorophyll a apoprotein A2 
Protein accessionYP_001551525 
Protein GI159904181 
COG category 
COG ID 
TIGRFAM ID[TIGR01336] photosystem I core protein PsaB 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACGA AATTTCCATC GTTTAACCAG GGTCTGGCAC AGGACCCAAC AACCCGACGT 
ATTTGGTACG GAATCGCCAC GGCTCACGAC TTCGAGAGCC ATGACGGAAT GACCGAAGAA
CAGCTTTATC AAAAGCTGTT TGCTACACAC TTTGGTCACC TAGCCATTAT TGGCTTATGG
GTTGCTGGGA ACCTGTTCCA CATCGCCTGG CAGGGCAACT ATGAACAGTG GGTTACCGAC
CCACTTCATA TTCGCCCAAT AGCACACGCA ATCTGGGACC CTCACTTTGG TCAAGGTCTA
ACAGACGCCT TAACTCAAGC AGGGGCAACT TCCCCAGTAA ATATTGCTTA TTCCGGCCTA
TACCACTGGT GGTACACAAT AGGAATGAGA ACAAATGAGC AGCTATTTCA AGGTGCAATT
TTCTTAAATA TCCTTGTTTG CTGGCTCCTA TTCGCAGGTT GGTTGCATCT TCAGCCAAAG
TTCAGGCCTT CTTTGGCATG GTTCAAAAAT GCTGAAGCTC AACTCAACCA TCATTTAGCA
GTTTTATTTG GCTTTAGCAG CATTGCCTGG ACTGGTCACC TTGTTCATGT GGCAATTCCT
GAGTCCAGAG GCCAACATAT TGGCTGGGAT AACTGGTTAA CAGTTCTTCC TCACCCAGAA
GGTTTGGCTC CATTCTTTTC ATTGAATTGG GGTGCTTATG CACAAAACCC AGACTCCTTG
GATGCAGTTT TTGGAACTTC GCAAGGTGCA GGTACCGCAA TATTTACGTT CTTAGGTGGA
CTTCACCCTC AAAGTGAATC ACTATGGCTC ACTGATATTG CACATCACCA TCTAGCAATA
GGTGTGATGT TTATCATTGC TGGCCATATG TATAGAAATA CTTTTGGAAT TGGTCACACC
CTTAAAGAAA TTACAGAGGC TCATAACACT GGCCATCCAA ATGATCCTCA TAAAGGTCAT
TTCGGAATAA ATCACAATGG AATTTATGAG ACAGTTAACA ACTCTCTTCA TTTCCAACTT
GGTCTGGCTC TAGCTTCATT AGGTGCTGCA TGTAGTTTGG TTGCTCAGCA CATGGGTGCA
CTTCCTTCTT ATGCATTTAT AGCCAGGGAC TACACAACCC AATCTGCTCT TTATACACAC
CATCAATACA TAGCAATGTT CTTGATGGTT GGTGCTTTCT CTCATGGAGC AATCTTTTTT
GTGAGGGATT ATGATCCCGA ACTCAATAAA GATAACGTTC TGGCAAGAGT TCTAGAGACT
AAGGAAGCAT TAATTAGTCA CCTCAGCTGG GTAACTATGC TTCTTGGGTT CCATACTCTT
GGTCTTTATG TTCATAACGA CGTTGTAGTT GCTTTTGGCA ATCCAGAAAA GCAAATTCTC
ATTGAACCAG TTTTTGCTCA GGCTATTCAA GCCTTTAGTG GAAAAGTCAT GTATGGCATT
GATGCGTTAT TAGCTAATGC AAATAGTTCT GCAACCTTGG CTGCAAACAG CATGCCTGGT
AACCATTATT GGATGGATCT TATTAATCGT CAGGACGCTT TAACTAACTT CCTCCCAATT
GGGCCAGCTG ACTTCTTGGT TCACCATGCA ATTGCTTTAG GACTACACAC AACTGCCTTA
ATACTGATTA AAGGTGCTCT TGATGCTAGA GGCACTAAAT TAATCCCTGA CAAGAAAGAC
TTTGGCTATG CATTCCCTTG TGATGGCCCT GGTAGAGGTG GTACATGTGA TAGTTCGGCT
TGGGATGCTA CTTATTTAGC AATGTTCTGG GCTCTAAATA CAATCGCTTG GATAACTTTC
TATTGGCACT GGAAGCATCT TGCTATTTGG CAAGGCAATG TAGCCCAGTT TAATGAGTCA
GGTACTTACT TGATGGGATG GTTCAGAGAT TATTTATGGC TCAATAGTTC TCAGCTGATC
AATGGATATA ACCCCTTTGG AGTTAATGCA TTATCTCCGT GGGCATGGAT GTTCCTCTTC
GGTCACCTGA TTTGGGCTAC AGGTTTCATG TTCCTGATTT CCTGGAGAGG ATACTGGCAG
GAGCTCATAG AAACTTTAGT TTGGGCTCAC CAGCGTACTC CTATTGCCAA CCTTGTAGGT
TGGAGAGATA AGCCAGTTGC CTTATCAATT GTTCAAGCAC GTCTAGTTGG ACTTACACAT
TTCACCGTAG GAAACTTTGT TACCTTTGGT GCCTTTGTGA TAGCTTCTAC TTCGGGCAAG
TTTGGATAG
 
Protein sequence
MATKFPSFNQ GLAQDPTTRR IWYGIATAHD FESHDGMTEE QLYQKLFATH FGHLAIIGLW 
VAGNLFHIAW QGNYEQWVTD PLHIRPIAHA IWDPHFGQGL TDALTQAGAT SPVNIAYSGL
YHWWYTIGMR TNEQLFQGAI FLNILVCWLL FAGWLHLQPK FRPSLAWFKN AEAQLNHHLA
VLFGFSSIAW TGHLVHVAIP ESRGQHIGWD NWLTVLPHPE GLAPFFSLNW GAYAQNPDSL
DAVFGTSQGA GTAIFTFLGG LHPQSESLWL TDIAHHHLAI GVMFIIAGHM YRNTFGIGHT
LKEITEAHNT GHPNDPHKGH FGINHNGIYE TVNNSLHFQL GLALASLGAA CSLVAQHMGA
LPSYAFIARD YTTQSALYTH HQYIAMFLMV GAFSHGAIFF VRDYDPELNK DNVLARVLET
KEALISHLSW VTMLLGFHTL GLYVHNDVVV AFGNPEKQIL IEPVFAQAIQ AFSGKVMYGI
DALLANANSS ATLAANSMPG NHYWMDLINR QDALTNFLPI GPADFLVHHA IALGLHTTAL
ILIKGALDAR GTKLIPDKKD FGYAFPCDGP GRGGTCDSSA WDATYLAMFW ALNTIAWITF
YWHWKHLAIW QGNVAQFNES GTYLMGWFRD YLWLNSSQLI NGYNPFGVNA LSPWAWMFLF
GHLIWATGFM FLISWRGYWQ ELIETLVWAH QRTPIANLVG WRDKPVALSI VQARLVGLTH
FTVGNFVTFG AFVIASTSGK FG