Gene P9303_23471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_23471 
SymbolpsaB 
ID4778124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2062154 
End bp2064403 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content51% 
IMG OID640087868 
Productphotosystem I P700 chlorophyll a apoprotein A2 
Protein accessionYP_001018347 
Protein GI124024040 
COG category 
COG ID 
TIGRFAM ID[TIGR01336] photosystem I core protein PsaB 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACGA AATTTCCTTC GTTCAGCCAG GGTCTGGCAC AGGACCCTAC AACCCGCCGT 
ATTTGGTACG GCATAGCCAC GGCTCACGAC TTCGAGAGCC ACGACGGCAT GACCGAAGAA
CGGCTTTACC AAAAGCTGTT CTCTACCCAT TTCGGTCATC TGGCTGTTAT TGGCCTATGG
GTTGCAGGAA ACCTGTTCCA TATCGCCTGG CAGGGCAACT TCGAACAGTG GGTCGCCGAC
CCACAAAATG TTCAACCGAT TGCCCACGCA ATTTGGGATC CTCACTTTGG TCAAGGCATC
AGTGATGCTT TGACTCAAGC GGGTGCTTCC CGTCCAGTAA ACATTTGCTA TTCCGGTTTG
TACCACTGGT GGTACACCAT CGGAATGCGC ACCAATACAG AGCTTTACCA GGGCGCCATC
TTCATTGACA TCCTGGTTGC CTGGCTCTTG TTCGGTGGCT GGCTGCATTT ACAGCCAAAG
TTCCGTCCTT CGCTGGCGTG GTTCAAGAAC GCTGAGGCGA TGATGAATCA TCACCTCGCA
GTGTTGTTCG GTTTCACCAA CATCGCCTGG ACTGGTCACC TCGTTCACGT GGCCATTCCT
GAATCACGTG GTCAGCACGT CGGTTGGGAC AACTTCCTCA CAGTTCTTCC ACACCCTGAA
GGGTTGACCC CATTCTTCAC AGGAAACTGG GGAGCCTATG CACAGAATCC TGATTCTTTG
AATCATGCCT TTGGTACTTC TGAAGGCGCT GGCACGGCCA TCCTTACTTT CCTAGGTGGT
GTTCATCCAC AGAGTGGTGC GCTTTGGCTG ACTGACATCT CGCACCACCA CATTGCGATC
GGTGTCTTCA TGATCATCGG TGGCCACATG TACCGGAACA GTTTCGGTAT CGGCCATACC
TTTAAGGAAA TCACCGATGG TCACAACACA AGTCATCCTA ATGATCCTCA TAAGGATGGT
TTCCGAGAAA AAATCGGTCA CTACGGTCTT AGTCACACCG GGATCACCGA TACGATCAAC
AACTCCCTGC ACTTCCAACT GGGTCTTGCT CTTGCCTGTC TAGGTACAGC AGCCAGTCTT
GTAGCGCATC ACATGGGTGC TTTGCCCTCG TATGCCTTCA TTGCGCAGGA CTACACCACT
CAGGCAGCTC TTTACACCCA CCATCAGTAC ATCGCCATCT TCCTGATGTG CGGTGCCTTC
TCTCACGGAG CAATCTTCTT TGTCCGTGAC TATGACCCTG AGGCCAATAA GGACAATGTT
CTTGCAAGGG TCTTAGAAAC CAAAGAAGCA CTGATCAGTC ACTTGAGCTG GGTTTGCATG
CTCCTCGGTT TCCATACCCT GGCGCTGTAT CTCCATAACG ATGTGGTGAT TGCTTTCGGA
ACCCCCGAAA AGCAGATCCT TGTTGAGCCC ATCTTTGCTC AGTTCATTCA GGCCGCTAGT
GGCAAGGTCA TGTATGGCCT TGATGTGCTG CTTGCTAATG CAAATAGTGC CCCCTCCTTG
GCTGCGGCTG GGATGCCTGG GGATCACTAC TGGATGGATC TGATTAATGC CAGTCCAGAG
GTTTCCAACT TCATGCCTAT TGGTCCAGGA GACTTCCTGG TTCACCATGG CATTGCATTG
GGACTTCACA CCACTGCATT GATCTTGATC AAGGGTGCAC TTGATGCAAG AGGTTCCAAG
CTGATGCCCG ATAAAAAGGA CTTCGGCTAT GCCTTTGCAT GTGATGGCCC TGGACGTGGC
GGCACCTGCG ACATCTCTGC GTGGGACTCC ACCTATATGG CCATTTTCTG GGCACTGAAT
ACCATTGCTT GGGCAACCTA TTACTGGCAC TGGAAGCACC TTGCTGCTTG GCAGGGCAAC
ATGGCTCAGT TCAATGAGTC CAGTACCCAC TTAATGGGTT GGTTCAGGGA TTATCTCTGG
ATCAATAGTT CTCAGATCAT CAATGGCTAC AACCCATTCG GGATTAATAA TCTCTCCCCT
TGGGCATACA TGTTCCTTGC AGGCCATTTG GTCTGGGCAA CAGGATTCAT GTTCCTGATC
TCTTGGAGGG GTTATTGGCA GGAGCTGATC GAGACTCTCG TTTGGGCTCA TCAGCGTTCT
CCGATTGCCA ACCTGGTTGG ATGGCGTGAT AAGCCAGTAG CTCTTTCGAT CGTGCAGGCC
CGTCTCGTTG GTGTGACTCA CTTTGCAGTG GGCAACATAT TTACGTTCGG TGCCTTTGTG
ATTGCATCTA CAGCTAGCAA GTTTGGATAA
 
Protein sequence
MATKFPSFSQ GLAQDPTTRR IWYGIATAHD FESHDGMTEE RLYQKLFSTH FGHLAVIGLW 
VAGNLFHIAW QGNFEQWVAD PQNVQPIAHA IWDPHFGQGI SDALTQAGAS RPVNICYSGL
YHWWYTIGMR TNTELYQGAI FIDILVAWLL FGGWLHLQPK FRPSLAWFKN AEAMMNHHLA
VLFGFTNIAW TGHLVHVAIP ESRGQHVGWD NFLTVLPHPE GLTPFFTGNW GAYAQNPDSL
NHAFGTSEGA GTAILTFLGG VHPQSGALWL TDISHHHIAI GVFMIIGGHM YRNSFGIGHT
FKEITDGHNT SHPNDPHKDG FREKIGHYGL SHTGITDTIN NSLHFQLGLA LACLGTAASL
VAHHMGALPS YAFIAQDYTT QAALYTHHQY IAIFLMCGAF SHGAIFFVRD YDPEANKDNV
LARVLETKEA LISHLSWVCM LLGFHTLALY LHNDVVIAFG TPEKQILVEP IFAQFIQAAS
GKVMYGLDVL LANANSAPSL AAAGMPGDHY WMDLINASPE VSNFMPIGPG DFLVHHGIAL
GLHTTALILI KGALDARGSK LMPDKKDFGY AFACDGPGRG GTCDISAWDS TYMAIFWALN
TIAWATYYWH WKHLAAWQGN MAQFNESSTH LMGWFRDYLW INSSQIINGY NPFGINNLSP
WAYMFLAGHL VWATGFMFLI SWRGYWQELI ETLVWAHQRS PIANLVGWRD KPVALSIVQA
RLVGVTHFAV GNIFTFGAFV IASTASKFG