Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_17161 |
Symbol | psaA |
ID | 4912402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1448749 |
End bp | 1451052 |
Gene Length | 2304 bp |
Protein Length | 767 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640161314 |
Product | photosystem I P700 chlorophyll a apoprotein A1 |
Protein accession | YP_001091940 |
Protein GI | 126697054 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01335] photosystem I core protein PsaA |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCATCA GCCCACCAGA AAGTGGAGAA AAAAACAAAA AGGTTTTGGA GGATCCTGTT AAGGCCGATC CAAGACCTAT TGATTTTGCC AAATTAGATA AGCCAGGTTT CTGGTCAACT AAATTATCTA AGGGTCCAAA AACTACTACT TGGATCTGGA ATTTACATGC AGATGCACAT GATTTTGATG TGCATACAGG CGATGCTGAA GAAGCAACAA GAAAAATCTT TTCAGCTCAT TTTGGACATC TTGCAGTCAT TTTTATATGG ATGAGTGCTG CATTTTTCCA TGGAGCAAGA TTTTCTAATT ATTCAGGTTG GTTAGCTGAT CCAACTCATG TCAAGCCAGG TGCTCAGCAA GTTTGGGCAA TTGTTGGTCA AGAAATGCTT AATGCTGATC TTGGTGCTAA CTACAATGGT ATTCAAATCA GTTCAGGAAT ATTCCACATG TGGAGAGCAT GGGGAATTAC TAACGAGAGT GAACTGATGG CATTAGCAAT AGGTGCTGTA GTAATGGCTG CACTTATGCT TCATGCGGGA ATTTTTCATT ATCATAAAGC CGCTCCAAAA ATGGAGTGGT TCCAAGATAT TGAGTCTATG CTAAACCACC ATATAGCTGG TTTAGTCGGA TTAGGATCTT TAGCATGGGC TGGTCACTGT ATTCATATAG GAGCTCCTAC AGCAGCTCTT TTAGATGCAA TTGATGCAGG CTCTCCTTTA GTCATCAATG GTAAAGAGAT AGCAACTATT GCAGATATGC CTATGCCGCA TCAACTCTGT GATCCACAAA TTATTGGTCA GATATTCCCA GGATTAGCAA GTGGTACAGG TAATTTCTTT AGCTTAAACT GGTTAGCTTT CTCAGACTTT CTCACTTTCA AAGGCGGACT TAACCCTGTT ACAGGAAGTT TATGGATGAC TGATGTTTCA CATCATCATT TAGCTTTTGG TGTAATAGCA ATCATTGGTG GTCATATGTA TAGAACCAAT TATGGTATTG GTCATAGTAT GAAAGAAATA TTAGATTCAC AGCAAGGAGA CCCAATATTA TTCCCTGCTC CTAAAGGTCA TCAAGGTCTT TTTGAGTTCA TGGCAGAAAG TAGACATGCC CAGCTTGCGG TAAACCTAGC AATGCTTGGA TCAATAAGCA TACTTGTATC TCATCATATG TATGCGATGC CTCCATATCC ATATATAGCT ACTGACTACA TGACAGTTCT TGGATTATTT ACTCATCACA TGTGGATAGG TGGATTATTC ATAGTAGGTG CAGGAGCGCA TGCTGGAATT GCAATGGTCA GAGATTACGA TCCAGCAAAA CATATTGATA ATGTATTAGA CAGAATTCTT AAAGCAAGAG ATGCCCTAAT CAGTCACTTG AACTGGGTAT GTATGTGGTT AGGATTTCAT AGTTTTGGAC TCTATATTCA CAACGATACT ATGAGAGCTT TGGGAAGACC CCAAGATATG TTTAGTGATT CTGCAATCCA ACTTCAGCCA ATTTTTGCTC AATGGGTACA GAGTATTCAA GCATCTGCTG TTGGAACTTC TCTTTTAGCA GGTACTGCAG AAGCTCTACC TCACAAAGCT TTGAGTGAAG TTTTTAACGG AAGTTTAGTA GAAGTGGGTG GAAAGGTTGC TATAGCTCCG ATTCCATTAG GGACTGCTGA TTTAATGATT CATCATATTC ATGCTTTCCA AATTCACGTT ACTGTTTTGA TACTTCTTAA AGGAGTTCTT TATGCAAGAA GTTCAAGGTT GATCCCTGAT AAAGCTTCTT TAGGATTTAG ATTCCCTTGT GATGGACCTG GTAGAGGTGG TACATGTCAA GTTTCTTCAT GGGATCACGT GTTCTTAGCC CTTTTCTGGA TGTATAACTG TTTATCCATA GTTATTTTCC ACTTCTCTTG GAAAATGCAG AGTGATGTTT GGGGCCTTAC CGGTGGTAAC TTCGCACAAA GTTCCATTAC TATTAATGGT TGGTTAAGAG ATTTCCTTTG GGCGCAAGCT TCTCAAGTAT TAACAAGTTA TGGTCAATCC ATAAGCATGT ACGGTTTGAT GTTCTTAGGA GCTCACTTCA TATGGGCATT TAGTTTAATG TTCCTCTTTA GTGGACGCGG ATATTGGCAA GAATTATTCG AATCAATTGT TTGGGCACAC AACAAACTTA AAGTAGCCCC AACCATTCAA CCAAGAGCTT TATCTATCAC TCAGGGTAGA GCAGTAGGTG TAACACACTT CCTTGTCGGT GGTATTGCTA CCACATGGGC TTTCTTCCAT GCTCGCCTTT TCGGCCTGGG CTAA
|
Protein sequence | MTISPPESGE KNKKVLEDPV KADPRPIDFA KLDKPGFWST KLSKGPKTTT WIWNLHADAH DFDVHTGDAE EATRKIFSAH FGHLAVIFIW MSAAFFHGAR FSNYSGWLAD PTHVKPGAQQ VWAIVGQEML NADLGANYNG IQISSGIFHM WRAWGITNES ELMALAIGAV VMAALMLHAG IFHYHKAAPK MEWFQDIESM LNHHIAGLVG LGSLAWAGHC IHIGAPTAAL LDAIDAGSPL VINGKEIATI ADMPMPHQLC DPQIIGQIFP GLASGTGNFF SLNWLAFSDF LTFKGGLNPV TGSLWMTDVS HHHLAFGVIA IIGGHMYRTN YGIGHSMKEI LDSQQGDPIL FPAPKGHQGL FEFMAESRHA QLAVNLAMLG SISILVSHHM YAMPPYPYIA TDYMTVLGLF THHMWIGGLF IVGAGAHAGI AMVRDYDPAK HIDNVLDRIL KARDALISHL NWVCMWLGFH SFGLYIHNDT MRALGRPQDM FSDSAIQLQP IFAQWVQSIQ ASAVGTSLLA GTAEALPHKA LSEVFNGSLV EVGGKVAIAP IPLGTADLMI HHIHAFQIHV TVLILLKGVL YARSSRLIPD KASLGFRFPC DGPGRGGTCQ VSSWDHVFLA LFWMYNCLSI VIFHFSWKMQ SDVWGLTGGN FAQSSITING WLRDFLWAQA SQVLTSYGQS ISMYGLMFLG AHFIWAFSLM FLFSGRGYWQ ELFESIVWAH NKLKVAPTIQ PRALSITQGR AVGVTHFLVG GIATTWAFFH ARLFGLG
|
| |