Gene Information |
Locus tag | A9601_17281 |
Symbol | psaA |
ID | 4718459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1471672 |
End bp | 1473975 |
Gene Length | 2304 bp |
Protein Length | 767 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640079455 |
Product | photosystem I P700 chlorophyll a apoprotein A1 |
Protein accession | YP_001010118 |
Protein GI | 123969260 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01335] photosystem I core protein PsaA |
|
|
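The coordinates and lengths listed above agree with one another, which a little arithmetic confirms. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; both are assumptions about this record's conventions, not something the record states. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1471672, 1473975)
print(length)                  # 2304
print(protein_length(length))  # 767
```

Both values match the Gene Length and Protein Length fields in the table.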
Plasmid Coverage Information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.72658 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCATCA GCCCACCAGA AAGTGGAGAA AAAAACAAAA AAGTTTTGGA AGATCCTGTA AAGGCCGATC CAAGACCTAT TGATTTTGCC AAATTAGATA AGCCAGGTTT CTGGTCAAGT AAATTATCTA AAGGTCCAAA AACTACAACT TGGATCTGGA ATTTGCATGC TGATGCACAT GATTTCGATG TGCATACAGG CGATGCTGAA GAAGCAACAA GAAAAATCTT TTCAGCTCAC TTTGGACATC TTGCAGTCAT TTTTATATGG ATGAGTGCTG CATTTTTCCA TGGAGCAAGA TTTTCTAATT ACTCAGGTTG GTTAGCTGAT CCAACTCATG TCAAACCAGG AGCTCAGCAA GTATGGGCAA TCGTTGGTCA AGAAATGCTT AATGCTGATC TTGGTGCTAA TTACAACGGT ATTCAAATTA GTTCAGGAAT ATTCCACATG TGGCGAGCAT GGGGAATCAC TAATGAGAGT GAACTCATGG CTTTGGCAAT AGGTGCTGTT GTAATGGCTG CACTTATGCT TCATGCTGGA ATTTTTCATT ATCACAAAGC GGCTCCAAAA ATGGAGTGGT TTCAAGATAT AGAGTCTATG CTTAACCACC ACATAGCTGG TTTAGTAGGA TTAGGATCTT TAGCATGGGC TGGCCATTGT ATTCATATCG GAGCTCCTAC TGCGGCTCTC TTAGATGCAA TTGATGCAGG TTCTCCTTTA GTTATTAATG GGAAAGAAAT AGCAACAATT GCAGATATGC CTATGCCGCA TCAACTCTGC GATCCACAAA TTATCGGTCA GATATTCCCT GGATTAGCAA GTGGTACAGG CAATTTCTTC AGTTTAAATT GGTTAGCTTT CTCAGACTTC CTTACTTTCA AAGGTGGACT TAACCCTGTG ACAGGTAGCT TGTGGATGAC TGATGTTTCA CATCATCATT TAGCTTTTGG TGTAATAGCA ATAATCGGTG GTCATATGTA TAGAACCAAT TATGGAATTG GTCATAGTAT GAAAGAAATA TTAGATTCAC AACAAGGCGA CCCAATATTA TTCCCTGCGC CTAAAGGTCA TCAAGGACTT TTTGAGTTCA TGGCAGAAAG TAGACATGCA CAGCTATCGG TAAATCTAGC GATGCTTGGA TCAATAAGCA TTCTTGTATC TCACCACATG TATGCGATGC CTCCGTATCC TTATATAGCT ACTGACTACA TGACAGTTCT TGGATTATTT ACCCATCACA TGTGGATAGG TGGATTATTC ATAGTTGGAG CAGGTGCGCA TGCTGGAATT GCAATGGTTA GAGATTATGA TCCAGCAAAA CATATTGATA ATGTCTTAGA CAGAATTCTT AAGGCAAGAG ATGCTTTAAT CAGTCACTTG AACTGGGTAT GTATGTGGTT AGGATTCCAT AGTTTTGGAC TCTATATTCA TAACGATACT ATGAGAGCTT TGGGTAGACC TCAAGATATG TTTAGTGATT CAGCAATCCA ACTTCAGCCA ATCTTTGCTC AATGGGTACA GAGTATTCAA GCATCTGCTG TTGGAACTTT TCTTTTAGCA GGTACTTCAG AAGCTTTACC TCACAAAGCT TTAAGTGAAG TTTTTAATGG AAGTTTAGTA GAAGTTGGCG GAAAGGTAGC TATAGCGCCA ATTCCATTAG GTACAGCTGA TTTAATGATT CATCATATTC ATGCTTTCCA AATACATGTA ACTGTCTTGA TACTTCTCAA AGGAGTACTT TATGCAAGAA GTTCAAGGTT GATCCCTGAT AAAGCTTCTT TAGGATTTAG ATTTCCTTGT 
GATGGACCTG GAAGAGGTGG TACATGTCAA GTTTCTTCAT GGGATCACGT TTTCTTAGCT CTCTTCTGGA TGTATAACTG TTTATCAATA GTTATTTTCC ACTTCTCTTG GAAAATGCAG AGTGATGTTT GGGGACTTAC TGGTGGTAAT TTCGCACAAA GTTCAATTAC TATCAATGGT TGGTTAAGAG ATTTCCTCTG GGCTCAAGCT TCTCAAGTAT TAACAAGCTA TGGTCAATCA ATAAGCATGT ACGGTTTGAT GTTCTTAGGT GCTCATTTTA TATGGGCGTT CAGTTTAATG TTCCTCTTCA GCGGTAGAGG ATATTGGCAA GAATTGTTCG AATCAATTGT TTGGGCACAC AATAAACTAA AGGTTGCACC AACTATTCAG CCCCGAGCTC TATCTATTAC TCAGGGACGT GCAGTAGGTG TTACACACTT CCTAGTAGGT GGTATTGCTA CCACATGGGC CTTCTTCCAT GCTCGCCTTT TCGGGCTGGG CTAA
|
Protein sequence | MTISPPESGE KNKKVLEDPV KADPRPIDFA KLDKPGFWSS KLSKGPKTTT WIWNLHADAH DFDVHTGDAE EATRKIFSAH FGHLAVIFIW MSAAFFHGAR FSNYSGWLAD PTHVKPGAQQ VWAIVGQEML NADLGANYNG IQISSGIFHM WRAWGITNES ELMALAIGAV VMAALMLHAG IFHYHKAAPK MEWFQDIESM LNHHIAGLVG LGSLAWAGHC IHIGAPTAAL LDAIDAGSPL VINGKEIATI ADMPMPHQLC DPQIIGQIFP GLASGTGNFF SLNWLAFSDF LTFKGGLNPV TGSLWMTDVS HHHLAFGVIA IIGGHMYRTN YGIGHSMKEI LDSQQGDPIL FPAPKGHQGL FEFMAESRHA QLSVNLAMLG SISILVSHHM YAMPPYPYIA TDYMTVLGLF THHMWIGGLF IVGAGAHAGI AMVRDYDPAK HIDNVLDRIL KARDALISHL NWVCMWLGFH SFGLYIHNDT MRALGRPQDM FSDSAIQLQP IFAQWVQSIQ ASAVGTFLLA GTSEALPHKA LSEVFNGSLV EVGGKVAIAP IPLGTADLMI HHIHAFQIHV TVLILLKGVL YARSSRLIPD KASLGFRFPC DGPGRGGTCQ VSSWDHVFLA LFWMYNCLSI VIFHFSWKMQ SDVWGLTGGN FAQSSITING WLRDFLWAQA SQVLTSYGQS ISMYGLMFLG AHFIWAFSLM FLFSGRGYWQ ELFESIVWAH NKLKVAPTIQ PRALSITQGR AVGVTHFLVG GIATTWAFFH ARLFGLG
|
| |
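The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch in Python of how such a block-formatted gene sequence might be cleaned, its GC fraction computed, and its reading frame translated. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are permitted), and it uses only the first 30 bases of the gene sequence for illustration. The helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; the amino-acid assignments
# are those of the standard code, assumed here to apply to table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = clean("ATGACCATCA GCCCACCAGA AAGTGGAGAA")
print(translate(prefix))    # MTISPPESGE
print(gc_fraction(prefix))  # 0.5
```

The translated prefix matches the first block of the protein sequence. The GC fraction printed here covers only these 30 bases and is not the gene-wide figure reported in the table.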