Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PMT9312_1616 |
Symbol | psaA |
ID | 3766434 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9312 |
Kingdom | Bacteria |
Replicon accession | NC_007577 |
Strand | - |
Start bp | 1514630 |
End bp | 1516933 |
Gene Length | 2304 bp |
Protein Length | 767 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637798154 |
Product | photosystem I P700 chlorophyll a apoprotein A1 |
Protein accession | YP_398112 |
Protein GI | 78780000 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01335] photosystem I core protein PsaA |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.165391 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCATCA GCCCACCAGA AAGTGGAGAA AAAAACAAGA AAGTTTTGGA AGATCCTGTT AAGGCCGATC CAAGACCTAT TGATTTTGCC AAATTAGATA AGCCAGGTTT CTGGTCTACT AAATTATCCA AGGGTCCAAA AACTACAACT TGGATCTGGA ATTTGCATGC AGATGCACAT GATTTTGATG TGCATACAGG CGATGCTGAA GAAGCAACAA GAAAAATCTT CTCAGCTCAC TTTGGACATC TTGCAATCAT TTTTATATGG ATGAGTGCTG CTTTTTTCCA TGGAGCAAGA TTTTCAAATT ACACAGGTTG GTTAGCTGAT CCAACTAATG TTAAACCAGG AGCTCAGCAA GTTTGGGCTG TTGTCGGCCA AGAGATGCTA AACGGTAATC TTGGTGCGGA CTACAACGGT ATTCAAATAA GCTCTGGAAT TTTTCATATG TGGAGAGCTT GGGGTATTAC TAACGAAAGT GAGTTAATGG CTTTGGCTAT TGGTGCAGTA ATAATGGCTG CTCTTATGCT TCATGGAGGT ATATACCATT ATCATAAAGC TGCTCCAAAA TTGGAATGGT TCCAAAATAT TGAGTCAATG CTTAATCACC ATATTGCAGG TTTGGTTGGG CTTGGATCAC TAGCTTGGGC TGGACATTGT ATTCATATTG GAGCGCCAAC AGCTGCACTT CTTGATGCAA TTGACGCTGG ATCACCTTTA GTTATTAATG GTCAAAAGAT CGCAACAATT GCGGATATGC CAATGCCTCA TCAGCTTTGC GATCCCCAAA TAATTGGTCA GATCTTTCCT GGTCTAGCTA GTGGAGTTGG TAATTTCTTT AGTCTTAATT GGTTCGCTTT CTCAGATTTC TTAACTTTTA AAGGTGGTTT GAATCCTGTC ACTGGAAGTT TATGGATGAC TGATATTGCA CATCATCATT TAGCTTTTGG TGTAATTGCC ATAATTGGAG GTCATTTATA TAGAACTAAT TATGGAATTG GTTCAAGTAT GAAAGAGATT TTAGAAGCTC ATCAAGGAGA TCCAATATTA TTCCCAGCAC CAAAAGGTCA TCAAGGTCTT TTCGAGTTCA TGGCTGAAAG CAGACACGCT CAACTATCTG TAAACCTCGC ATGTCTAGGA TCTCTTAGCA TCCTGATTTC TCATCATATG TATGCGATGC CTCCATATCC TTACATAGCG ACTGATTATA TGACTGTTCT TGGTTTATTT ACACATCATA TGTGGATAGG AGCTTTATTT ATAGTTGGGG CTGGAGCTCA TGCTGGTATA GCCATGGTTA GAGACTATGA TCCAGCAAAA CATATTGATA ACGTTTTAGA TAGGATTTTA AAAGCTAGAG ATGCCTTAAT AAGTCATCTT AACTGGGTTT GTATGTGGTT AGGTTTTCAT AGTTTTGGAC TTTATATTCA TAACGACACT ATGAGAGCTT TAGGTAGACC TCAGGACATG TTTAGTGATA ATGCAATTCA GCTTCAACCA ATATTTGCGC AATGGGTTCA AAGTATTCAA GCTTCTGCAG TAGGCACTTC AATTTTAGCT GGTACTCCGG AGGGATTACC TCAAAAAGCT TTAAGTGAAG TCTTTAACGG AAGTTTAGTT GAAGTAGGTG GAAAGGTAGC CATATCTCCA ATTCAACTTG GAACTGCTGA TTTGATGATC CATCATATTC ATGCTTTCCA AATACATGTA ACCGTTTTAA TACTCCTTAA AGGAGTACTT TACGCAAGAA GTTCAAGACT AATTCCAGAT AAAGCTTCTC TTGGATTTAG ATTCCCTTGT GATGGTCCAG GAAGAGGTGG TACTTGTCAG GTTTCATCAT GGGATCATGT CTTCTTAGCT CTTTTCTGGA TGTATAACTG CATATCAATA GTTATATTCC ACTTTTCTTG GAAAATGCAA AGTGATGTGT GGGGACTTAC AGGAGGAAAC TTCTCTCAAA GTGCTATCAC CATTAATGGT TGGCTAAGAG ATTTCCTATG GGCTCAATCA TCTCAAGTAC TAACAAGTTA CGGTGAAGCT ATTAGTATGT ATGGATTGAT GTTCTTAGGA GCTCACTTCA TATGGGCGTT TAGTTTGATG TTCTTATTCA GTGGAAGAGG CTATTGGCAA GAATTATTTG AATCAATAGT ATGGGCTCAT AACAAGCTTA AAGTTGCCCC AACAATACAA CCAAGAGCAC TTTCAATTAC TCAAGGTAGA GCCGTTGGTG TAGCTCACTT CCTTCTCGGA GGTATAGCAA CTACTTGGGC TTTCTTCCAT GCTCGCCTTT TCGGGCTGGG CTAA
|
Protein sequence | MTISPPESGE KNKKVLEDPV KADPRPIDFA KLDKPGFWST KLSKGPKTTT WIWNLHADAH DFDVHTGDAE EATRKIFSAH FGHLAIIFIW MSAAFFHGAR FSNYTGWLAD PTNVKPGAQQ VWAVVGQEML NGNLGADYNG IQISSGIFHM WRAWGITNES ELMALAIGAV IMAALMLHGG IYHYHKAAPK LEWFQNIESM LNHHIAGLVG LGSLAWAGHC IHIGAPTAAL LDAIDAGSPL VINGQKIATI ADMPMPHQLC DPQIIGQIFP GLASGVGNFF SLNWFAFSDF LTFKGGLNPV TGSLWMTDIA HHHLAFGVIA IIGGHLYRTN YGIGSSMKEI LEAHQGDPIL FPAPKGHQGL FEFMAESRHA QLSVNLACLG SLSILISHHM YAMPPYPYIA TDYMTVLGLF THHMWIGALF IVGAGAHAGI AMVRDYDPAK HIDNVLDRIL KARDALISHL NWVCMWLGFH SFGLYIHNDT MRALGRPQDM FSDNAIQLQP IFAQWVQSIQ ASAVGTSILA GTPEGLPQKA LSEVFNGSLV EVGGKVAISP IQLGTADLMI HHIHAFQIHV TVLILLKGVL YARSSRLIPD KASLGFRFPC DGPGRGGTCQ VSSWDHVFLA LFWMYNCISI VIFHFSWKMQ SDVWGLTGGN FSQSAITING WLRDFLWAQS SQVLTSYGEA ISMYGLMFLG AHFIWAFSLM FLFSGRGYWQ ELFESIVWAH NKLKVAPTIQ PRALSITQGR AVGVAHFLLG GIATTWAFFH ARLFGLG
|
| |