Gene P9301_17161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_17161 
SymbolpsaA 
ID4912402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1448749 
End bp1451052 
Gene Length2304 bp 
Protein Length767 aa 
Translation table11 
GC content40% 
IMG OID640161314 
Productphotosystem I P700 chlorophyll a apoprotein A1 
Protein accessionYP_001091940 
Protein GI126697054 
COG category 
COG ID 
TIGRFAM ID[TIGR01335] photosystem I core protein PsaA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATCA GCCCACCAGA AAGTGGAGAA AAAAACAAAA AGGTTTTGGA GGATCCTGTT 
AAGGCCGATC CAAGACCTAT TGATTTTGCC AAATTAGATA AGCCAGGTTT CTGGTCAACT
AAATTATCTA AGGGTCCAAA AACTACTACT TGGATCTGGA ATTTACATGC AGATGCACAT
GATTTTGATG TGCATACAGG CGATGCTGAA GAAGCAACAA GAAAAATCTT TTCAGCTCAT
TTTGGACATC TTGCAGTCAT TTTTATATGG ATGAGTGCTG CATTTTTCCA TGGAGCAAGA
TTTTCTAATT ATTCAGGTTG GTTAGCTGAT CCAACTCATG TCAAGCCAGG TGCTCAGCAA
GTTTGGGCAA TTGTTGGTCA AGAAATGCTT AATGCTGATC TTGGTGCTAA CTACAATGGT
ATTCAAATCA GTTCAGGAAT ATTCCACATG TGGAGAGCAT GGGGAATTAC TAACGAGAGT
GAACTGATGG CATTAGCAAT AGGTGCTGTA GTAATGGCTG CACTTATGCT TCATGCGGGA
ATTTTTCATT ATCATAAAGC CGCTCCAAAA ATGGAGTGGT TCCAAGATAT TGAGTCTATG
CTAAACCACC ATATAGCTGG TTTAGTCGGA TTAGGATCTT TAGCATGGGC TGGTCACTGT
ATTCATATAG GAGCTCCTAC AGCAGCTCTT TTAGATGCAA TTGATGCAGG CTCTCCTTTA
GTCATCAATG GTAAAGAGAT AGCAACTATT GCAGATATGC CTATGCCGCA TCAACTCTGT
GATCCACAAA TTATTGGTCA GATATTCCCA GGATTAGCAA GTGGTACAGG TAATTTCTTT
AGCTTAAACT GGTTAGCTTT CTCAGACTTT CTCACTTTCA AAGGCGGACT TAACCCTGTT
ACAGGAAGTT TATGGATGAC TGATGTTTCA CATCATCATT TAGCTTTTGG TGTAATAGCA
ATCATTGGTG GTCATATGTA TAGAACCAAT TATGGTATTG GTCATAGTAT GAAAGAAATA
TTAGATTCAC AGCAAGGAGA CCCAATATTA TTCCCTGCTC CTAAAGGTCA TCAAGGTCTT
TTTGAGTTCA TGGCAGAAAG TAGACATGCC CAGCTTGCGG TAAACCTAGC AATGCTTGGA
TCAATAAGCA TACTTGTATC TCATCATATG TATGCGATGC CTCCATATCC ATATATAGCT
ACTGACTACA TGACAGTTCT TGGATTATTT ACTCATCACA TGTGGATAGG TGGATTATTC
ATAGTAGGTG CAGGAGCGCA TGCTGGAATT GCAATGGTCA GAGATTACGA TCCAGCAAAA
CATATTGATA ATGTATTAGA CAGAATTCTT AAAGCAAGAG ATGCCCTAAT CAGTCACTTG
AACTGGGTAT GTATGTGGTT AGGATTTCAT AGTTTTGGAC TCTATATTCA CAACGATACT
ATGAGAGCTT TGGGAAGACC CCAAGATATG TTTAGTGATT CTGCAATCCA ACTTCAGCCA
ATTTTTGCTC AATGGGTACA GAGTATTCAA GCATCTGCTG TTGGAACTTC TCTTTTAGCA
GGTACTGCAG AAGCTCTACC TCACAAAGCT TTGAGTGAAG TTTTTAACGG AAGTTTAGTA
GAAGTGGGTG GAAAGGTTGC TATAGCTCCG ATTCCATTAG GGACTGCTGA TTTAATGATT
CATCATATTC ATGCTTTCCA AATTCACGTT ACTGTTTTGA TACTTCTTAA AGGAGTTCTT
TATGCAAGAA GTTCAAGGTT GATCCCTGAT AAAGCTTCTT TAGGATTTAG ATTCCCTTGT
GATGGACCTG GTAGAGGTGG TACATGTCAA GTTTCTTCAT GGGATCACGT GTTCTTAGCC
CTTTTCTGGA TGTATAACTG TTTATCCATA GTTATTTTCC ACTTCTCTTG GAAAATGCAG
AGTGATGTTT GGGGCCTTAC CGGTGGTAAC TTCGCACAAA GTTCCATTAC TATTAATGGT
TGGTTAAGAG ATTTCCTTTG GGCGCAAGCT TCTCAAGTAT TAACAAGTTA TGGTCAATCC
ATAAGCATGT ACGGTTTGAT GTTCTTAGGA GCTCACTTCA TATGGGCATT TAGTTTAATG
TTCCTCTTTA GTGGACGCGG ATATTGGCAA GAATTATTCG AATCAATTGT TTGGGCACAC
AACAAACTTA AAGTAGCCCC AACCATTCAA CCAAGAGCTT TATCTATCAC TCAGGGTAGA
GCAGTAGGTG TAACACACTT CCTTGTCGGT GGTATTGCTA CCACATGGGC TTTCTTCCAT
GCTCGCCTTT TCGGCCTGGG CTAA
 
Protein sequence
MTISPPESGE KNKKVLEDPV KADPRPIDFA KLDKPGFWST KLSKGPKTTT WIWNLHADAH 
DFDVHTGDAE EATRKIFSAH FGHLAVIFIW MSAAFFHGAR FSNYSGWLAD PTHVKPGAQQ
VWAIVGQEML NADLGANYNG IQISSGIFHM WRAWGITNES ELMALAIGAV VMAALMLHAG
IFHYHKAAPK MEWFQDIESM LNHHIAGLVG LGSLAWAGHC IHIGAPTAAL LDAIDAGSPL
VINGKEIATI ADMPMPHQLC DPQIIGQIFP GLASGTGNFF SLNWLAFSDF LTFKGGLNPV
TGSLWMTDVS HHHLAFGVIA IIGGHMYRTN YGIGHSMKEI LDSQQGDPIL FPAPKGHQGL
FEFMAESRHA QLAVNLAMLG SISILVSHHM YAMPPYPYIA TDYMTVLGLF THHMWIGGLF
IVGAGAHAGI AMVRDYDPAK HIDNVLDRIL KARDALISHL NWVCMWLGFH SFGLYIHNDT
MRALGRPQDM FSDSAIQLQP IFAQWVQSIQ ASAVGTSLLA GTAEALPHKA LSEVFNGSLV
EVGGKVAIAP IPLGTADLMI HHIHAFQIHV TVLILLKGVL YARSSRLIPD KASLGFRFPC
DGPGRGGTCQ VSSWDHVFLA LFWMYNCLSI VIFHFSWKMQ SDVWGLTGGN FAQSSITING
WLRDFLWAQA SQVLTSYGQS ISMYGLMFLG AHFIWAFSLM FLFSGRGYWQ ELFESIVWAH
NKLKVAPTIQ PRALSITQGR AVGVTHFLVG GIATTWAFFH ARLFGLG