Gene A9601_17281 details

Gene Information

Locus tag: A9601_17281
Symbol: psaA
ID: 4718459
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1471672
End bp: 1473975
Gene Length: 2304 bp
Protein Length: 767 aa
Translation table: 11
GC content: 40%
IMG OID: 640079455
Product: photosystem I P700 chlorophyll a apoprotein A1
Protein accession: YP_001010118
Protein GI: 123969260
COG category:
COG ID:
TIGRFAM ID: [TIGR01335] photosystem I core protein PsaA
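
The start and end positions, gene length, and protein length listed above agree with one another, which a little arithmetic can confirm. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1471672, 1473975)
print(length)                     # 2304
print(protein_length_aa(length))  # 767
```

Both results match the Gene Length and Protein Length fields of the record.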


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.72658
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCATCA GCCCACCAGA AAGTGGAGAA AAAAACAAAA AAGTTTTGGA AGATCCTGTA 
AAGGCCGATC CAAGACCTAT TGATTTTGCC AAATTAGATA AGCCAGGTTT CTGGTCAAGT
AAATTATCTA AAGGTCCAAA AACTACAACT TGGATCTGGA ATTTGCATGC TGATGCACAT
GATTTCGATG TGCATACAGG CGATGCTGAA GAAGCAACAA GAAAAATCTT TTCAGCTCAC
TTTGGACATC TTGCAGTCAT TTTTATATGG ATGAGTGCTG CATTTTTCCA TGGAGCAAGA
TTTTCTAATT ACTCAGGTTG GTTAGCTGAT CCAACTCATG TCAAACCAGG AGCTCAGCAA
GTATGGGCAA TCGTTGGTCA AGAAATGCTT AATGCTGATC TTGGTGCTAA TTACAACGGT
ATTCAAATTA GTTCAGGAAT ATTCCACATG TGGCGAGCAT GGGGAATCAC TAATGAGAGT
GAACTCATGG CTTTGGCAAT AGGTGCTGTT GTAATGGCTG CACTTATGCT TCATGCTGGA
ATTTTTCATT ATCACAAAGC GGCTCCAAAA ATGGAGTGGT TTCAAGATAT AGAGTCTATG
CTTAACCACC ACATAGCTGG TTTAGTAGGA TTAGGATCTT TAGCATGGGC TGGCCATTGT
ATTCATATCG GAGCTCCTAC TGCGGCTCTC TTAGATGCAA TTGATGCAGG TTCTCCTTTA
GTTATTAATG GGAAAGAAAT AGCAACAATT GCAGATATGC CTATGCCGCA TCAACTCTGC
GATCCACAAA TTATCGGTCA GATATTCCCT GGATTAGCAA GTGGTACAGG CAATTTCTTC
AGTTTAAATT GGTTAGCTTT CTCAGACTTC CTTACTTTCA AAGGTGGACT TAACCCTGTG
ACAGGTAGCT TGTGGATGAC TGATGTTTCA CATCATCATT TAGCTTTTGG TGTAATAGCA
ATAATCGGTG GTCATATGTA TAGAACCAAT TATGGAATTG GTCATAGTAT GAAAGAAATA
TTAGATTCAC AACAAGGCGA CCCAATATTA TTCCCTGCGC CTAAAGGTCA TCAAGGACTT
TTTGAGTTCA TGGCAGAAAG TAGACATGCA CAGCTATCGG TAAATCTAGC GATGCTTGGA
TCAATAAGCA TTCTTGTATC TCACCACATG TATGCGATGC CTCCGTATCC TTATATAGCT
ACTGACTACA TGACAGTTCT TGGATTATTT ACCCATCACA TGTGGATAGG TGGATTATTC
ATAGTTGGAG CAGGTGCGCA TGCTGGAATT GCAATGGTTA GAGATTATGA TCCAGCAAAA
CATATTGATA ATGTCTTAGA CAGAATTCTT AAGGCAAGAG ATGCTTTAAT CAGTCACTTG
AACTGGGTAT GTATGTGGTT AGGATTCCAT AGTTTTGGAC TCTATATTCA TAACGATACT
ATGAGAGCTT TGGGTAGACC TCAAGATATG TTTAGTGATT CAGCAATCCA ACTTCAGCCA
ATCTTTGCTC AATGGGTACA GAGTATTCAA GCATCTGCTG TTGGAACTTT TCTTTTAGCA
GGTACTTCAG AAGCTTTACC TCACAAAGCT TTAAGTGAAG TTTTTAATGG AAGTTTAGTA
GAAGTTGGCG GAAAGGTAGC TATAGCGCCA ATTCCATTAG GTACAGCTGA TTTAATGATT
CATCATATTC ATGCTTTCCA AATACATGTA ACTGTCTTGA TACTTCTCAA AGGAGTACTT
TATGCAAGAA GTTCAAGGTT GATCCCTGAT AAAGCTTCTT TAGGATTTAG ATTTCCTTGT
GATGGACCTG GAAGAGGTGG TACATGTCAA GTTTCTTCAT GGGATCACGT TTTCTTAGCT
CTCTTCTGGA TGTATAACTG TTTATCAATA GTTATTTTCC ACTTCTCTTG GAAAATGCAG
AGTGATGTTT GGGGACTTAC TGGTGGTAAT TTCGCACAAA GTTCAATTAC TATCAATGGT
TGGTTAAGAG ATTTCCTCTG GGCTCAAGCT TCTCAAGTAT TAACAAGCTA TGGTCAATCA
ATAAGCATGT ACGGTTTGAT GTTCTTAGGT GCTCATTTTA TATGGGCGTT CAGTTTAATG
TTCCTCTTCA GCGGTAGAGG ATATTGGCAA GAATTGTTCG AATCAATTGT TTGGGCACAC
AATAAACTAA AGGTTGCACC AACTATTCAG CCCCGAGCTC TATCTATTAC TCAGGGACGT
GCAGTAGGTG TTACACACTT CCTAGTAGGT GGTATTGCTA CCACATGGGC CTTCTTCCAT
GCTCGCCTTT TCGGGCTGGG CTAA
 
Protein sequence
MTISPPESGE KNKKVLEDPV KADPRPIDFA KLDKPGFWSS KLSKGPKTTT WIWNLHADAH 
DFDVHTGDAE EATRKIFSAH FGHLAVIFIW MSAAFFHGAR FSNYSGWLAD PTHVKPGAQQ
VWAIVGQEML NADLGANYNG IQISSGIFHM WRAWGITNES ELMALAIGAV VMAALMLHAG
IFHYHKAAPK MEWFQDIESM LNHHIAGLVG LGSLAWAGHC IHIGAPTAAL LDAIDAGSPL
VINGKEIATI ADMPMPHQLC DPQIIGQIFP GLASGTGNFF SLNWLAFSDF LTFKGGLNPV
TGSLWMTDVS HHHLAFGVIA IIGGHMYRTN YGIGHSMKEI LDSQQGDPIL FPAPKGHQGL
FEFMAESRHA QLSVNLAMLG SISILVSHHM YAMPPYPYIA TDYMTVLGLF THHMWIGGLF
IVGAGAHAGI AMVRDYDPAK HIDNVLDRIL KARDALISHL NWVCMWLGFH SFGLYIHNDT
MRALGRPQDM FSDSAIQLQP IFAQWVQSIQ ASAVGTFLLA GTSEALPHKA LSEVFNGSLV
EVGGKVAIAP IPLGTADLMI HHIHAFQIHV TVLILLKGVL YARSSRLIPD KASLGFRFPC
DGPGRGGTCQ VSSWDHVFLA LFWMYNCLSI VIFHFSWKMQ SDVWGLTGGN FAQSSITING
WLRDFLWAQA SQVLTSYGQS ISMYGLMFLG AHFIWAFSLM FLFSGRGYWQ ELFESIVWAH
NKLKVAPTIQ PRALSITQGR AVGVTHFLVG GIATTWAFFH ARLFGLG
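
The gene sequence in this record is printed in space-separated blocks of ten bases, so it has to be joined before any per-base statistic such as the GC content can be computed. The following is a minimal sketch of that step; the helper names are hypothetical, and the way the record's 40% figure was rounded is not stated, so no expected GC value is asserted here.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from the record, used as sample input.
first_line = (
    "ATGACCATCA GCCCACCAGA AAGTGGAGAA "
    "AAAAACAAAA AAGTTTTGGA AGATCCTGTA"
)
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this first line only
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` in the same way gives a value that can be compared with the GC content field.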