Gene P9515_17031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9515_17031 
SymbolpsaA 
ID4720212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9515 
KingdomBacteria 
Replicon accessionNC_008817 
Strand
Start bp1501600 
End bp1503882 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content40% 
IMG OID640081396 
Productphotosystem I P700 chlorophyll a apoprotein A1 
Protein accessionYP_001012017 
Protein GI123966936 
COG category 
COG ID 
TIGRFAM ID[TIGR01335] photosystem I core protein PsaA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATTA GCCCACCAGA AAGTGGAGAA AAAGACAAAA AGATTTTGGA ATCACCCGTT 
AAGGCTGATC CTAGACCTAT TGATTTCGCC AAATTAGATA AACCAGGTTT CTGGTCAAGT
AAATTATCCA AAGGTCCGAA AACCACAACT TGGATTTGGA ATTTACATGC AGACGCCCAC
GATTTTGATG TACATACAGG AGATGCTGAA GAAGCCACCA GAAAAATATT CTCAGCTCAC
TTTGGACATC TTGCAGTAAT TTTCATATGG ATGAGTGCTG CATTTTTCCA TGGAGCAAGA
TTCTCAAACT ATTCAGGGTG GTTGGCTGAT CCTACCCATG TAAAGCCTGG AGCTCAACAA
GTGTGGGCAA TCGTAGGTCA AGAGATGCTA AATGGTGATC TTGGTGCTAA TTACAATGGA
ATACAAATAA GCTCTGGAGT ATTCCACATG TGGCGAGCCT GGGGAATTAC AAATGAGAGT
GAATTGATGG CATTAGCTAT TGGAGCGGTT GTAATGGCTG CACTAATGCT TCATGCAGGG
ATATTCCATT ACCACAAAGC TGCTCCAAAA ATGGAATGGT TCCAAGATGT TGAGTCCATG
ATGAATCATC ATTTGGCTGG CCTATTGGGA CTAGGTTCTT TAGCTTGGGC AGGTCATACT
ATTCATATTG GTGCCCCTAC AGCAGCTCTT TTAGATGCGA TTGATGCTGG CAGCCCACTG
ATTATTAACG GAAAAGAGAT AGCAACAATT GCTGATATTC CAATGCCTCA TCAATTGTGT
GATCCACAAA TTGTTGGTCA AATCTTCCCG GGTCTTGCAA GTGGTACAGG TAACTTCTTT
AGTTTGAATT GGTTCGCTTT TTCAGATTTC TTAACTTTCA AGGGAGGACT TAATCCGGTT
ACTGGAAGCT TATGGATGAC TGATATTGCA CATCATCATC TTGCAATTGC TGTACTGTTC
ATAATTGCTG GTCATATGTA CAGGACTAAT TATGGAATTG GTCATAGTAT GAAAGAAATA
TTAGATGCAC ATCAGGGAGA TCCCATTCTT TTTCCTGCGC CTAGAGGTCA TCAAGGTCTT
TTTGATTTTA TGGCGGAAAG TAGACATGCT CAGTTATCGG TTAACTTAGC CTTGCTTGGA
TCTTTATCTA TTATCATCTC CCATCATATG TATGCAATGC CTCCATATCC TTACATTGCT
ACCGATTATA TGACTGTATT GGGATTGTTT ACCCACCATA TGTGGATAGG AGGTCTGTTT
ATTGTTGGAG CTGGAGCACA CGCAGGAATA GCTATGGTAA GAGACTATGA TCCAGCAAAA
CACATAGATA ATGTCTTAGA TAGAGTATTG AAAGCTAGGG ACGCGTTAAT AAGTCACCTT
AATTGGGTTT GTATGTGGTT AGGTTTTCAT AGTTTTGGAC TTTATATTCA CAACGACACT
ATGAGAGCTT TGGGTAGACC TCAAGATATG TTTAGTGATA AAGCAATTCA ATTACAACCA
ATTTTTGCTC AATGGATACA AAATATTCAA TCATCAGGAG TTGGAACAAC ACTCTTAGAA
GGTAATGGGG TAAGCCAAGT ATTTAATGGT GAGACAATAT CTGTTGGAGG AAAAGTTGCG
ATGACTGGTA TCCCTCTAGG AACTGCTGAT TTAATGATTC ACCACATTCA TGCATTCCAG
ATACATGTAA CAGTACTAAT TCTCCTTAAA GGTGTCCTTT ATGCAAGAAG TTCAAGGTTA
ATTCCTGATA AAGCATCACT TGGATTTAGA TTCCCATGTG ATGGTCCTGG AAGAGGAGGT
ACATGTCAGG TTTCTTCCTG GGATCATGTC TTCTTGGCAC TTTTCTGGAT GTATAATTGT
CTATCAATTG TTATTTTCCA TTTTTCTTGG AAAATGCAAA GTGATGTTTG GGGTCTTACA
GGAGGTAATT TTGCTCAAAG TGCAATAACT ATTAATGGAT GGTTAAGAGA CTTTTTATGG
GCACAAGCAG CACAGGTATT GACAAGTTAC GGTCAATCAA TAAGTATGTA TGGCTTAATG
TTCTTAGGTG CTCACTTTAT TTGGGCATTT AGCTTAATGT TCCTATTTAG TGGAAGAGGA
TACTGGCAAG AACTATTTGA ATCAATTGTT TGGGCACACA ATAAATTGAA AGTCGCTCCA
ACAATTCAAC CAAGAGCACT ATCAATTACT CAAGGTCGTG CAGTTGGTGT AACTCACTTC
TTAGTGGGTG GTATAGCTAC TACTTGGGCA TTTTTCCATG CTCGCCTTTT CGGGATTGGC
TAA
 
Protein sequence
MTISPPESGE KDKKILESPV KADPRPIDFA KLDKPGFWSS KLSKGPKTTT WIWNLHADAH 
DFDVHTGDAE EATRKIFSAH FGHLAVIFIW MSAAFFHGAR FSNYSGWLAD PTHVKPGAQQ
VWAIVGQEML NGDLGANYNG IQISSGVFHM WRAWGITNES ELMALAIGAV VMAALMLHAG
IFHYHKAAPK MEWFQDVESM MNHHLAGLLG LGSLAWAGHT IHIGAPTAAL LDAIDAGSPL
IINGKEIATI ADIPMPHQLC DPQIVGQIFP GLASGTGNFF SLNWFAFSDF LTFKGGLNPV
TGSLWMTDIA HHHLAIAVLF IIAGHMYRTN YGIGHSMKEI LDAHQGDPIL FPAPRGHQGL
FDFMAESRHA QLSVNLALLG SLSIIISHHM YAMPPYPYIA TDYMTVLGLF THHMWIGGLF
IVGAGAHAGI AMVRDYDPAK HIDNVLDRVL KARDALISHL NWVCMWLGFH SFGLYIHNDT
MRALGRPQDM FSDKAIQLQP IFAQWIQNIQ SSGVGTTLLE GNGVSQVFNG ETISVGGKVA
MTGIPLGTAD LMIHHIHAFQ IHVTVLILLK GVLYARSSRL IPDKASLGFR FPCDGPGRGG
TCQVSSWDHV FLALFWMYNC LSIVIFHFSW KMQSDVWGLT GGNFAQSAIT INGWLRDFLW
AQAAQVLTSY GQSISMYGLM FLGAHFIWAF SLMFLFSGRG YWQELFESIV WAHNKLKVAP
TIQPRALSIT QGRAVGVTHF LVGGIATTWA FFHARLFGIG