Gene P9211_16391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_16391 
SymbolpsaA 
ID5730140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1477076 
End bp1479397 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content44% 
IMG OID641286018 
Productphotosystem I P700 chlorophyll a apoprotein A1 
Protein accessionYP_001551524 
Protein GI159904180 
COG category 
COG ID 
TIGRFAM ID[TIGR01335] photosystem I core protein PsaA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.647233 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATTA GCCCACCAGA ACGTGGGGAG AAAGCAAAAG GAGGAGTGCC AACTCCTTAT 
GACCAGCCAG TTGATAGAGG CAATGCTCCT GTCGACTTTG AAAAACTCAA CAAACCTGGG
TTCTGGTCTT CAAAGCTTTC CAAAGGTCCA AAAACCACTA CATGGATTTG GAACCTCCAC
GCTGACGCAC ATGATTTCGA CACTCATCTA GGAGACCTAG AAGAGACAAG TAGGAAGATC
TTCTCAGCCC ATTTTGGCCA CTTGGCTGTG GTTTTTATAT GGATGAGTGC TGCATTCTTC
CATGGAGCTC GCTTCTCTAA TTACACTGGT TGGTTAGCTG ACCCAACAAA CGTAAAGCCT
GGTGCCCAGG TTGTTTGGCC AGTAGTTGGA CAAGAAATTC TTAATGCTGA TTTAGGAGGC
AATTATCACG GTCTCCAAAT CACATCAGGT ATTTTCCAAA TGTGGCGTGC TTGGGGTATT
ACCAGCGAAG TACAACTAAT GGCATTAGCC ATTGGTGCAG TTGTAATGGC TGCCTTAATG
CTTCACGGTG GAATTTATCA TTATCACAAA GCAGCTCCAA AACTTGAGTG GTTCCAAAAA
ATAGAACCAA TGATCCAGCA CCATCAGATA GCTTTGATAG GTCTGGGATC AATTGCATGG
GCAGGTCACC TAATTCATAT TGGTGCCCCG GTATCAGCTC TATTAGATGC AATTGATGCA
GGGAATCCAT TAGTAGTTGA TGGTGTATCA ATTGCAAGTG CTTCTGATGT CACTAACCTT
GCTCCAAAGC TTTGTGATCC AGCTATAGCA AGTCAGATTT TCCCTAGCCT AGCTGGCCGA
ACTGTTGAGA ATTTCTTTAC TCTCAACTGG TGGGCGTTCA CCGATATCCT TACAAACAAA
GGTGGCTTGA ATCCTGTAAC AGGAAGCCTT TGGATGACTG ATATCTCTCA TCACCACCTA
GCGTTTGGTG TCTTTGCCAT CTTTGGTGGT CATATGTGGC GTAACGCGGT TCATGGCGTA
GGTCACAGCA TGAAAGAGAT AATGGATGTT CATAAGGGTG ATCCAATCCT TTTCCCTGCT
CCAAAAGGGC ATGAAGGGAT TTTTGATTTC TTAAGTAATA GTTGGCATGG CCAATTAAGT
ATCAATCTCG CAATGGTTGG GTCAGCAAGT ATTGTTGTTG CCCATCACAT GTATGCGCTT
CCTCCATATC CATACATTGC TATTGATTAT CCAACTGTAC TTGGATTATT TACTCACCAC
ATGTGGATAG GTGGTTTATT TATCTGTGGA GCAGCAGCTC ATGCTGGCAT AGCGATGATC
AGGGACTACG ACCCTGCGAT TCATGTAGAT AACGTTCTTG ATAGAATGCT CAAGGCTCGT
GATGCAATTA TCAGCCACTT AAACTGGGTT TGCATGTGGC TTGGATTCCA TAGTTTTGGT
CTTTATATTC ATAACGATGT CATGAGAGCT CTTGGACGCC CTCAAGACAT GTTTAGTGAT
ACAGGAATTC AATTACAGCC TTTCTTGGCA CAATGGGTTC AAAACTTACA GCAAAGTGCT
GTTGGTACAG GTCAGCTTGT AGGTGCAGGA AATTTACCTG GCAACGTTCT TAGTGAAGTA
TTCAATGGAA ATGTCATTGA AGTTGGAGGA AAAGTTGCTA TTGGGCCGAT CCCATTAGGA
ACTGCAGACT TGATGATTCA TCATGTTCAT GCATTCACAA TTCATGTAAC TCTCCTAATT
CTTTTAAAAG GAGTTCTTTA CTCAAGAAGT TCTCGTCTCA TTCCTGACAA AGCTCAACTT
GGCTTCCGTT TCCCATGTGA TGGGCCAGGA AGAGGAGGCA CATGTCAGGT TTCCTCATGG
GATCATGTTT TCCTTGGCCT ATTTTGGATG TATAACAGTC TCTCTGTTGT TATCTTCCAT
TTCTCATGGA AAATGCAAAG TGATGTATGG GGCTTAACTG GTGGAAATTT TGCCCAAAGC
TCAATCACAA TTAATGGTTG GCTTAGGGAT TTCCTATGGG CACAATCTTC TCAGGTTCTA
ACAAGCTATG GACAACCCAT TAGTATGTAT GGACTAATGT TCCTTGGTGC TCACTTTGTT
TGGGCATTCA GCCTTATGTT CCTATTCAGT GGCCGTGGTT ACTGGCAGGA ACTTTTTGAG
TCAATCGTTT GGGCCCATAA CAAATTGAAA GTGGCTCCAA CAATTCAACC TCGTGCATTG
TCTATTACTC AAGGCCGTGC AGTTGGTGTT GCTCACTTCC TTCTAGGTGG AATAGCTACC
ACCTGGGCCT TCTTCCACGC CCGCCTTATT GGGCTCGGCT AA
 
Protein sequence
MTISPPERGE KAKGGVPTPY DQPVDRGNAP VDFEKLNKPG FWSSKLSKGP KTTTWIWNLH 
ADAHDFDTHL GDLEETSRKI FSAHFGHLAV VFIWMSAAFF HGARFSNYTG WLADPTNVKP
GAQVVWPVVG QEILNADLGG NYHGLQITSG IFQMWRAWGI TSEVQLMALA IGAVVMAALM
LHGGIYHYHK AAPKLEWFQK IEPMIQHHQI ALIGLGSIAW AGHLIHIGAP VSALLDAIDA
GNPLVVDGVS IASASDVTNL APKLCDPAIA SQIFPSLAGR TVENFFTLNW WAFTDILTNK
GGLNPVTGSL WMTDISHHHL AFGVFAIFGG HMWRNAVHGV GHSMKEIMDV HKGDPILFPA
PKGHEGIFDF LSNSWHGQLS INLAMVGSAS IVVAHHMYAL PPYPYIAIDY PTVLGLFTHH
MWIGGLFICG AAAHAGIAMI RDYDPAIHVD NVLDRMLKAR DAIISHLNWV CMWLGFHSFG
LYIHNDVMRA LGRPQDMFSD TGIQLQPFLA QWVQNLQQSA VGTGQLVGAG NLPGNVLSEV
FNGNVIEVGG KVAIGPIPLG TADLMIHHVH AFTIHVTLLI LLKGVLYSRS SRLIPDKAQL
GFRFPCDGPG RGGTCQVSSW DHVFLGLFWM YNSLSVVIFH FSWKMQSDVW GLTGGNFAQS
SITINGWLRD FLWAQSSQVL TSYGQPISMY GLMFLGAHFV WAFSLMFLFS GRGYWQELFE
SIVWAHNKLK VAPTIQPRAL SITQGRAVGV AHFLLGGIAT TWAFFHARLI GLG