Gene Information |
Locus tag | P9211_16391 |
Symbol | psaA |
ID | 5730140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1477076 |
End bp | 1479397 |
Gene Length | 2322 bp |
Protein Length | 773 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641286018 |
Product | photosystem I P700 chlorophyll a apoprotein A1 |
Protein accession | YP_001551524 |
Protein GI | 159904180 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01335] photosystem I core protein PsaA |
| |
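The coordinate and length fields above agree with one another. A minimal sketch of that arithmetic follows, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (both are assumptions, though the values in this record are consistent with them). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1477076
END_BP = 1479397


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming a complete CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

With the values from this record, the results match the listed Gene Length (2322 bp) and Protein Length (773 aa).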
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.647233 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATTA GCCCACCAGA ACGTGGGGAG AAAGCAAAAG GAGGAGTGCC AACTCCTTAT GACCAGCCAG TTGATAGAGG CAATGCTCCT GTCGACTTTG AAAAACTCAA CAAACCTGGG TTCTGGTCTT CAAAGCTTTC CAAAGGTCCA AAAACCACTA CATGGATTTG GAACCTCCAC GCTGACGCAC ATGATTTCGA CACTCATCTA GGAGACCTAG AAGAGACAAG TAGGAAGATC TTCTCAGCCC ATTTTGGCCA CTTGGCTGTG GTTTTTATAT GGATGAGTGC TGCATTCTTC CATGGAGCTC GCTTCTCTAA TTACACTGGT TGGTTAGCTG ACCCAACAAA CGTAAAGCCT GGTGCCCAGG TTGTTTGGCC AGTAGTTGGA CAAGAAATTC TTAATGCTGA TTTAGGAGGC AATTATCACG GTCTCCAAAT CACATCAGGT ATTTTCCAAA TGTGGCGTGC TTGGGGTATT ACCAGCGAAG TACAACTAAT GGCATTAGCC ATTGGTGCAG TTGTAATGGC TGCCTTAATG CTTCACGGTG GAATTTATCA TTATCACAAA GCAGCTCCAA AACTTGAGTG GTTCCAAAAA ATAGAACCAA TGATCCAGCA CCATCAGATA GCTTTGATAG GTCTGGGATC AATTGCATGG GCAGGTCACC TAATTCATAT TGGTGCCCCG GTATCAGCTC TATTAGATGC AATTGATGCA GGGAATCCAT TAGTAGTTGA TGGTGTATCA ATTGCAAGTG CTTCTGATGT CACTAACCTT GCTCCAAAGC TTTGTGATCC AGCTATAGCA AGTCAGATTT TCCCTAGCCT AGCTGGCCGA ACTGTTGAGA ATTTCTTTAC TCTCAACTGG TGGGCGTTCA CCGATATCCT TACAAACAAA GGTGGCTTGA ATCCTGTAAC AGGAAGCCTT TGGATGACTG ATATCTCTCA TCACCACCTA GCGTTTGGTG TCTTTGCCAT CTTTGGTGGT CATATGTGGC GTAACGCGGT TCATGGCGTA GGTCACAGCA TGAAAGAGAT AATGGATGTT CATAAGGGTG ATCCAATCCT TTTCCCTGCT CCAAAAGGGC ATGAAGGGAT TTTTGATTTC TTAAGTAATA GTTGGCATGG CCAATTAAGT ATCAATCTCG CAATGGTTGG GTCAGCAAGT ATTGTTGTTG CCCATCACAT GTATGCGCTT CCTCCATATC CATACATTGC TATTGATTAT CCAACTGTAC TTGGATTATT TACTCACCAC ATGTGGATAG GTGGTTTATT TATCTGTGGA GCAGCAGCTC ATGCTGGCAT AGCGATGATC AGGGACTACG ACCCTGCGAT TCATGTAGAT AACGTTCTTG ATAGAATGCT CAAGGCTCGT GATGCAATTA TCAGCCACTT AAACTGGGTT TGCATGTGGC TTGGATTCCA TAGTTTTGGT CTTTATATTC ATAACGATGT CATGAGAGCT CTTGGACGCC CTCAAGACAT GTTTAGTGAT ACAGGAATTC AATTACAGCC TTTCTTGGCA CAATGGGTTC AAAACTTACA GCAAAGTGCT GTTGGTACAG GTCAGCTTGT AGGTGCAGGA AATTTACCTG GCAACGTTCT TAGTGAAGTA TTCAATGGAA ATGTCATTGA AGTTGGAGGA AAAGTTGCTA TTGGGCCGAT CCCATTAGGA ACTGCAGACT TGATGATTCA TCATGTTCAT GCATTCACAA TTCATGTAAC TCTCCTAATT CTTTTAAAAG GAGTTCTTTA CTCAAGAAGT TCTCGTCTCA TTCCTGACAA AGCTCAACTT 
GGCTTCCGTT TCCCATGTGA TGGGCCAGGA AGAGGAGGCA CATGTCAGGT TTCCTCATGG GATCATGTTT TCCTTGGCCT ATTTTGGATG TATAACAGTC TCTCTGTTGT TATCTTCCAT TTCTCATGGA AAATGCAAAG TGATGTATGG GGCTTAACTG GTGGAAATTT TGCCCAAAGC TCAATCACAA TTAATGGTTG GCTTAGGGAT TTCCTATGGG CACAATCTTC TCAGGTTCTA ACAAGCTATG GACAACCCAT TAGTATGTAT GGACTAATGT TCCTTGGTGC TCACTTTGTT TGGGCATTCA GCCTTATGTT CCTATTCAGT GGCCGTGGTT ACTGGCAGGA ACTTTTTGAG TCAATCGTTT GGGCCCATAA CAAATTGAAA GTGGCTCCAA CAATTCAACC TCGTGCATTG TCTATTACTC AAGGCCGTGC AGTTGGTGTT GCTCACTTCC TTCTAGGTGG AATAGCTACC ACCTGGGCCT TCTTCCACGC CCGCCTTATT GGGCTCGGCT AA
|
Protein sequence | MTISPPERGE KAKGGVPTPY DQPVDRGNAP VDFEKLNKPG FWSSKLSKGP KTTTWIWNLH ADAHDFDTHL GDLEETSRKI FSAHFGHLAV VFIWMSAAFF HGARFSNYTG WLADPTNVKP GAQVVWPVVG QEILNADLGG NYHGLQITSG IFQMWRAWGI TSEVQLMALA IGAVVMAALM LHGGIYHYHK AAPKLEWFQK IEPMIQHHQI ALIGLGSIAW AGHLIHIGAP VSALLDAIDA GNPLVVDGVS IASASDVTNL APKLCDPAIA SQIFPSLAGR TVENFFTLNW WAFTDILTNK GGLNPVTGSL WMTDISHHHL AFGVFAIFGG HMWRNAVHGV GHSMKEIMDV HKGDPILFPA PKGHEGIFDF LSNSWHGQLS INLAMVGSAS IVVAHHMYAL PPYPYIAIDY PTVLGLFTHH MWIGGLFICG AAAHAGIAMI RDYDPAIHVD NVLDRMLKAR DAIISHLNWV CMWLGFHSFG LYIHNDVMRA LGRPQDMFSD TGIQLQPFLA QWVQNLQQSA VGTGQLVGAG NLPGNVLSEV FNGNVIEVGG KVAIGPIPLG TADLMIHHVH AFTIHVTLLI LLKGVLYSRS SRLIPDKAQL GFRFPCDGPG RGGTCQVSSW DHVFLGLFWM YNSLSVVIFH FSWKMQSDVW GLTGGNFAQS SITINGWLRD FLWAQSSQVL TSYGQPISMY GLMFLGAHFV WAFSLMFLFS GRGYWQELFE SIVWAHNKLK VAPTIQPRAL SITQGRAVGV AHFLLGGIAT TWAFFHARLI GLG
|
| |
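The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that grouping and compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and the code assumes the sequence contains only whitespace and nucleotide letters.

```python
def ungroup(seq: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a whitespace-grouped nucleotide string."""
    bases = ungroup(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # First two blocks of the gene sequence in this record.
    sample = "ATGACCATTA GCCCACCAGA"
    print(ungroup(sample), round(gc_fraction(sample), 2))
```

Passing the full Gene sequence field to `gc_fraction` gives a value that can be checked against the reported GC content.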