Gene Information |
Locus tag | NATL1_19571 |
Symbol | psaA |
ID | 4779260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1613778 |
End bp | 1616084 |
Gene Length | 2307 bp |
Protein Length | 768 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640085247 |
Product | photosystem I P700 chlorophyll a apoprotein A1 |
Protein accession | YP_001015777 |
Protein GI | 124026662 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01335] photosystem I core protein PsaA |
| |
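The length fields in the table above follow from the coordinates. Assuming the Start bp and End bp values are 1-based and inclusive (an assumption about this record's convention, though it is consistent with the reported numbers), the gene length is End minus Start plus one, and the protein length of an unspliced CDS is the codon count minus one for the stop codon. A minimal sketch of that check follows; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length_bp = gene_length(1613778, 1616084)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2307 768
```

Both values agree with the Gene Length and Protein Length rows of the record.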
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.300705 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATTA GCCCACCAGA AAAAGAACAA AAAAAAGAAC CGGTTCTCGA TAAACCTATC GAAACTGATG CAATCCCTGT AGATTTTTCC AAGCTTGATA AGCCTGGTTT TTGGTCAAAA TCCCTTGCTA AAGGGCCAAA GACTACTACA TGGATTTGGA ATCTTCATGC TGATGCGCAT GATTTTGATA CTCATGTTGG AGATCTCCAA GAAACCAGTA GAAAAGTATT TTCTGCTCAT TTTGGACATC TAGCAGTCAT CTTTATTTGG ATGAGTGCAG CTTTTTTCCA TGGAGCTCGC TTTTCTAATT ATTCTGGATG GCTCTCTGAT CCAACTCATG TCAAGCCAGG AGCACAAGTT GTTTGGCCAA TAGTTGGTCA GGAGATGCTT AATGCGGATT TAGGCGGTAA TTATCACGGT ATTCAGATCA CTTCTGGAAT TTTTCAGATG TGGAGAGGCT GGGGAATTAC CAATGAAACC GAGCTCATGG CTTTAGCTAT TGGTGCACTA CTAATGGCAG CAATAATGTT GCACGGTGGC ATATATCACT ATCACAAAGC TGCTCCCAAG CTTGATTGGT TTAGAAATCT CGAGTCTATG CTCAATCACC ACATAGCTGG TCTAGTGGGA TTGGGTTCGA TTGCTTGGGC TGGACATTGC ATTCACATTG GTGCACCTAC AGCAGCACTC ATGGATGCAA TTGATGCAGG AAAGCCTCTG ATTATTGATG GAATTCCAAT TGCTTCGATT GCGGACATGC CTCTGCCCCA CGAGCTTTGC AATCCTGCTA TTGCTAGTCA AATATTCCCT GGCCTCGCTG GAAGAACAGT TGAAAATTTC TTTACGACTA ATTGGTGGGC GTTTAGTGAT TTCCTAACTT TCAAAGGTGG TCTAAATCCA GTTACTGGTA GCTTATGGAT GACAGATATT TCTCATCATC ATTTAGCTTT TGGAGTGCTA GCTGTATTGG GCGGTCATCT ATATAGAACA ATGTTTGGCA TTGGCCATAG CCTGAAAGAA ATACTAGATA ATCATGCTGG AGATCCAATT CTTTTCCCTG CTCCAAATGG TCACAAAGGG ATTTATGAGT TTTTAGCTAA TAGTTGGCAT GCTCAGCTTG GTTTAAACCT TGCAATGATT GGCTCCTTGA GCATCATCAT TTCCCATCAC ATGTATGCGA TGCCCCCATA TCCTTACTTG TCGATTGATT ACCCAACTGT CCTAGGTCTA TTCACTCACC ACATGTGGAT AGGAGGATTA TTCATTGTTG GTGCAGCAGC TCATGCTGGT ATTGCAATGA TTAGAGACTA TGACCCAGCT GTTCATATTG ATAACGTTCT AGACAGAATC TTGAAAGCAA GAGATGCATT AATTAGTCAT CTTAATTGGG CTTGTATGTT CTTAGGTTTC CATAGTTTTG GTCTTTATAT TCATAACGAT GTAATGCGTG CATTAGGAAG GCCTGCAGAT ATGTTCAGTG ATACAGGGAT CCAACTTCAA CCTGTTTTTG CTCAGTGGAT TCAAAATATT CATAATTCAG CAGCTGGTTC TACCACTCTT GCTGGTGCAA ACGTAAACCT TCAACCTGGA TTAGTTAGTG AAGTTTTTAA TGGTTCCGTA AGTCAAGTTG GAGGAAAAAT TGGAATCGCT CCTATACCTT TAGGAACTGC TGATTTCATG ATTCACCATA TCCATGCTTT TACTATCCAC GTAACCCTTC TGATTCTTCT AAAAGGAGTT TTATTCGCAA GGAGCTCCAG ATTAATTCCT GACAAAGCGA ATCTTGGATT TAGATTCCCA 
TGTGATGGAC CAGGAAGAGG AGGTACATGC CAAGTTTCAT CTTGGGATCA TGTTTTCCTT GGATTGTTCT GGATGTATAA CGGCTTATCA GTAGTTATCT TCCACTTCTC ATGGAAGATG CAAAGTGATG TATGGGGTCT AACAGGAGGA AACTTTGCTC AAAGTTCCAT AACTATCAAT GGATGGCTTA GAGATTTCCT ATGGGCTCAG TCATCTCAGG TCCTAACAAG TTATGGTCAA CCTATAAGCA TGTACGGTTT GATGTTCTTA GGAGCTCATT TCGTTTGGGC ATTTAGTCTT ATGTTCCTAT TTAGTGGACG TGGTTACTGG CAAGAGTTAT TTGAGTCAAT CATTTGGGCT CATAATAAAC TTAACTTGGC TCCAACCATC CAACCAAGGG CTTTATCTAT TACTCAAGGT CGCGCAGTAG GAGCAGCTCA TTTCCTTCTT GGAGGAATTG CTACAACTTG GGCCTTCTTC CATGCTCGCT TAATTGGTCT CGGCTGA
|
Protein sequence | MTISPPEKEQ KKEPVLDKPI ETDAIPVDFS KLDKPGFWSK SLAKGPKTTT WIWNLHADAH DFDTHVGDLQ ETSRKVFSAH FGHLAVIFIW MSAAFFHGAR FSNYSGWLSD PTHVKPGAQV VWPIVGQEML NADLGGNYHG IQITSGIFQM WRGWGITNET ELMALAIGAL LMAAIMLHGG IYHYHKAAPK LDWFRNLESM LNHHIAGLVG LGSIAWAGHC IHIGAPTAAL MDAIDAGKPL IIDGIPIASI ADMPLPHELC NPAIASQIFP GLAGRTVENF FTTNWWAFSD FLTFKGGLNP VTGSLWMTDI SHHHLAFGVL AVLGGHLYRT MFGIGHSLKE ILDNHAGDPI LFPAPNGHKG IYEFLANSWH AQLGLNLAMI GSLSIIISHH MYAMPPYPYL SIDYPTVLGL FTHHMWIGGL FIVGAAAHAG IAMIRDYDPA VHIDNVLDRI LKARDALISH LNWACMFLGF HSFGLYIHND VMRALGRPAD MFSDTGIQLQ PVFAQWIQNI HNSAAGSTTL AGANVNLQPG LVSEVFNGSV SQVGGKIGIA PIPLGTADFM IHHIHAFTIH VTLLILLKGV LFARSSRLIP DKANLGFRFP CDGPGRGGTC QVSSWDHVFL GLFWMYNGLS VVIFHFSWKM QSDVWGLTGG NFAQSSITIN GWLRDFLWAQ SSQVLTSYGQ PISMYGLMFL GAHFVWAFSL MFLFSGRGYW QELFESIIWA HNKLNLAPTI QPRALSITQG RAVGAAHFLL GGIATTWAFF HARLIGLG
|
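The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any length or composition calculation. The sketch below shows one way to do that and to compute a GC fraction. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, so its result is not expected to match the whole-gene GC content reported in the record.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    bases = normalize(seq)
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First two 10-base blocks of the gene sequence shown above.
prefix = "ATGACCATTA GCCCACCAGA"
print(len(normalize(prefix)), gc_fraction(prefix))  # 20 0.5
```

Passing the full gene sequence, with both of its lines joined, to `gc_fraction` would give the value to compare against the GC content row.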