Gene PMN2A_1082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPMN2A_1082 
SymbolpsaA 
ID3606469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL2A 
KingdomBacteria 
Replicon accessionNC_007335 
Strand
Start bp1577169 
End bp1579475 
Gene Length2307 bp 
Protein Length768 aa 
Translation table11 
GC content42% 
IMG OID637687952 
Productphotosystem I P700 chlorophyll a apoprotein A1 
Protein accessionYP_292275 
Protein GI72382920 
COG category 
COG ID 
TIGRFAM ID[TIGR01335] photosystem I core protein PsaA 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATTA GCCCACCAGA AAAAGAACAA AAAAAAGAAC CGGTTCTCGA TAAACCTATC 
GAAACTGATG CAATCCCTGT AGATTTTTCC AAGCTTGATA AGCCTGGTTT TTGGTCAAAA
TCCCTTGCTA AAGGGCCAAA GACTACTACA TGGATATGGA ATCTTCATGC TGATGCGCAT
GATTTTGATA CTCATGTTGG AGATCTCCAA GAAACCAGTA GAAAAGTATT TTCTGCTCAT
TTTGGACATC TAGCAGTCAT CTTTATTTGG ATGAGTGCAG CTTTTTTCCA TGGAGCTCGC
TTTTCTAATT ATTCTGGATG GCTCTCTGAT CCAACTCATG TCAAGCCAGG AGCACAAGTT
GTTTGGCCAA TAGTTGGTCA GGAGATGCTT AATGCGGATT TAGGCGGTAA TTATCACGGT
ATTCAGATCA CTTCTGGAAT TTTTCAGATG TGGAGAGGCT GGGGAATTAC CAATGAAACC
GAGCTCATGG CTTTAGCTAT TGGTGCACTA CTAATGGCAG CCATAATGTT GCACGGTGGC
ATATATCACT ATCACAAAGC TGCTCCCAAG CTTGATTGGT TTAGAAATCT AGAGTCTATG
CTCAATCACC ACATAGCTGG TCTAGTGGGA TTGGGTTCGA TTGCATGGGC TGGACATTGC
ATTCACATTG GTGCACCTAC AGCAGCACTC ATGGATGCAA TTGATGCAGG AAAGCCTCTA
ATTATTGATG GAATTCCAAT TGCTTCGATT GCGGACATGC CTCTGCCCCA CGAGCTTTGC
AATCCTGCTA TTGCTAGTCA AATATTCCCT GGCCTCGCTG GAAGAACAGT TGAAAATTTC
TTTACGACTA ATTGGTGGGC GTTTAGTGAT TTCCTAACTT TCAAAGGTGG TCTAAATCCA
GTTACTGGTA GCTTATGGAT GACAGATATT TCTCATCATC ATTTAGCTTT TGGAGTACTA
GCTGTATTGG GCGGTCATCT ATATAGAACA ATGTTTGGCA TTGGCCATAG CCTGAAAGAA
ATACTAGATA ATCATGCTGG AGATCCAATT CTTTTCCCTG CTCCAAATGG TCATAAAGGG
ATTTATGAGT TTTTAGCTAA TAGTTGGCAT GCTCAGCTTG GTTTAAACCT TGCAATGATT
GGCTCCTTGA GCATCATCAT TTCCCATCAC ATGTATGCGA TGCCCCCATA TCCGTACTTG
TCGATTGATT ACCCAACTGT CCTAGGTCTA TTCACTCACC ACATGTGGAT AGGAGGATTA
TTCATTGTTG GTGCAGCAGC TCATGCTGGT ATTGCAATGA TTAGAGACTA TGACCCAGCT
GTTCATATTG ATAACGTTCT AGACAGAATC TTGAAAGCAA GAGATGCATT AATTAGTCAT
CTGAATTGGG CTTGCATGTT CTTAGGTTTC CATAGTTTTG GTCTTTATAT TCATAACGAT
GTAATGCGTG CATTAGGAAG ACCTGCAGAT ATGTTCAGTG ATACAGGAAT CCAACTTCAA
CCTGTTTTTG CTCAGTGGAT TCAAAATATT CATAATTCAG CAGCTGGTTC TACCACTCTT
GCTGGTGCAA ACGTAAGCCT TCAACCTGGA TTAGTTAGTG AAGTTTTTAA TGGTTCCGTA
AGTCAAGTTG GAGGAAAAAT TGGAATCGCT CCTATACCTT TAGGAACTGC TGATTTCATG
ATTCACCATA TCCATGCTTT TACTATCCAC GTAACCCTTC TGATTCTTCT AAAAGGAGTT
TTATTCGCAA GGAGCTCCAG ACTAATTCCT GACAAAGCGA ATCTTGGATT TAGATTCCCA
TGTGATGGAC CAGGAAGAGG AGGTACATGC CAAGTTTCAT CTTGGGATCA TGTTTTCCTT
GGATTGTTCT GGATGTATAA CGGCTTATCA GTAGTTATCT TCCACTTCTC ATGGAAGATG
CAAAGTGATG TATGGGGTCT AACAGGAGGA AACTTTGCTC AAAGTTCCAT AACTATCAAT
GGATGGCTTA GAGATTTCCT ATGGGCTCAG TCATCTCAGG TCCTAACAAG TTATGGTCAA
CCTATAAGCA TGTACGGTTT GATGTTCTTA GGAGCTCATT TCGTTTGGGC ATTTAGTCTT
ATGTTCCTAT TTAGTGGACG TGGTTACTGG CAAGAGTTAT TTGAGTCAAT CATTTGGGCT
CATAATAAAC TTAACTTGGC TCCAACCATC CAACCAAGGG CTTTATCTAT CACTCAAGGT
CGCGCAGTAG GAGCAGCTCA TTTCCTTCTT GGAGGAATTG CTACAACTTG GGCCTTCTTC
CATGCTCGCT TAATTGGTCT CGGCTGA
 
Protein sequence
MTISPPEKEQ KKEPVLDKPI ETDAIPVDFS KLDKPGFWSK SLAKGPKTTT WIWNLHADAH 
DFDTHVGDLQ ETSRKVFSAH FGHLAVIFIW MSAAFFHGAR FSNYSGWLSD PTHVKPGAQV
VWPIVGQEML NADLGGNYHG IQITSGIFQM WRGWGITNET ELMALAIGAL LMAAIMLHGG
IYHYHKAAPK LDWFRNLESM LNHHIAGLVG LGSIAWAGHC IHIGAPTAAL MDAIDAGKPL
IIDGIPIASI ADMPLPHELC NPAIASQIFP GLAGRTVENF FTTNWWAFSD FLTFKGGLNP
VTGSLWMTDI SHHHLAFGVL AVLGGHLYRT MFGIGHSLKE ILDNHAGDPI LFPAPNGHKG
IYEFLANSWH AQLGLNLAMI GSLSIIISHH MYAMPPYPYL SIDYPTVLGL FTHHMWIGGL
FIVGAAAHAG IAMIRDYDPA VHIDNVLDRI LKARDALISH LNWACMFLGF HSFGLYIHND
VMRALGRPAD MFSDTGIQLQ PVFAQWIQNI HNSAAGSTTL AGANVSLQPG LVSEVFNGSV
SQVGGKIGIA PIPLGTADFM IHHIHAFTIH VTLLILLKGV LFARSSRLIP DKANLGFRFP
CDGPGRGGTC QVSSWDHVFL GLFWMYNGLS VVIFHFSWKM QSDVWGLTGG NFAQSSITIN
GWLRDFLWAQ SSQVLTSYGQ PISMYGLMFL GAHFVWAFSL MFLFSGRGYW QELFESIIWA
HNKLNLAPTI QPRALSITQG RAVGAAHFLL GGIATTWAFF HARLIGLG