Gene NATL1_19571 details


Gene Information

Locus tag: NATL1_19571
Symbol: psaA
ID: 4779260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1613778
End bp: 1616084
Gene Length: 2307 bp
Protein Length: 768 aa
Translation table: 11
GC content: 42%
IMG OID: 640085247
Product: photosystem I P700 chlorophyll a apoprotein A1
Protein accession: YP_001015777
Protein GI: 124026662
COG category:
COG ID:
TIGRFAM ID: [TIGR01335] photosystem I core protein PsaA
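
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes one terminal stop codon; `check_record` is a hypothetical helper name.

```python
def check_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the CDS ends with a single stop codon.
    """
    span = end_bp - start_bp + 1      # inclusive span on the replicon
    codons = gene_length_bp // 3      # codon count, stop codon included
    return {
        "span_matches_length": span == gene_length_bp,
        "length_is_multiple_of_3": gene_length_bp % 3 == 0,
        "protein_matches_length": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields of this record.
result = check_record(1613778, 1616084, 2307, 768)
for name, ok in result.items():
    print(f"{name}: {ok}")
```

Under these assumptions all three checks pass for this record: 1616084 - 1613778 + 1 = 2307, and 2307 / 3 = 769 codons, that is, 768 residues plus a stop codon.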


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.300705
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATTA GCCCACCAGA AAAAGAACAA AAAAAAGAAC CGGTTCTCGA TAAACCTATC 
GAAACTGATG CAATCCCTGT AGATTTTTCC AAGCTTGATA AGCCTGGTTT TTGGTCAAAA
TCCCTTGCTA AAGGGCCAAA GACTACTACA TGGATTTGGA ATCTTCATGC TGATGCGCAT
GATTTTGATA CTCATGTTGG AGATCTCCAA GAAACCAGTA GAAAAGTATT TTCTGCTCAT
TTTGGACATC TAGCAGTCAT CTTTATTTGG ATGAGTGCAG CTTTTTTCCA TGGAGCTCGC
TTTTCTAATT ATTCTGGATG GCTCTCTGAT CCAACTCATG TCAAGCCAGG AGCACAAGTT
GTTTGGCCAA TAGTTGGTCA GGAGATGCTT AATGCGGATT TAGGCGGTAA TTATCACGGT
ATTCAGATCA CTTCTGGAAT TTTTCAGATG TGGAGAGGCT GGGGAATTAC CAATGAAACC
GAGCTCATGG CTTTAGCTAT TGGTGCACTA CTAATGGCAG CAATAATGTT GCACGGTGGC
ATATATCACT ATCACAAAGC TGCTCCCAAG CTTGATTGGT TTAGAAATCT CGAGTCTATG
CTCAATCACC ACATAGCTGG TCTAGTGGGA TTGGGTTCGA TTGCTTGGGC TGGACATTGC
ATTCACATTG GTGCACCTAC AGCAGCACTC ATGGATGCAA TTGATGCAGG AAAGCCTCTG
ATTATTGATG GAATTCCAAT TGCTTCGATT GCGGACATGC CTCTGCCCCA CGAGCTTTGC
AATCCTGCTA TTGCTAGTCA AATATTCCCT GGCCTCGCTG GAAGAACAGT TGAAAATTTC
TTTACGACTA ATTGGTGGGC GTTTAGTGAT TTCCTAACTT TCAAAGGTGG TCTAAATCCA
GTTACTGGTA GCTTATGGAT GACAGATATT TCTCATCATC ATTTAGCTTT TGGAGTGCTA
GCTGTATTGG GCGGTCATCT ATATAGAACA ATGTTTGGCA TTGGCCATAG CCTGAAAGAA
ATACTAGATA ATCATGCTGG AGATCCAATT CTTTTCCCTG CTCCAAATGG TCACAAAGGG
ATTTATGAGT TTTTAGCTAA TAGTTGGCAT GCTCAGCTTG GTTTAAACCT TGCAATGATT
GGCTCCTTGA GCATCATCAT TTCCCATCAC ATGTATGCGA TGCCCCCATA TCCTTACTTG
TCGATTGATT ACCCAACTGT CCTAGGTCTA TTCACTCACC ACATGTGGAT AGGAGGATTA
TTCATTGTTG GTGCAGCAGC TCATGCTGGT ATTGCAATGA TTAGAGACTA TGACCCAGCT
GTTCATATTG ATAACGTTCT AGACAGAATC TTGAAAGCAA GAGATGCATT AATTAGTCAT
CTTAATTGGG CTTGTATGTT CTTAGGTTTC CATAGTTTTG GTCTTTATAT TCATAACGAT
GTAATGCGTG CATTAGGAAG GCCTGCAGAT ATGTTCAGTG ATACAGGGAT CCAACTTCAA
CCTGTTTTTG CTCAGTGGAT TCAAAATATT CATAATTCAG CAGCTGGTTC TACCACTCTT
GCTGGTGCAA ACGTAAACCT TCAACCTGGA TTAGTTAGTG AAGTTTTTAA TGGTTCCGTA
AGTCAAGTTG GAGGAAAAAT TGGAATCGCT CCTATACCTT TAGGAACTGC TGATTTCATG
ATTCACCATA TCCATGCTTT TACTATCCAC GTAACCCTTC TGATTCTTCT AAAAGGAGTT
TTATTCGCAA GGAGCTCCAG ATTAATTCCT GACAAAGCGA ATCTTGGATT TAGATTCCCA
TGTGATGGAC CAGGAAGAGG AGGTACATGC CAAGTTTCAT CTTGGGATCA TGTTTTCCTT
GGATTGTTCT GGATGTATAA CGGCTTATCA GTAGTTATCT TCCACTTCTC ATGGAAGATG
CAAAGTGATG TATGGGGTCT AACAGGAGGA AACTTTGCTC AAAGTTCCAT AACTATCAAT
GGATGGCTTA GAGATTTCCT ATGGGCTCAG TCATCTCAGG TCCTAACAAG TTATGGTCAA
CCTATAAGCA TGTACGGTTT GATGTTCTTA GGAGCTCATT TCGTTTGGGC ATTTAGTCTT
ATGTTCCTAT TTAGTGGACG TGGTTACTGG CAAGAGTTAT TTGAGTCAAT CATTTGGGCT
CATAATAAAC TTAACTTGGC TCCAACCATC CAACCAAGGG CTTTATCTAT TACTCAAGGT
CGCGCAGTAG GAGCAGCTCA TTTCCTTCTT GGAGGAATTG CTACAACTTG GGCCTTCTTC
CATGCTCGCT TAATTGGTCT CGGCTGA
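
The gene sequence above is printed in blocks of ten bases, so any computation on it first has to strip the spaces and line breaks. As a rough sketch (not part of the original record), the GC content field could be recomputed as follows. `gc_content` is a hypothetical helper, and only the first line of the sequence is used here for brevity.

```python
def gc_content(seq_text):
    """Return the fraction of G and C bases in a whitespace-formatted sequence."""
    # Remove the spaces and newlines used for display formatting.
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence above; paste the full block to
# recompute the value reported in the GC content field.
first_line = "ATGACCATTA GCCCACCAGA AAAAGAACAA AAAAAAGAAC CGGTTCTCGA TAAACCTATC"
print(f"GC fraction of first 60 bp: {gc_content(first_line):.2f}")
```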
 
Protein sequence
MTISPPEKEQ KKEPVLDKPI ETDAIPVDFS KLDKPGFWSK SLAKGPKTTT WIWNLHADAH 
DFDTHVGDLQ ETSRKVFSAH FGHLAVIFIW MSAAFFHGAR FSNYSGWLSD PTHVKPGAQV
VWPIVGQEML NADLGGNYHG IQITSGIFQM WRGWGITNET ELMALAIGAL LMAAIMLHGG
IYHYHKAAPK LDWFRNLESM LNHHIAGLVG LGSIAWAGHC IHIGAPTAAL MDAIDAGKPL
IIDGIPIASI ADMPLPHELC NPAIASQIFP GLAGRTVENF FTTNWWAFSD FLTFKGGLNP
VTGSLWMTDI SHHHLAFGVL AVLGGHLYRT MFGIGHSLKE ILDNHAGDPI LFPAPNGHKG
IYEFLANSWH AQLGLNLAMI GSLSIIISHH MYAMPPYPYL SIDYPTVLGL FTHHMWIGGL
FIVGAAAHAG IAMIRDYDPA VHIDNVLDRI LKARDALISH LNWACMFLGF HSFGLYIHND
VMRALGRPAD MFSDTGIQLQ PVFAQWIQNI HNSAAGSTTL AGANVNLQPG LVSEVFNGSV
SQVGGKIGIA PIPLGTADFM IHHIHAFTIH VTLLILLKGV LFARSSRLIP DKANLGFRFP
CDGPGRGGTC QVSSWDHVFL GLFWMYNGLS VVIFHFSWKM QSDVWGLTGG NFAQSSITIN
GWLRDFLWAQ SSQVLTSYGQ PISMYGLMFL GAHFVWAFSL MFLFSGRGYW QELFESIIWA
HNKLNLAPTI QPRALSITQG RAVGAAHFLL GGIATTWAFF HARLIGLG
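
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is illustrative only and not part of the original record. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares; table 11 differs mainly in which codons may act as start codons, and that does not matter here because this gene begins with ATG. `translate` and `CODON_TABLE` are hypothetical names.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq_text):
    """Translate a whitespace-formatted DNA sequence up to the first stop codon."""
    dna = "".join(seq_text.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X marks unknown codons
        if amino == "*":  # stop codon: end of the protein
            break
        protein.append(amino)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence above.
print(translate("ATGACCATTA GCCCACCAGA AAAAGAACAA"))  # MTISPPEKEQ
print(translate("CATGCTCGCT TAATTGGTCT CGGCTGA"))     # HARLIGLG
```

The two printed fragments match the start and the end of the protein sequence listed above. The final line can be translated on its own because it begins on a codon boundary: 2280 bases precede it.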