Gene PCC8801_4103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4103 
SymbolpsaA 
ID7101895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4297790 
End bp4300054 
Gene Length2265 bp 
Protein Length754 aa 
Translation table11 
GC content52% 
IMG OID643477092 
Productphotosystem I P700 chlorophyll a apoprotein A1 
Protein accessionYP_002374191 
Protein GI218248820 
COG category 
COG ID 
TIGRFAM ID[TIGR01335] photosystem I core protein PsaA 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAATTA GTCCTCCCGA GAGAGAGGCG AAAGTCAAAG TCACAGTAGA TACTGATCCC 
GTTCCCGCTT CCTTTGAGAA GTGGGGTCAA CCAGGTCACT TCAGCCGGAC TTTGGCTAAA
GGTCCCAAAA CCACCACTTG GATTTGGAAT CTTCACGCCG ATGCACATGA TTTTGATAGT
CAAACCAGTG ACTTAGAAGA TGTTTCGCGC AAAATCTTTA GCGCGCACTT TGGTCACTTA
GCCGTTATCT TTGTTTGGCT GAGCGGCATG TATTTCCATG GCGCTCGATT TTCTAATTAC
GAAGCTTGGT TAACAGACCC CACCGCCATT AAGCCCAGTG CTCAAGTGGT TTGGCCGATT
GTTGGTCAAG GCATTTTGAA CGCTGACGTA GGCGGAGGCT TCCATGGGAT TCAGATCACC
TCTGGTCTGT TCTATCTGTG GAGAGCCTCT GGTTTCACCA ATAGCTACCA GTTGTACTGT
ACAGCGATTG GCGGTTTAGT GATGGCTGGC CTGATGCTGT TTGCCGGTTG GTTCCACTAC
CACAAAAAAG CTCCCAAGCT AGAGTGGTTC CAAAACGTCG AATCGATGAT GAATCACCAC
CTGGCAGGAC TACTGGGACT AGGCTCATTA GGTTGGGCTG GTCACCAGAT TCACGTCTCC
TTGCCCATCA ACAAACTTTT GGATGCAGGA GTGGCCGCCA AGGACATTCC CTTACCCCAT
GAGTTCATCC TCGATAGCAG CAAAATGGCT GAATTGTATC CCAGCTTTGC CCAAGGGTTA
ACCCCCTTCT TCACCCTGAA CTGGGGAGTC TATTCGGACT TCTTAACCTT CAAGGGTGGA
TTAAACCCCG TTACCGGAGG GTTATGGCTA TCGGATACCG CTCATCACCA CTTAGCGATC
GCGGTTCTGT TCATCATTGC TGGTCATATG TACCGCACCA ACTGGGGCAT TGGCCATAGC
ATGAAGGAAA TCTTAGACGG TCACAAAGGA GATCCCCTGC TGTTTGGCGG AGAAGGACAC
ACAGGATTGT ATGAAGTCCT GACAACTTCT TGGCACGCCC AACTAGCCAT TAACCTAGCC
CTGCTAGGCT CCTTAAGCAT CATCGTGGCC CACCATATGT ATGCTATGCC GCCCTATCCG
TACCAAGCGA TCGACTACGG AACCCAGTTG TCCCTCTTTA CCCACCATGT GTGGATTGGA
GGCTTCCTGA TCGTTGGTGC TGGAGCCCAC GGTGCCATCT TCATGGTACG CGACTACGAT
CCCGCCAAGA ACGTTAACAA CGCGCTTGAT CGCGTGATTC GCTCACGGGA TGCCATTATT
TCCCACCTCA ATTGGGTGTG TATTTTCCTC GGCTTCCATA GCTTCGGACT CTACATCCAC
AACGACACCA TGCGGGCCCT CGGTCGTCCC CAAGATATGT TCTCGGATAC GGCCATCAAA
CTACAACCCA TCTTTGCCCA GTGGGTTCAA AACCTCCATT TCCTAGCCCC TGGTGGCACT
GCGCCCTACG CTGGAGCCCC TGCTAGTTAC GCCTTTGGAG GAGAAACTGT AGCGATCGCT
GGCAAAGTGG CTATTATGCC CATTGCCCTA GGAACGGCGG ATTTCATGGT GCACCATATC
CATGCTTTCA CCATCCACGT CACCGTCTTA ATCCTTCTCA AAGGGGTACT CTACGCCCGT
AACTCTCGTC TGATTCCTGA CAAGAGCAAC CTCGGCTTCC GCTTCCCCTG TGATGGACCT
GGACGGGGTG GTACCTGTCA AGTCTCTGGT TGGGACCATG TGTTCCTCGG CTTGTTCTGG
ATGTACAACT CCCTCTCCAT CGTCATTTTC CACTTCAGTT GGAAGATGCA GTCGGATGTG
TGGGGAACCG TCGCACCCGA CGGCACCGTT AGTCATGTTA CTGGCGGAAA CTTTGCCCAA
AGTGCCATTA CCATCAATGG TTGGTTACGG GACTTCCTCT GGGCACAGGC AGCGAATGTG
ATCAACTCCT ACGGTTCGGC ACTGTCGGCC TACGGCATCA TGTTCCTAGC CGGTCACTTC
GTGTTTGCCT TTAGCCTGAT GTTCCTGTTC AGTGGTCGCG GCTACTGGCA AGAACTGATT
GAGTCTATTG TTTGGGCTCA CAACAAATTA AAAGTTGCAC CCGCCATTCA ACCTCGCGCT
CTGAGCATCA TTCAGGGACG GGCAGTGGGT GTTGCTCACT ACCTGTTAGG GGGAATCGTC
ACTACTTGGG CGTTCTTCCT GGCAAGAAGC CTATCAATTG GCTAA
 
Protein sequence
MTISPPEREA KVKVTVDTDP VPASFEKWGQ PGHFSRTLAK GPKTTTWIWN LHADAHDFDS 
QTSDLEDVSR KIFSAHFGHL AVIFVWLSGM YFHGARFSNY EAWLTDPTAI KPSAQVVWPI
VGQGILNADV GGGFHGIQIT SGLFYLWRAS GFTNSYQLYC TAIGGLVMAG LMLFAGWFHY
HKKAPKLEWF QNVESMMNHH LAGLLGLGSL GWAGHQIHVS LPINKLLDAG VAAKDIPLPH
EFILDSSKMA ELYPSFAQGL TPFFTLNWGV YSDFLTFKGG LNPVTGGLWL SDTAHHHLAI
AVLFIIAGHM YRTNWGIGHS MKEILDGHKG DPLLFGGEGH TGLYEVLTTS WHAQLAINLA
LLGSLSIIVA HHMYAMPPYP YQAIDYGTQL SLFTHHVWIG GFLIVGAGAH GAIFMVRDYD
PAKNVNNALD RVIRSRDAII SHLNWVCIFL GFHSFGLYIH NDTMRALGRP QDMFSDTAIK
LQPIFAQWVQ NLHFLAPGGT APYAGAPASY AFGGETVAIA GKVAIMPIAL GTADFMVHHI
HAFTIHVTVL ILLKGVLYAR NSRLIPDKSN LGFRFPCDGP GRGGTCQVSG WDHVFLGLFW
MYNSLSIVIF HFSWKMQSDV WGTVAPDGTV SHVTGGNFAQ SAITINGWLR DFLWAQAANV
INSYGSALSA YGIMFLAGHF VFAFSLMFLF SGRGYWQELI ESIVWAHNKL KVAPAIQPRA
LSIIQGRAVG VAHYLLGGIV TTWAFFLARS LSIG