Gene Cyan8802_4142 details

Gene Information

Locus tag: Cyan8802_4142
Symbol: psaA
ID: 8393493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 4268236
End bp: 4270500
Gene Length: 2265 bp
Protein Length: 754 aa
Translation table: 11
GC content: 52%
IMG OID: 644982057
Product: photosystem I P700 chlorophyll a apoprotein A1
Protein accession: YP_003139769
Protein GI: 257061881
COG category:
COG ID:
TIGRFAM ID: [TIGR01335] photosystem I core protein PsaA
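
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4268236
END_BP = 4270500
GENE_LENGTH_BP = 2265
PROTEIN_LENGTH_AA = 754


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed values: the span covers 2265 bp, and 2265 / 3 = 755 codons, one of which is the stop codon.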


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0335436
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.190714
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAATTA GTCCTCCCGA GAGAGAGGCG AAAGTCAAAG TCACAGTAGA TACTGATCCC 
GTTCCCGCTT CCTTTGAGAA GTGGGGTCAA CCAGGTCACT TCAGCCGGAC TTTGGCTAAA
GGTCCCAAAA CCACCACTTG GATTTGGAAT CTTCACGCCG ATGCACATGA TTTTGATAGT
CAAACCAGTG ACTTAGAAGA TGTTTCGCGC AAAATCTTTA GCGCGCACTT TGGTCACTTA
GCCGTTATCT TTGTTTGGCT GAGCGGCATG TATTTCCATG GCGCTCGATT TTCTAATTAC
GAAGCTTGGT TAACAGACCC CACCGCCATT AAGCCCAGCG CTCAAGTGGT TTGGCCGATT
GTTGGTCAAG GCATTTTGAA CGCTGACGTA GGCGGAGGCT TCCATGGGAT TCAGATCACC
TCTGGTCTGT TCTATCTGTG GAGAGCCTCT GGTTTCACCA ATAGCTACCA GTTGTACTGT
ACAGCGATTG GCGGTTTAGT GATGGCTGGC CTGATGCTGT TTGCCGGTTG GTTCCACTAC
CACAAAAAAG CTCCCAAGCT AGAGTGGTTC CAAAACGTCG AATCGATGAT GAATCACCAC
CTGGCAGGAC TACTGGGACT AGGCTCATTA GGTTGGGCTG GTCACCAGAT TCACGTCTCC
TTGCCCATCA ACAAACTTTT GGATGCAGGA GTGGCCGCCA AGGACATTCC CTTACCCCAT
GAGTTCATCC TCGATAGCAG CAAAATGGCT GAATTGTATC CCAGCTTTGC CCAAGGGTTA
ACCCCCTTCT TCACCCTGAA CTGGGGAGTC TATTCGGACT TCTTAACCTT CAAGGGTGGA
TTAAACCCCG TTACCGGAGG GTTATGGCTA TCGGATACCG CTCATCACCA CTTAGCGATC
GCGGTTCTGT TCATCATTGC TGGTCATATG TACCGCACCA ACTGGGGCAT TGGCCATAGC
ATGAAGGAAA TCTTAGACGG TCACAAAGGA GATCCCCTGC TGTTTGGCGG AGAAGGACAC
ACAGGATTGT ATGAAGTCCT GACAACTTCT TGGCACGCCC AACTAGCCAT TAACCTAGCC
CTGCTAGGCT CCTTAAGCAT CATCGTGGCC CACCATATGT ATGCTATGCC GCCCTATCCG
TACCAAGCGA TCGACTACGG AACCCAGTTG TCCCTCTTTA CCCACCATGT GTGGATTGGA
GGCTTCCTGA TCGTTGGTGC TGGAGCCCAC GGTGCCATCT TCATGGTACG CGACTACGAT
CCCGCCAAGA ACGTTAACAA CGCGCTTGAT CGCGTGATTC GCTCACGGGA TGCCATTATT
TCCCACCTCA ATTGGGTGTG TATTTTCCTC GGCTTCCATA GCTTCGGACT CTACATCCAC
AACGACACCA TGCGGGCCCT CGGTCGTCCC CAAGATATGT TCTCGGATAC GGCCATCAAA
CTACAACCCA TCTTTGCCCA GTGGGTTCAA AACCTCCATT TCCTAGCCCC TGGTGGCACT
GCGCCCTACG CTGGAGCCCC TGCTAGTTAC GCCTTTGGAG GAGAAACTGT AGCGATCGCT
GGCAAAGTGG CTATTATGCC CATTGCCCTG GGAACGGCGG ATTTCATGGT GCACCATATC
CATGCTTTCA CCATCCACGT TACCGTCTTA ATCCTTCTCA AAGGGGTACT CTATGCCCGT
AACTCTCGTC TAATTCCTGA CAAGAGTAAC CTCGGCTTCC GCTTCCCCTG TGATGGACCT
GGACGGGGTG GTACCTGTCA AGTCTCTGGT TGGGACCATG TGTTCCTCGG CTTGTTCTGG
ATGTATAACT CCCTATCGAT TGTGATTTTC CACTTCAGTT GGAAGATGCA GTCAGATGTC
TGGGGAACCG TCGCTCCCGA CGGCACCGTT AGTCATGTTA CTGGCGGAAA CTTTGCCCAA
AGTGCCATTA CCATCAATGG TTGGTTACGG GACTTCCTCT GGGCACAGGC AGCGAATGTG
ATCAACTCCT ACGGTTCGGC ACTGTCGGCC TACGGCATCA TGTTCCTAGC CGGTCACTTC
GTGTTTGCCT TTAGCCTGAT GTTCCTGTTC AGTGGTCGCG GCTACTGGCA AGAACTGATT
GAGTCTATTG TTTGGGCTCA CAACAAATTA AAAGTTGCAC CCGCCATTCA ACCTCGCGCT
CTGAGCATCA TTCAGGGACG GGCAGTGGGT GTTGCTCACT ACCTGTTAGG GGGAATCGTC
ACTACTTGGG CGTTCTTCCT GGCAAGAAGC CTATCAATTG GCTAA
 
Protein sequence
MTISPPEREA KVKVTVDTDP VPASFEKWGQ PGHFSRTLAK GPKTTTWIWN LHADAHDFDS 
QTSDLEDVSR KIFSAHFGHL AVIFVWLSGM YFHGARFSNY EAWLTDPTAI KPSAQVVWPI
VGQGILNADV GGGFHGIQIT SGLFYLWRAS GFTNSYQLYC TAIGGLVMAG LMLFAGWFHY
HKKAPKLEWF QNVESMMNHH LAGLLGLGSL GWAGHQIHVS LPINKLLDAG VAAKDIPLPH
EFILDSSKMA ELYPSFAQGL TPFFTLNWGV YSDFLTFKGG LNPVTGGLWL SDTAHHHLAI
AVLFIIAGHM YRTNWGIGHS MKEILDGHKG DPLLFGGEGH TGLYEVLTTS WHAQLAINLA
LLGSLSIIVA HHMYAMPPYP YQAIDYGTQL SLFTHHVWIG GFLIVGAGAH GAIFMVRDYD
PAKNVNNALD RVIRSRDAII SHLNWVCIFL GFHSFGLYIH NDTMRALGRP QDMFSDTAIK
LQPIFAQWVQ NLHFLAPGGT APYAGAPASY AFGGETVAIA GKVAIMPIAL GTADFMVHHI
HAFTIHVTVL ILLKGVLYAR NSRLIPDKSN LGFRFPCDGP GRGGTCQVSG WDHVFLGLFW
MYNSLSIVIF HFSWKMQSDV WGTVAPDGTV SHVTGGNFAQ SAITINGWLR DFLWAQAANV
INSYGSALSA YGIMFLAGHF VFAFSLMFLF SGRGYWQELI ESIVWAHNKL KVAPAIQPRA
LSIIQGRAVG VAHYLLGGIV TTWAFFLARS LSIG
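
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The sketch below shows one way to strip that layout and recompute a GC percentage from the gene sequence; the helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database rounds its own GC content figure is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence as printed in the record.
    first_line = (
        "ATGACAATTA GTCCTCCCGA GAGAGAGGCG "
        "AAAGTCAAAG TCACAGTAGA TACTGATCCC"
    )
    print(len(clean_sequence(first_line)))
    print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence to `gc_percent` would give the value to compare against the GC content field.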