Gene PCC8801_4104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4104 
SymbolpsaB 
ID7101896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4300257 
End bp4302485 
Gene Length2229 bp 
Protein Length742 aa 
Translation table11 
GC content50% 
IMG OID643477093 
Productphotosystem I P700 chlorophyll a apoprotein A2 
Protein accessionYP_002374192 
Protein GI218248821 
COG category 
COG ID 
TIGRFAM ID[TIGR01336] photosystem I core protein PsaB 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACTA AATTTCCAAA ATTTAGCCAG GATCTCGCCC AAGACCCGAC AACTCGTCGG 
ATTTGGTATG GTATTGCCAC CGCCCACGAC TTTGAAACCC ATGATGGCAT GACCGAGGAA
AATCTTTACC AAAAGATTTT TGCCTCCCAT TTCGGACACA TCGCCATCAT CTTTCTGTGG
ACTTCCGGCA CCCTCTTCCA TGTAGCCTGG CAAGGTAACT TCGAGCAGTG GATTACCGAT
CCCCTAAACA TCCGCCCCAT CGCCCATGCG ATTTGGGACC CCCACTTTGG GCAAGGAGCG
ATTGATGCCT TTACCCAAGC GGGTGCTTCT TCCCCAGTCA ATATTTCCTA CTCTGGGGTT
TATCACTGGT TCTACACCAT TGGTATGAGA ACCAACGGCG ATCTCTATCA AGGGTCAATT
TTCCTCTTGA TTCTGTCCTC GTTGTTCTTG TTTGCAGGTT GGTTACACCT ACAACCCAAG
TTCCGTCCTA GCTTAGCTTG GTTCAAAAAT GCTGAATCTC GTCTAAACCA CCACTTGGCA
GGTTTGTTCG GGGTTAGCTC TTTGGCCTGG ACTGGACACT TGGTACACGT CGCTATTCCT
GAAGCTCGCG GACAGCACGT TGGTTGGGAT AACTTCCTGT CTACCCCTCC CCATCCGGCT
GGCTTAGCAC CGTTCTTTAC CGGGAACTGG GGCGTTTACG CTCAAAATCC CGATACGGCA
AGCCATGTCT TCGGAACTTC TCAGGGGGCA GGAACGGCGA TCTTAACCTT CTTAGGTGGC
TTCCATCCTC AGACAGAAGC TCTGTGGTTG ACGGATATCG CCCATCACCA CTTGGCGATC
GCAGTAATCT TCATTGTTGC TGGCCATATG TACCGCACTA ACTGGGGTAT TGGTCACAGC
ATCAAAGAGA TCCTCAACGC CCACAACCCC CCTCAAGGGA CTCCCTTTGG TGGAGCGATC
GGAGCAGGAC ACAAAGGACT CTACGACACC GTTAACAATT CTTTACACTT CCAACTCGGT
TTAGCGTTGG CTTGTTTAGG GGTTGTTACC TCCTTGGTGG CGCAACATAT GTATTCCTTG
CCCTCCTACG CCTTCATCGC TAAGGACTAC ACCACCCAAG CAGCCCTCTA TACCCATCAC
CAGTACATCG CTGGATTCTT GATGGTAGGA GCCTTTGCCC ACGGTGCGAT CTTCTTCGTT
CGTGACTACG ATCCTGAAGC TAACAAAGAT AATGTGTTAG CTCGGATGCT CGAACACAAA
GAAGCCATCA TTTCTCACTT AAGTTGGGTT TCTCTCTTCT TAGGTTTCCA CACCTTAGGC
CTCTACGTTC ACAACGATGT CGTGGTTGCC TTTGGAACCC CTGAAAAGCA AATCCTGATC
GAGCCTGTGT TTGCTCAATG GATTCAAGCG GCTCATGGTA AAGCCCTCTA CGGATTTGAT
GTGTTATTGT CCAATCCTGA CAGTGTGGCT TCTACCGCTT ATCCTAACTA CGCCAACGTT
TGGTTACCTG GCTGGTTAGA TGCGATTAAC AGTGGTGCTA ACTCCTTGTT CTTAACCATT
GGACCTGGAG ACTTTTTAGT TCACCATGCG ATCGCTCTAG GATTACACAC CACCACCTTA
ATCCTCGTTA AAGGTGCTCT CGATGCACGC GGATCTAAGC TAATGCCCGA TAAAAAGGAC
TTTGGCTATT CCTTCCCTTG CGATGGACCT GGACGCGGCG GTACTTGTGA TATCTCTGCT
TGGGATGCTT TCTACCTCGC TATGTTCTGG ATGTTGAACA CCCTAGGGTG GTTAACCTTC
TACTGGCACT GGAAGCATTT AGGCATCTGG AGTGGTAACG TTGCTCAGTT CAACGAAAAC
TCTACCTACC TAATGGGTTG GTTCCGTGAC TATCTCTGGG CGAACTCTGC TCAGTTGATT
AACGGGTATA ACCCCTACGG TGTTAACAAC CTGTCGGTTT GGGCTTGGAT GTTCCTCTTC
GGACACCTGG TTTGGGCGAC TGGTTTTATG TTCCTCATCT CTTGGCGGGG TTACTGGCAA
GAGTTGATCG AAACGATTGT TTGGGCTCAC GAGCGTACTC CTCTGGCGAA CCTAGTTCGT
TGGAAAGATA AGCCCGTTGC TCTTTCTATC GTTCAAGCTC GTTTAGTTGG GTTAGCTCAC
TTTACCGTGG GTTACATTCT GACTTACGCA GCATTCCTCA TTGCTTCAAC GGCTGGTAAG
TTCGGTTAA
 
Protein sequence
MATKFPKFSQ DLAQDPTTRR IWYGIATAHD FETHDGMTEE NLYQKIFASH FGHIAIIFLW 
TSGTLFHVAW QGNFEQWITD PLNIRPIAHA IWDPHFGQGA IDAFTQAGAS SPVNISYSGV
YHWFYTIGMR TNGDLYQGSI FLLILSSLFL FAGWLHLQPK FRPSLAWFKN AESRLNHHLA
GLFGVSSLAW TGHLVHVAIP EARGQHVGWD NFLSTPPHPA GLAPFFTGNW GVYAQNPDTA
SHVFGTSQGA GTAILTFLGG FHPQTEALWL TDIAHHHLAI AVIFIVAGHM YRTNWGIGHS
IKEILNAHNP PQGTPFGGAI GAGHKGLYDT VNNSLHFQLG LALACLGVVT SLVAQHMYSL
PSYAFIAKDY TTQAALYTHH QYIAGFLMVG AFAHGAIFFV RDYDPEANKD NVLARMLEHK
EAIISHLSWV SLFLGFHTLG LYVHNDVVVA FGTPEKQILI EPVFAQWIQA AHGKALYGFD
VLLSNPDSVA STAYPNYANV WLPGWLDAIN SGANSLFLTI GPGDFLVHHA IALGLHTTTL
ILVKGALDAR GSKLMPDKKD FGYSFPCDGP GRGGTCDISA WDAFYLAMFW MLNTLGWLTF
YWHWKHLGIW SGNVAQFNEN STYLMGWFRD YLWANSAQLI NGYNPYGVNN LSVWAWMFLF
GHLVWATGFM FLISWRGYWQ ELIETIVWAH ERTPLANLVR WKDKPVALSI VQARLVGLAH
FTVGYILTYA AFLIASTAGK FG