Gene PCC8801_2016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2016 
Symbol 
ID7104784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2090110 
End bp2091180 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content48% 
IMG OID643475077 
Productphotosystem q(b) protein 
Protein accessionYP_002372209 
Protein GI218246838 
COG category 
COG ID 
TIGRFAM ID[TIGR01151] photosystem II, DI subunit (also called Q(B)) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTACCA CCTTACAACA ACGCGAGAGC GTTTCCGTTT GGGAGCAGTT CTGTCAGTGG 
GTAACAAGCA CCAACAACCG TCTTTATGTC GGCTGGTTCG GTACTTTAAT GATCCCCACC
CTCTTAACTG CAACCACTTG CTTCATCATC GCTTTCATCG CTGCACCTCC CGTGGACATC
GATGGTATCC GTGAACCCGT TGCTGGTTCT TTACTTTATG GAAACAACAT CATCTCTGGT
GCAGTTGTTC CTTCTAGCAA CGCTATCGGA TTACACTTCT ACCCCATCTG GGAAGCTGCT
TCTCTTGATG AGTGGCTCTA CAACGGCGGA CCTTACCAAT TAGTAGTCTT CCACTTCTTA
ATCGGAGTAT TTTGCTACTT AGGCCGTCAG TGGGAACTAT CTTACCGCTT AGGAATGCGT
CCTTGGATTT GCGTAGCCTA CAGCGCACCT GTTTCCGCAG CTACTGCCGT GTTCTTAATC
TACCCCATCG GACAAGGTTC TTTCTCTGAT GGAATGCCTT TAGGAATTAG CGGAACCTTC
AACTTCATGT TCGTGTTCCA AGCTGAGCAC AACATCCTAA TGCACCCCTT CCATATGTTG
GGAGTTGCTG GTGTCTTTGG TGGTTCTTTG TTCTCCGCTA TGCACGGTTC TTTAGTCACC
TCTTCCTTAG TCCGTGAAAC CACTGAAATC GAGTCTCAAA ACTACGGTTA CAAGTTCGGA
CAAGAAGAAG AAACCTACAA CATCGTAGCT GCTCACGGAT ACTTTGGACG TTTAATCTTC
CAATACGCAT CCTTCAACAA CAGCCGTGCC TTACACTTCT TCTTAGGTGC ATGGCCTGTA
ATCGGTATCT GGTTCACCGC AATGGGTGTA TCTACCATGG CTTTCAACCT CAACGGTTTC
AACTTCAACC AATCGATTCT TGACTCACAA GGTCGCGTAA TCGGAACCTG GGCTGATGTA
CTCAACCGTG CAGGAATTGG AATGGAAGTA ATGCACGAGC GCAACGCTCA CAACTTCCCC
TTAGACTTAG CTTCTGCTGA GCCTGTGTCT GCTCCTGCTA TCAATGGTTA A
 
Protein sequence
MTTTLQQRES VSVWEQFCQW VTSTNNRLYV GWFGTLMIPT LLTATTCFII AFIAAPPVDI 
DGIREPVAGS LLYGNNIISG AVVPSSNAIG LHFYPIWEAA SLDEWLYNGG PYQLVVFHFL
IGVFCYLGRQ WELSYRLGMR PWICVAYSAP VSAATAVFLI YPIGQGSFSD GMPLGISGTF
NFMFVFQAEH NILMHPFHML GVAGVFGGSL FSAMHGSLVT SSLVRETTEI ESQNYGYKFG
QEEETYNIVA AHGYFGRLIF QYASFNNSRA LHFFLGAWPV IGIWFTAMGV STMAFNLNGF
NFNQSILDSQ GRVIGTWADV LNRAGIGMEV MHERNAHNFP LDLASAEPVS APAING