Gene PCC8801_0056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_0056 
Symbol 
ID7103721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp58944 
End bp60002 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content52% 
IMG OID643473172 
Productphotosystem II D2 protein (photosystem q(a) protein) 
Protein accessionYP_002370319 
Protein GI218244948 
COG category 
COG ID 
TIGRFAM ID[TIGR01151] photosystem II, DI subunit (also called Q(B))
[TIGR01152] Photosystem II, DII subunit (also called Q(A)) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATTG CAGTTGGACG TGCCCCGGCA CAAAGAGGAT GGTTTGATGT CCTCGATGAC 
TGGTTAAAAC GCGATCGCTT TGTATTCGTT GGTTGGTCAG GTTTATTACT CTTCCCCTGT
GCCTACTTGG CTTTAGGGGG ATGGTTAACC GGAACCACCT TTGTTACCTC CTGGTACACC
CACGGTTTGG CTAGTTCCTA CCTCGAAGGC TGTAACTTCC TCACCGTTGC CGTCTCTTCC
CCCGCTAACG CCTTCGGTCA CTCCCTTCTC TTCCTGTGGG GACCCGAAGC GCAAGGCGAC
TTCACCCGTT GGTGTCAAAT TGGCGGACTT TGGACTTTTA CCGCCCTTCA CGGTGCTTTT
GGACTGATCG GCTTCATGCT GCGTCAGTTT GAAATTGCTC GCCTTGTTGG TATCCGTCCC
TACAACGCCA TCGCCTTCTC TGCTCCCATC GCCGTGTTCG TCAGTGTTTT CCTGATGTAC
CCCTTGGGAC AGTCTGGCTG GTTCTTCGGA CCTAGCTTTG GAGTGGCGGG AATTTTCCGC
TTTATCCTGT TCTTACAAGG GTTCCACAAC TGGACACTTA ACCCCTTCCA CATGATGGGA
GTAGCGGGTG TTCTCGGTGG TGCGTTACTC TGTGCTATCC ACGGGGCAAC CGTAGAAAAC
ACCCTGTTTG AAGATAGCGA TCAAGCTAAC ACCTTCCGCG CTTTTGAACC TACCCAAGCT
GAAGAAACCT ACTCCATGGT AACGGCGAAC CGTTTCTGGT CACAGATCTT CGGGATTGCT
TTTTCCAACA AACGTTGGTT ACACTTCTTT ATGCTGTTCG TCCCTGTGAC TGGACTGTGG
ATGAGTGCGA TCGGTATTGT GGGTTTAGCC CTCAACCTCC GCGCTTACGA CTTCGTATCG
CAAGAATTAC GCGCTGCTGA AGACCCTGAA TTTGAAACCT TCTACACCAA GAATATCTTG
TTAAACGAAG GTTTACGCGC TTGGATGGCT CCCCAAGACC AACCCCACCA GAATTTTGTA
TTCCCTGAGG AGGTACTCCC CCGTGGTAAC GCTCTCTAA
 
Protein sequence
MTIAVGRAPA QRGWFDVLDD WLKRDRFVFV GWSGLLLFPC AYLALGGWLT GTTFVTSWYT 
HGLASSYLEG CNFLTVAVSS PANAFGHSLL FLWGPEAQGD FTRWCQIGGL WTFTALHGAF
GLIGFMLRQF EIARLVGIRP YNAIAFSAPI AVFVSVFLMY PLGQSGWFFG PSFGVAGIFR
FILFLQGFHN WTLNPFHMMG VAGVLGGALL CAIHGATVEN TLFEDSDQAN TFRAFEPTQA
EETYSMVTAN RFWSQIFGIA FSNKRWLHFF MLFVPVTGLW MSAIGIVGLA LNLRAYDFVS
QELRAAEDPE FETFYTKNIL LNEGLRAWMA PQDQPHQNFV FPEEVLPRGN AL