Gene PCC8801_1960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1960 
Symbol 
ID7102326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2036321 
End bp2037379 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content52% 
IMG OID643475022 
Productphotosystem II D2 protein (photosystem q(a) protein) 
Protein accessionYP_002372154 
Protein GI218246783 
COG category 
COG ID 
TIGRFAM ID[TIGR01151] photosystem II, DI subunit (also called Q(B))
[TIGR01152] Photosystem II, DII subunit (also called Q(A)) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTATTG CAGTCGGACG CGCCCCAGCA CAAAGAGGAT GGTTTGATGT CCTCGATGAC 
TGGCTCAAAC GCGATCGCTT TGTATTCGTT GGTTGGTCAG GTTTATTACT CTTCCCCTGT
GCCTACTTGG CTTTAGGCGG ATGGTTAACC GGAACCACCT TTGTTACCTC CTGGTACACC
CACGGTTTGG CTAGTTCCTA CCTCGAAGGC TGTAACTTCC TCACCGTTGC CGTCTCTTCC
CCCGCTAACG CCTTCGGTCA CTCCCTTCTC TTCCTGTGGG GACCCGAAGC GCAAGGCGAC
TTCACCCGTT GGTGTCAAAT TGGCGGACTT TGGACTTTTA CCGCCCTTCA CGGTGCGTTT
GGACTGATCG GCTTCATGCT GCGTCAGTTT GAAATTGCTC GCCTGGTCGG TATCCGTCCC
TACAACGCCA TCGCCTTCTC TGCTCCCATC GCCGTGTTCG TCAGTGTTTT CCTGATGTAC
CCCTTGGGAC AGTCTGGCTG GTTCTTCGGA CCTAGCTTCG GAGTGGCGGG AATTTTCCGC
TTTATCCTGT TCTTACAAGG GTTCCACAAC TGGACACTTA ACCCCTTCCA CATGATGGGA
GTAGCGGGTG TTCTCGGTGG TGCGTTACTC TGTGCTATCC ACGGGGCAAC CGTAGAAAAC
ACCCTGTTTG AAGATAGCGA TCAAGCTAAC ACCTTCCGCG CTTTTGAACC TACCCAAGCT
GAAGAAACCT ACTCCATGGT AACGGCGAAC CGTTTCTGGT CACAGATCTT CGGGATTGCT
TTTTCCAACA AACGTTGGTT ACACTTCTTT ATGCTGTTCG TCCCTGTGAC TGGACTGTGG
ATGAGTGCGA TCGGTATTGT GGGTTTAGCC CTCAACCTCC GCGCTTACGA CTTCGTATCG
CAAGAATTAC GCGCTGCTGA AGACCCTGAA TTTGAAACCT TCTACACCAA GAACATCTTG
TTAAACGAAG GTTTACGCGC TTGGATGGCT CCCCAAGACC AACCCCACCA GAATTTTGTA
TTCCCTGAAG AAGTTCTACC TCGCGGTAAC GCTCTCTAA
 
Protein sequence
MTIAVGRAPA QRGWFDVLDD WLKRDRFVFV GWSGLLLFPC AYLALGGWLT GTTFVTSWYT 
HGLASSYLEG CNFLTVAVSS PANAFGHSLL FLWGPEAQGD FTRWCQIGGL WTFTALHGAF
GLIGFMLRQF EIARLVGIRP YNAIAFSAPI AVFVSVFLMY PLGQSGWFFG PSFGVAGIFR
FILFLQGFHN WTLNPFHMMG VAGVLGGALL CAIHGATVEN TLFEDSDQAN TFRAFEPTQA
EETYSMVTAN RFWSQIFGIA FSNKRWLHFF MLFVPVTGLW MSAIGIVGLA LNLRAYDFVS
QELRAAEDPE FETFYTKNIL LNEGLRAWMA PQDQPHQNFV FPEEVLPRGN AL