Gene Cyan8802_1987 details

Gene Information

Locus tag: Cyan8802_1987
Symbol:
ID: 8391303
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 2006599
End bp: 2007657
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 51%
IMG OID: 644979968
Product: photosystem II D2 protein (photosystem q(a) protein)
Protein accession: YP_003137713
Protein GI: 257059825
COG category:
COG ID:
TIGRFAM ID: [TIGR01151] photosystem II, DI subunit (also called Q(B))
            [TIGR01152] Photosystem II, DII subunit (also called Q(A))
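
The length fields in this record follow from the coordinates by simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2006599, 2007657)
print(length)                     # 1059
print(protein_length_aa(length))  # 352
```

Under these assumptions the computed values agree with the Gene Length (1059 bp) and Protein Length (352 aa) fields.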


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000347488
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0187293
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTATTG CAGTCGGACG CGCCCCAGCA CAAAGAGGAT GGTTTGATGT CCTCGATGAC 
TGGCTCAAAC GCGATCGCTT TGTATTCGTT GGTTGGTCAG GTTTATTACT CTTCCCCTGT
GCCTACTTGG CTTTAGGGGG ATGGTTAACC GGAACCACCT TTGTTACCTC CTGGTATACC
CACGGTTTGG CTAGTTCCTA CCTCGAAGGC TGTAACTTCC TCACCGTTGC CGTCTCTTCC
CCCGCTAACG CCTTCGGTCA CTCCCTTCTC TTCCTGTGGG GACCCGAAGC GCAAGGCGAC
TTCACCCGTT GGTGTCAAAT TGGCGGACTT TGGACTTTTA CCGCCCTTCA CGGTGCTTTT
GGACTGATCG GCTTCATGCT GCGTCAGTTT GAAATTGCTC GTCTGGTTGG TATTCGTCCC
TACAACGCCA TCGCCTTCTC TGCTCCCATC GCCGTGTTCG TCAGTGTTTT CCTGATGTAC
CCCTTGGGAC AGTCTGGCTG GTTCTTCGGA CCTAGCTTCG GAGTGGCGGG AATTTTCCGC
TTTATCCTGT TCTTACAAGG GTTCCACAAC TGGACACTTA ACCCCTTCCA CATGATGGGA
GTAGCGGGTG TTCTCGGTGG TGCGTTACTC TGTGCTATCC ACGGGGCAAC CGTAGAAAAC
ACCCTGTTTG AAGATAGCGA TCAAGCTAAC ACCTTCCGCG CTTTTGAACC TACCCAAGCT
GAAGAAACCT ACTCCATGGT AACGGCGAAC CGTTTCTGGT CACAGATCTT CGGGATTGCT
TTTTCCAACA AACGTTGGTT ACACTTCTTT ATGCTGTTCG TCCCTGTGAC TGGACTGTGG
ATGAGTGCGA TCGGTATTGT GGGTTTAGCC CTCAACCTCC GCGCTTACGA CTTCGTATCG
CAAGAATTAC GCGCTGCTGA AGACCCTGAA TTTGAAACCT TCTACACCAA GAATATCTTG
TTGAACGAAG GTTTACGCGC TTGGATGGCT CCCCAAGACC AACCCCACCA GAATTTTGTA
TTCCCTGAAG AAGTTCTACC TCGCGGTAAC GCTCTCTAA
 
Protein sequence
MTIAVGRAPA QRGWFDVLDD WLKRDRFVFV GWSGLLLFPC AYLALGGWLT GTTFVTSWYT 
HGLASSYLEG CNFLTVAVSS PANAFGHSLL FLWGPEAQGD FTRWCQIGGL WTFTALHGAF
GLIGFMLRQF EIARLVGIRP YNAIAFSAPI AVFVSVFLMY PLGQSGWFFG PSFGVAGIFR
FILFLQGFHN WTLNPFHMMG VAGVLGGALL CAIHGATVEN TLFEDSDQAN TFRAFEPTQA
EETYSMVTAN RFWSQIFGIA FSNKRWLHFF MLFVPVTGLW MSAIGIVGLA LNLRAYDFVS
QELRAAEDPE FETFYTKNIL LNEGLRAWMA PQDQPHQNFV FPEEVLPRGN AL
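
The sequences in this section are printed in blocks of ten characters, so they need to be joined before any calculation. The following is a minimal sketch, with hypothetical helper names, of how the Gene sequence block could be normalized and its GC percentage computed. It assumes the sequence contains only the letters A, C, G and T.

```python
def normalize_sequence(display_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(display_text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First two blocks of the gene sequence, as printed in this record.
first_blocks = normalize_sequence("ATGACTATTG CAGTCGGACG")
print(first_blocks)              # ATGACTATTGCAGTCGGACG
print(len(first_blocks))         # 20
print(gc_percent(first_blocks))  # 50.0
```

Passing the full Gene sequence text through the same two functions gives a length and GC percentage that can be compared against the Gene Length and GC content fields.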