Gene Cyan8802_0053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_0053 
Symbol 
ID8389356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp55855 
End bp57237 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content49% 
IMG OID644978101 
Productphotosystem II 44 kDa subunit reaction center protein 
Protein accessionYP_003135860 
Protein GI257057972 
COG category 
COG ID 
TIGRFAM ID[TIGR01153] photosystem II 44 kDa subunit reaction center protein (also called P6 protein, CP43), bacterial and chloroplast
[TIGR03041] chlorophyll a/b binding light-harvesting protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000498494 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTAACGC TCTCTAATGT TTCCGTTACC AGTGGACGTG ACCTAGAATC AACTGGTTTT 
GCATGGTGGT CAGGCAATGC TCGTCTGATC AACCTCTCCG GTAAGCTTCT CGGTGCTCAC
GTCGCTCACG CTGGTTTGAT TGTTTTCTGG GCCGGGGCAA TGACCCTGTT TGAAACCGCC
CACTTTATTC CCGAAAAGCC CATGTACGAA CAGGGCTTAA TTCTCCTGCC CCACATTGCT
ACCCTCGGTT GGGGTGTAGG ACCTGGTGGT GAAGTAATTG ATACCTTCCC CTTCTTTGTT
GCAGGGGTAT TACACCTGAT TTCTTCTGCT GTTCTCGGTT TTGGTGGTAT TTATCACGCT
CTGCGTGGTC CTGAAACCTT AGAAGAGTAT TCCAGCTTCT TCGGTTACGA CTGGAAGGAC
AAAAACCAGA TGACCAACAT CATCGGTTAT CACCTAATTC TTTTGGGTTG TGGTGCGCTG
TTGTTGGTAT TCAAAGCCAT GTTCTTTGGT GGCGTTTATG ACACCTGGGC TCCTGGTGGT
GGTGATGTCC GGGTAATCAC CAATCCTACC TTAAATCCTG CCGTGATCTT TGGTTATCTG
ACCAAGGCTC CCTTTGGTGG CGAAGGTTGG ATTATTAGTG TCAACAACAT GGAAGATATT
ATTGGCGGTC ACATTTGGAT CGGCCTAATT TGTATCTTCG GTGGTATTTG GCACATTTTA
ACCAAGCCCT TTGGTTGGGC TCGTCGCGCC TTTATCTGGT CTGGTGAAGC TTACCTATCT
TACAGTTTAG GAGCTTTATC CATGATGGGT TTCATCGCGG CGGTTTTTGT TTGGTTTAAC
AACACCGCTT ACCCCAGTGA GTTCTATGGA CCCACCGGGA TGGAAGCATC TCAATCTCAA
GCTTTCACCT TCTTGGTTCG TGACCAACGC TTAGGGGCTA ATATTGGTTC TGCTCAAGGT
CCGACTGGGT TAGGTAAATA TTTAATGCGT TCTCCTACCG GTGAAATCAT CTTCGGTGGT
GAAACCATGC GTTTCTGGGA CTTCCGTGGT CCTTGGTTAG AACCCCTGCG CGGTCCTAAC
GGTCTAGACT TAGACAAGTT AAAAAATGAC GTTCAGCCTT GGCAAATTCG TCGCGCTGCT
GAATATATGA CCCACGCGCC TTTAGGTTCT TTGAACTCTG TGGGTGGGGT TATCACCGAT
GTTAACTCCT TTAACTACGT TTCTCCCCGT GCGTGGTTGG CGACTTCTCA CTTTACTTTA
GCTTTCTTCT TCCTGATTGG TCATCTGTGG CACGCTGGAC GTGCACGGGC GGCTGCGGCT
GGATTTGAGA AAGGGATTGA TCGTGAGACT GAACCCGTAC TGTCTATGCC TGACCTTGAC
TAA
 
Protein sequence
MVTLSNVSVT SGRDLESTGF AWWSGNARLI NLSGKLLGAH VAHAGLIVFW AGAMTLFETA 
HFIPEKPMYE QGLILLPHIA TLGWGVGPGG EVIDTFPFFV AGVLHLISSA VLGFGGIYHA
LRGPETLEEY SSFFGYDWKD KNQMTNIIGY HLILLGCGAL LLVFKAMFFG GVYDTWAPGG
GDVRVITNPT LNPAVIFGYL TKAPFGGEGW IISVNNMEDI IGGHIWIGLI CIFGGIWHIL
TKPFGWARRA FIWSGEAYLS YSLGALSMMG FIAAVFVWFN NTAYPSEFYG PTGMEASQSQ
AFTFLVRDQR LGANIGSAQG PTGLGKYLMR SPTGEIIFGG ETMRFWDFRG PWLEPLRGPN
GLDLDKLKND VQPWQIRRAA EYMTHAPLGS LNSVGGVITD VNSFNYVSPR AWLATSHFTL
AFFFLIGHLW HAGRARAAAA GFEKGIDRET EPVLSMPDLD