Gene PHATRDRAFT_20331 details


Gene Information

Locus tag: PHATRDRAFT_20331
Symbol: PsbO
ID: 7201024
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011676
Strand:
Start bp: 759421
End bp: 760679
Gene Length: 1259 bp
Protein Length: 308 aa
Translation table:
GC content: 50%
IMG OID:
Product: oxygen-evolving enhancer protein 1 precursor
Protein accession: XP_002180309
Protein GI: 219119085
COG category:
COG ID:
TIGRFAM ID:
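
The reported gene length is consistent with the start and end coordinates if these are read as 1-based and inclusive. A minimal Python sketch of that arithmetic follows; `gene_length` is a hypothetical helper name, and the inclusive-coordinate convention is an assumption inferred from the numbers in this record, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Start and end coordinates as listed in this record.
print(gene_length(759421, 760679))  # 1259
```

The result matches the 1259 bp listed for the gene, which supports the inclusive reading.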


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TATACCGAAG CAATCTTGAC CATGAAGTTC ACTGCCGCCT GCTCTATTGC CCTCGCTGCT 
TCGGCTTCGG CCTTTGCCCC GATTCCCTCG GTTAGCGTGA GTTGATTTTG CAGTGGCCAT
GATAGGAAAC GGTCGAGAGT TGCAGAGAAC AAAAGCTGTT GAGCATTATG CCCTTTATTT
CGTTCTTCGT CTGCGCTGTT TACGTCACAA TGAATTTTAT TGGCAGTAAC TTTTTTGTTG
TTGGCTGTAG TGATTTGTGT CTGACAAGTT TCGTTTTCGC CCTTATTGAT ACTGCAGCGT
ACCACCGATC TTAGCATGTC TTTGCAAAAG GATCTCGCTA ATGTCGGCAA GGTTGCCGCT
GCCGGAGCCC TTGCCTTCGG TCTCGCCACG GCCCCAGCCA ATGCGTTAAC CAAGAGCCAG
ATCAATGAGC TCTCCTACTT GCAGGTCAAG GGAACCGGTT TGGCAAACCG CTGCCCGGAA
GTCGTCGGAG AAGACAGCAT CACCCCCAAG GGCGGACAAC GTCTCGTCGA TATGTGCATT
GAACCCAAGG CCTGGGCTGT AGAAGAGGAA ATTGGCAAGG CTGGGCGCAC CGAAAAGAAG
TTTGTCAATT CCAAGGTCAT GACTCGTCAG ACGTACACTC TTGATGGAAT TGAGGGTGCT
TTGAAGTCCG AAGGAGGAAG TATCGTCTTC CAGGAACAGG AAGGCATTGA TTATGCTGCC
ACTACCGTTC AGCTTCCAGG TGGGGAACGT GTTCCTTTCC TTTTTACCGT CAAAGACTTG
GTTGCCAAGG GTAACGGTGG ATCTTTCAAG CCTGGTTTCC AAATGGGAGG CGACTTCAAT
ACTCCTTCCT ACCGTACTGG TCTCTTCCTT GATCCCAAGG GACGTGGTGG AACCACCGGA
TACGACATGG CTGTTGCCCT TCCTGGTCTT CAATCCGGAG AAGAGGGTGA CGATGACCTT
TTCAAAGAGA ACAACAAGAC CTTCGACATC ACTACTGGCC GTATCGAAAT GGAAGTCAAC
AAGGTCAATG CGGAAGAGCA GGAAATTGGA GGTGTCTTTG TTGCCACTCA GCTGTCCGAC
ACCGATATGG GATCAAAGGT GCCTAAGAAA GTTCTCACTA AGGGTATCTT CTACGCCCGT
GTCGAGTAAA CATGTTTCAC TATGCTAGTG CAGCTTTCGA GACGAATTGC GATGGTGACG
GTCGACGGTT TAGCTCTAGC CTTTCGTCCC AATAGAACCT CTTTTTCACC ACAATCTTA
 
Protein sequence
MKFTAACSIA LAASASAFAP IPSVSRTTDL SMSLQKDLAN VGKVAAAGAL AFGLATAPAN 
ALTKSQINEL SYLQVKGTGL ANRCPEVVGE DSITPKGGQR LVDMCIEPKA WAVEEEIGKA
GRTEKKFVNS KVMTRQTYTL DGIEGALKSE GGSIVFQEQE GIDYAATTVQ LPGGERVPFL
FTVKDLVAKG NGGSFKPGFQ MGGDFNTPSY RTGLFLDPKG RGGTTGYDMA VALPGLQSGE
EGDDDLFKEN NKTFDITTGR IEMEVNKVNA EEQEIGGVFV ATQLSDTDMG SKVPKKVLTK
GIFYARVE
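
Both sequences are printed in space-separated blocks of ten residues, so they need to be joined before any length or composition check. A minimal Python sketch follows, assuming whitespace is the only formatting to strip; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Join space- or newline-separated residue blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First three ten-base blocks of the gene sequence, with a line break
# added to show that newlines are handled like spaces.
example = "TATACCGAAG CAATCTTGAC\nCATGAAGTTC"
seq = clean_sequence(example)
print(len(seq))                    # 30
print(round(gc_fraction(seq), 2))  # 0.4
```

Applied to the full gene and protein sequences, the cleaned lengths can be compared with the Gene Length and Protein Length fields, and the GC fraction with the GC content field.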