Gene Information |
Locus tag | OSTLU_26717 |
Symbol | psbY |
ID | 5004850 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | + |
Start bp | 93144 |
End bp | 94828 |
Gene Length | 1685 bp |
Protein Length | 416 aa |
Translation table | |
GC content | 63% |
IMG OID | 640420271 |
Product | possible psbY, PSII-Y, photosystem II polypeptide |
Protein accession | XP_001420585 |
Protein GI | 145352511 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
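The gene length listed above follows from the start and end coordinates if both are read as 1-based and inclusive. That convention is an assumption here, although it reproduces the reported 1685 bp. The helper name `feature_length` is hypothetical and only serves this sketch.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values taken from the record: Start bp 93144, End bp 94828.
gene_length = feature_length(93144, 94828)
print(gene_length)  # 1685

# A 416 aa protein needs 416 * 3 = 1248 coding bases (plus a stop codon),
# which is fewer than 1685 bp, so the gene region also holds non-coding sequence.
coding_bases = 416 * 3
print(coding_bases)  # 1248
```

The difference between the two numbers is consistent with the record marking the gene as spliced, although the record itself does not break the region down into exons, introns, and untranslated sequence.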
Plasmid Coverage Information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.775264 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0834008 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGACGCGCGC GACACACCGC GCCGCACACG ATGCTCACGT TGACGCGCAC GCAAATCACG CTCCCCGCCC GCGCGCCCGC GCGCGCGAGG CGCACGGCGC GCACGGCGCA AAGCGCGCGC GTCATCGCGC GCGCGCGGAC GTCCGCGGTC GTCGAGGATG AATTCGCGAC GGACGACGAC GTCGTCAGAC GAGGACAGGT GCGCGCGAAC GACCGCGGGC GCGATGCGCA AGAATTCGAT GCGTCGCGCG GTAGGACGCG CGAACGCGCG CGGGCGTCGA TGGACGCGCG AGGGCGACGA AGCGATCGAT CGGACGGGGG TGCGATCGAT ATCTGGATAA ATGGCGCGCG CTGGAAAACG ACGCGTTCGG TCGAACGCTG ACGGCGATGC GACGCGTGAG GGCGGAGAAA GGGCGAAAGG TAAAATGTGT GAACGAATGA CTGACGACGT GACGCGCGAT GCGGTGCTTT CGCGATCGTA GACGGCGGCT ATGGCGACGG CGGCTTCCAT GTTTTTCGCG GATCGCGCCG ACGCGGCGCA AGAGTTGATG CAAACCGCGC TCGATGGACG CCCGCTCATC TTCTTGGCGG TGTTTGGACC GGTCTTGGGT TGGGTGGCGT ATAACATTCT CTCCCCGGGG TTGAGACAGC TTGAAAACAT GCAAGCGGTG AACGCAAAGT CGAGCAAGAA GCGCGCCGCC GGCGCGCTGG CGGGCGGCTT GACGGCGAGT GCCTTGATGG GCATGCCGGA AGCGTCTGAT GCGGCTCAAA ACGTTGCGGA TCTTGCCGGC ATCGATGGCC GCATCGCCAT CTTTTTTGTG TTCGCCCCGG TTCTCGGCTG GGTGGCGTAC AACATTCTCG GCCCGGGTCT TCGTCAGTTC GAAGACATGC AAAAGGCTGC CGCGAAGAAG AAGGGTGTGC TCGCGGGTGC GGGTCTCTCG GCTGCCGCTT TGATGGGCAT GCCGGAAGCC TCCGACGCTG CGGAACAGCT CGGTGAGCTC GCCGGCATCG ATGGCCGCAT CGCCATCTTT TTTGTGTTCG CCCCGGTTCT CGGCTGGGTG GCGTACAACA TTCTCGGCCC GGGTCTTCGT CAGTTCGAAG ACATGCAAAA GGCTGCCGCG AAGAAGAAGG GTGTGCTCGC GGGTGCGGGT CTCTCGGCTG CCGCTTTGAT GGGCATGCCG GAAGCGTCTG ATGCGGCTCA AGAAATTGGC ACCCTCGCTG GCATCGATGG CCGCATCGCC ATCTTTTTTG TGTTCGCCCC GGTTCTCGGC TGGGTGGCGT ACAACATTCT CGGCCCGGGT CTTCGTCAGT TCGAAGACAT GCAAAAGGCT GCCGCGAAGA AGAAGGGTGT GCTCGCGGGT GCGGGTCTCT CGGCTGCCGC TTTGATGGGC ATGCCGGAAG CCTCCGACGC TGCGGAACAG CTCGGTGAGC TCGCCGGCAT CGATGGCCGC ATCGCCATCT TTTTTGTGTT CGCCCCGGTT CTCGGCTGGG TGGCGTACAA CATTCTCGGC CCGGGTCTTC GTCAGTTCGA AGACATGCAA AAGGCTGCCG CGAAGAAGAA GTAGACATTT CGCAAGAGCT GCGTCAAGTC TCCGATTCAC GAACTCGACA ACAAACACAT AGTATTTAGT CCTCCGAGAA CGTTGCGCTC TCGACAGAGC ATCGCTCGAA AATGCAAGCT TGTAA
|
Protein sequence | MLTLTRTQIT LPARAPARAR RTARTAQSAR VIARARTSAV VEDEFATDDD VVRRGQTAAM ATAASMFFAD RADAAQELMQ TALDGRPLIF LAVFGPVLGW VAYNILSPGL RQLENMQAVN AKSSKKRAAG ALAGGLTASA LMGMPEASDA AQNVADLAGI DGRIAIFFVF APVLGWVAYN ILGPGLRQFE DMQKAAAKKK GVLAGAGLSA AALMGMPEAS DAAEQLGELA GIDGRIAIFF VFAPVLGWVA YNILGPGLRQ FEDMQKAAAK KKGVLAGAGL SAAALMGMPE ASDAAQEIGT LAGIDGRIAI FFVFAPVLGW VAYNILGPGL RQFEDMQKAA AKKKGVLAGA GLSAAALMGM PEASDAAEQL GELAGIDGRI AIFFVFAPVL GWVAYNILGP GLRQFEDMQK AAAKKK
|
| |
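The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the blocks can be joined and the GC fraction computed as below. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "CGACGCGCGC GACACACCGC GCCGCACACG"
seq = clean_sequence(first_blocks)
print(len(seq))
print(round(gc_fraction(seq) * 100), "% GC")
```

Running the same two functions on the full gene sequence would allow a check against the reported gene length and GC content. Applying `clean_sequence` to the protein sequence and taking its length would give a similar check against the reported protein length.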