Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_29411 |
Symbol | lhcb5 |
ID | 5006571 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | + |
Start bp | 191721 |
End bp | 193261 |
Gene Length | 1541 bp |
Protein Length | 324 aa |
Translation table | |
GC content | 65% |
IMG OID | 640421992 |
Product | putative CP26, photosystem II light harvesting complex |
Protein accession | XP_001422513 |
Protein GI | 145356594 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0569828 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0538058 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CACTCCCGCA CGCGCGCACA GGACCATGCA AGTCAACGCC CTGCTCAAGA AAGCCGCCCC CGCGCCCGCC AAGACCGCCA AGGCGCCGGC GAAGACGCAG ATCAAGAAGG CGCCGGCGAA GAAGGCGTCG CCGTCGCTGC CGTCGCTGCC GAATCCGTTC GGCGGCGCCG CGCCCAAGGC GAGCGCGAAG ACGGTCAAGA AGGCCGCGCC GGCGAAGAAG GCGGTCGCGA AGAAGGCGCC GGTCAAGAAG GCGCCGGTCA AGAAGAAGGC GCCGGTCGCG AAGAAGGCGG CCAAGTCGCC GACCGCGGCC AAGGAGGCGC TCGCGAAGTG GTACGGTGCG TATCTTTTAA AGGCGATCGG ATAGGATCGT TCGACGCGCG CGACGACGCG CGGACGCGCA CAGGACGCGC GCGCGCGATG GGAATCGATC GTTAAGTGCG CGTTCGCGCG AACGGGGCGA CGCGGGACGC GCGCGCGGGA GGACGCGTGG CGGAAATCGG AAGATATGGT TGCGATCGCG CGCGCGGGAC GGAGGCGGGA CGCGGGACGG ACGCGCGCGA GGGACGATAG AGCGGGCGCG AGGGACGCGC GGCACGCGGG GGGGTGAGAA GGGCGAAGGA AACGCGCGCG ATGGGGAGAC TGACGATGCG CGACGACGCG TTCGACGCAG GTCCGGACCG TAAGTTGTAC TTGCCGGGCG GCCTCTTGAC CAAGGCGGAT TTGCCGAGCT ACCTCGACGG TACGCTCGCG GGTGATTACG GCTTCGACCC GCTCGGCCTC GGCGCCGACG GTGCGATCAA GCAATACCGC GTCGCGGAAG TCATCCACGC TCGATGGGCG ATGCTCGCGA TCCCGGGTGT CGTCATCCCG GAGGCGCTCG GCTTGCCGGG CGGCGTCTGG ACGGAAACCG GCAAGGTGTT CCTCGACGGC GAAACCGGCC GTCCGTTCTT CCTCGAGAAC CCGATCGTGT TCGCGGCCGT CCAAGTTGCG CTCATGGCTG GTGTCGAGCA ATTCCGCTCG AGCGGTGAAG GTCCGGCCGG TTTCGTGCCG TTCAAGGGCA AGTTCGACGA GTCGGCGTTC AAGGGCCTCG ACCCGATCAA CCCGGGTGGC CCGCTCGACT TCTTCAACGT CGCCTCCAAC CCGCAAGACT TGGCCCTCTT GAAGGTCAAG GAAATCAAGA ACGGTCGCTT GGCGATGATC GCCATGTTGG GCGTCTTCGT GCAAGGTCTT ACCACCGGTG AAGGTCCGGC TGCCAACTGG GGCAAGCACG TCGCCGATCC GTTCGGTACG TAACGCGCGC GCGCGCCTCG CCCATCGACG ACCGCGTTCG CGTGTTTTCC ACGAGCACTC ACTGACCCCG CTATCTCTCG TCTCCCACGC AGGTTACAAC TTCGTCACCC TTCAAGCCGT CGACCGCACG CCGGTTCTTT AAGCGCGAGT TTGAAACAAT TTTTCAAACT TTTGTGACGC GACCTGTTCA TGTATAGCCG CTCGACCGGC TCGCTCGTTC ACTTGTAACG GAAGACCGTG ATATTCAACC A
|
Protein sequence | MQVNALLKKA APAPAKTAKA PAKTQIKKAP AKKASPSLPS LPNPFGGAAP KASAKTVKKA APAKKAVAKK APVKKAPVKK KAPVAKKAAK SPTAAKEALA KWYGPDRKLY LPGGLLTKAD LPSYLDGTLA GDYGFDPLGL GADGAIKQYR VAEVIHARWA MLAIPGVVIP EALGLPGGVW TETGKVFLDG ETGRPFFLEN PIVFAAVQVA LMAGVEQFRS SGEGPAGFVP FKGKFDESAF KGLDPINPGG PLDFFNVASN PQDLALLKVK EIKNGRLAMI AMLGVFVQGL TTGEGPAANW GKHVADPFGY NFVTLQAVDR TPVL
|
| |