Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_27706 |
Symbol | lhcp2.1 |
ID | 5005661 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009369 |
Strand | - |
Start bp | 39300 |
End bp | 40567 |
Gene Length | 1268 bp |
Protein Length | 233 aa |
Translation table | |
GC content | 65% |
IMG OID | 640421082 |
Product | prasinophyte light harvesting complex, chlorophyll binding |
Protein accession | XP_001421688 |
Protein GI | 145354851 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.59606 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TCGGCATTCC TGCACACGCG CATTTCTACA ATGTCTGCCC TTCTCGCTTC CTCCTTCGTC GGCCGCGTCG CCGCCTTCAA GGCGACCAAG ATCCAAGTGC GTATTTTTTA GCCTTCGAGC GACGCGCGGA GACGCGGGGA CGCGGCGCGC GGAGACGCGG CGACGCGGGA CGCGATGGGA CGCGCGCGAT GGGATGGGAT CGCGCGCGAC GGGGCGCGGC GACGCGGGAC GCGCGCGCGG CGAGGCGACG GCGCTGGGAC GACGCGAACG CGCGGAGAGA AATGATTTGA ATCGGAGGCG CCGCGCGGGG GGGAGGGCGC GGGCGAGAGA CGCGCGAGAC GCGCGCGATA CGCGATTCGT GGCGGTTTCC GTATCGTATC TTTATCGCGC GGGGATTCGG GAGGGGGTCG GCGCTGAGTC GTGAACGGTG GATTGGGTTC GCGGTTCGCG CGCGACGCGC GGAGAGGCGG GACGGCGGCG TGAAACGACG CGCGCGCGAT CGGTGACTGA CGAACGTTCG CTTCGCGCGC TTTTTAACGA TCGCAGGCCA AGTCTGTCTC CACGACGGTC AAGGCTGACA TCTACCCGGA ATTCGGTACC TACCCGGGCG GTGGCGAATC CCCGATCATC CCGTTCGGCG ACGAAAAGAA CGCCGAGCGT GAAGTGATCC ACGGCCGCTG GGCGATGCTC GGTGTCACCG GCGCGTGGGC CGCCGAAAAC GGCACCGGTA TCCCGTGGTT CACCGCGGGT ACCTTGTGCA CCCCGGATGA CTGCACCGCC GTTGCGGACA AGTTCCCGGG CGCCGTCGCC CCGCTCGCGC CGGAAGGCTC TGGCTACCCG TCCTTCTGGA ACGTTCTCAT CATCGAGATC GTTCTCGTCG GCGCCGCGGA AGCGTACCGT ACCGGTATCT CCGACTCTCC GTTCGATGAT GGCCTCACCG TCGGTGACGT CAACCCGGGT GGACGCTTCG ACCCGCTCGG CCTCGCCGAG TCTGGCGACC TTGAAGAACT CAAGATCAAG GAGCTCAAGC ACTGCCGCTT GTCCATGTTC GCGTGGTTGG GCTGCATCTT CCAAGCGCTC GCCACCCAAG AAGGCCCGAT CGCCAACTGG CAATCCCACG TTGCGGACCC GGTTCACTCC AACGTCCTCA CCAACGCGGC CAAGGGCTTC GGCTTCTACT AAGCGGTTCA CCTCCTGCGA TGAACTCGTT TTCGCGTGAG GCGCTCGAAT TTACTCATTA ATATTATGCG TTTCAATGTT ACTTGCTC
|
Protein sequence | MSALLASSFV GRVAAFKATK IQAKSVSTTV KADIYPEFGT YPGGGESPII PFGDEKNAER EVIHGRWAML GVTGAWAAEN GTGIPWFTAG TLCTPDDCTA VADKFPGAVA PLAPEGSGYP SFWNVLIIEI VLVGAAEAYR TGISDSPFDD GLTVGDVNPG GRFDPLGLAE SGDLEELKIK ELKHCRLSMF AWLGCIFQAL ATQEGPIANW QSHVADPVHS NVLTNAAKGF GFY
|
| |