Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_30780 |
Symbol | lhcp2.2 |
ID | 5000804 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 735869 |
End bp | 737158 |
Gene Length | 1290 bp |
Protein Length | 233 aa |
Translation table | |
GC content | 65% |
IMG OID | 640416225 |
Product | prasinophyte light harvesting complex, chlorophyll binding |
Protein accession | XP_001417039 |
Protein GI | 145345055 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.71449 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.246837 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTCGTAATGT CTGCCCTTCT CGCTTCCTCC TTCGTCGGCC GCGTCGCCGC CTTCAAGGCG ACCAAGATCC AAGTGCGTAT TTTTTAGCCT TCGCGCGACG CGCGACGACG CGCGGGACGA CGACGCGCGC GAGGGACGCG ACGCGCGCGC GTCGAACGAA CGAACGAACG ATCGGGCGAT CGATCGCGGT GGGCGATCGC GCGCGGCGCC CTCGCGCGGG CGGTGTCGCG CGGACGGGCG CGGGACGGCG CGCGGTGGAT GCTGGAAATG TGTCGCGCGC GTCGCGCGAC GTCGGGGGTC GAGGCGTCGG TCGCGGGCGA CGCGCGCGCG ACGCGGGCGA TCGGGCGGGA CGCCGCGCGC GAGATACGGT TTCGCTCTTA TCGCCGCGGA TGGGGGTCGG CACGAGTCGT GAACCGTGAA AACGTGGATC GGGGGGGGAA CAGACTTTTT AGAGCGGCGG GCGCGTCGGC GGACGACGAC GGATGGGCGT GCGCGATCAC GAGACGACGA GACTGACTGG AGAATATAAA CTTCGTTCGA TCGCAGGCCA AGTCTGTCTC CACGACGGTC AAGGCTGACA TCTACCCGGA ATTCGGTACC TACCCGGGCG GTGGCGAATC CCCGATCATC CCGTTCGGCG ACGAAAAGAA CGCCGAGCGT GAAGTGATCC ACGGCCGCTG GGCGATGCTT GGCGTCACCG GTGCGTGGGC CGCCGAAAAC GGCACCGGCA TCCCGTGGTT CACCGCGGGT ACCTTGTGCA CCCCGGATGA CTGCACCGCC GTCGCGGACA AGTTCCCGGG CGCCGTCGCC CCGCTCGCGC CGGAAGGCTC TGGCTACCCG TCCTTCTGGA ACGTTCTCAT CATCGAGATC GTTCTCGTCG GCGCCGCGGA AGCGTACCGT ACCGGTATCT CCGACTCTCC GTTCGATGAT GGCCTCACCG TCGGTGACGT CAACCCGGGT GGACGCTTCG ACCCGCTCGG CCTCGCCGAG TCTGGCGACC TTGAAGAACT CAAGATCAAG GAGCTCAAGC ACTGCCGCTT GTCCATGTTC GCGTGGTTGG GCTGCATCTT CCAAGCGCTC GCCACCCAAG AAGGCCCGAT CGCCAACTGG CAATCCCACG TTGCGGACCC GGTTCACTCC AACGTCCTCA CCAACGCGGC CAAGGGCTTC GGCTTCTACT AAGCGGTTCA CCGTGCGTGA CGTCGCTCTC GCGTCGTCTC TCTACCAACT CGTCTCTCTA CCAACTTAGA AGATTGCATT TTAATAAAAA CGATTTTCGA ATCAACAAAA
|
Protein sequence | MSALLASSFV GRVAAFKATK IQAKSVSTTV KADIYPEFGT YPGGGESPII PFGDEKNAER EVIHGRWAML GVTGAWAAEN GTGIPWFTAG TLCTPDDCTA VADKFPGAVA PLAPEGSGYP SFWNVLIIEI VLVGAAEAYR TGISDSPFDD GLTVGDVNPG GRFDPLGLAE SGDLEELKIK ELKHCRLSMF AWLGCIFQAL ATQEGPIANW QSHVADPVHS NVLTNAAKGF GFY
|
| |