Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33699 |
Symbol | lhca2 |
ID | 5003874 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 658246 |
End bp | 659280 |
Gene Length | 1035 bp |
Protein Length | 235 aa |
Translation table | |
GC content | 66% |
IMG OID | 640419295 |
Product | chlorophyll a/b binding, possibly photosystem I light harvesting complex |
Protein accession | XP_001419664 |
Protein GI | 145350546 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGTTT CCATGCGCAT TTCCTCCACG GCCGGGTGCG TGTTCGCGCT CGTTCGTTCC GCGTCCGCGT CCGCGTCCGC GCTCGCGCGT CGCGCGTTCG CGCCCGTCGC GCGCGGTCGC GCCAGACGGA TCGTATCGAG CGTATCTTCG ATGCGCTCGA GCGTGCGATC GGGGGCGCGC GCGCGTTCGC CCGGCGCGCG CGCGCGTACG CGCTCGGGTC GCGCGTCGAT GACGATGCGA ACGAATCGAA CGCGTCGCAT CGCGCTCGAC GCGCCGCGTC GTTCGCGGCG TGGACGCCCG ATTCCGTCGA CCCTTCGATC GCCGACTGAC TGATATCCCT CCTCGCTCTT CGCGCTCGCC AGGCTCAAGA CCCGCGTCGC GACGAAGACT CGCGCGTCCC GCGGCCCGGT GGCGGTGTCC GCGAACGCCG ACCGTCCGGT GTGGTACCCG GGTAAGGCGC CGGCGGCGCA CCTCGATGGG TCCTTGCCGG GCGACTACGG CTTCGACCCG CTCTCGCTCT CGGCGGATCC GGAGATGCGC CAGTGGATGG TGCAAGCCGA ACTTCAACAC GCGCGCTGGG CGATGTTGGG CGTCGCCGGT TGCGTTGCCC CGGAGCTTTT GACCAAGATC GGTGTCGCGG ACTTGCCGAA CTGGGTCGAC GCCGCCACTT ACCAATACTG GGCCCCGGGC GGGACGCTGT TCTTCATTCA AATGGCCATG TTCAACTGGG CCGAAATTCG CCGCTGGCAA GACATGAAGA ACCCGGGCTC GGTCAACACG GATCCGTTGT TCGGCTACAA CTCCAACGAC ACCAACACGG ACGTTGGCTA CCCGAAGGGC TTGTTCGACA AGTTCGGCTG GGCTAAGGAT GAGAAGACCA CGGCGGAGCT CAAGTTGAAG GAGATCAAGA ACGGGCGCTT GGCGATGCTC GCCTTCCTCG GCATCTGCGC GCAATACGTC CAAACGGGCG TCGGTCCCGT CGAGAACTTG TTCAGCCACA TGGGTAACCC GGGTCAAGTT GGTGTTTTCA TGTAG
|
Protein sequence | MSVSMRISST AGLKTRVATK TRASRGPVAV SANADRPVWY PGKAPAAHLD GSLPGDYGFD PLSLSADPEM RQWMVQAELQ HARWAMLGVA GCVAPELLTK IGVADLPNWV DAATYQYWAP GGTLFFIQMA MFNWAEIRRW QDMKNPGSVN TDPLFGYNSN DTNTDVGYPK GLFDKFGWAK DEKTTAELKL KEIKNGRLAM LAFLGICAQY VQTGVGPVEN LFSHMGNPGQ VGVFM
|
| |