Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_47897 |
Symbol | lhca3 |
ID | 5006296 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009371 |
Strand | + |
Start bp | 232863 |
End bp | 234155 |
Gene Length | 1293 bp |
Protein Length | 281 aa |
Translation table | |
GC content | 62% |
IMG OID | 640421717 |
Product | photosystem I light harvesting complex, chlorophyll a/b binding |
Protein accession | XP_001422134 |
Protein GI | 145355794 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.266881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.436801 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATTCCCGCAC GCACCCCGAT GTTCGCGCGA ACCTTGAACA CCGTCACCCT CGCGAGCGCC CCCGCGCGCG CGACGAAGCG CTGCGCGAAG CGCGCGACCG TGGTCGTGCG CGCCGAGGGC GAAACCGAGG CCGAGGCCGC GCCGGCGCCG GTCAAGGTGC GACGATTCGA CGACGCGCGC GCGCGATGCG GGATCGCGTT TCGGACGTTC GGGCGGTGTG ACGACGCGCG CGAACGCGCG GTGGGACGAA CGACGGCGCG AAACTGTGGA TGGGAAAAGT TTGCGGCGGA TCGACGGCGT CGCGAGGGCG CGCGCGCGAG GGCGCGCGCG CGAGGGAGGG GCGATGGCGA TCGGGGCGTT GATACGATCG CGCACCGTAT CTCGCGACGG GGTTCAAAGA TTTGGGTGAT TTGTTGAGTC GTGACTCGCC AATCGTGGGG GTTGGGCGAG CGCGCGCGAG ACGAGACGAG ACGCGCGGTG ACTGACGACG AATGCGTGTT TTTGTGTGTT TTACAGAAGG CGGCGCGCAA GACGGATGAC CTCGCGCCGT TGTACGTGCT CGGGAACTCT GACCAGTCGT TGTCTTACCT CGACGGCTCT TTGCCGGGCG ACTACGGGTT CGATCCGCTC GGGTTGTCCG ACCCGGAAGG TGCCGGTGGT TTCATCAACC CGCAATGGTT GGCGTACTCT GAAGTGATCC ACGGTCGATG GGCAATGCTC GGTGTGGCCG GTATGGTTGC CCCGGAGGTG CTCGGCGGCC TCGGCATCAT CCCGCAAGAA ACCGGCTTGG TCTGGTACAA GGCGGGTATG ATCCCGGCGC AAGGCACGTA CGATTACTGG GCCAACCCGT TCACGATCTT CTGGATCAAC GCCGCGTTGA TGAACTTTGC TGAACTTCGC CGCGCGCAAG ATTACTGGAA CCCGGGCTCC ATGGGCAAGC AAGAACTCAT CGGCTGGGAA AAGATGCTCG GGGGCTCCGG CGAACCGGCA TACCCGGGTG GTTTCTTCAA CATCATGGGC CAAGGCAAGT CCGACATGGC GAAGATGCGT GTCAAGGAAA TCAAGAACGG TCGTCTTGCG ATGATGGCGT GCTTCGCGTG CGGCGCGCAA GCCGTGATGA CGGGTGAAGG CCCGGTGAAG AACTTGATTG ACCACGTCTC CGATCCGTTC GGCCACAACT TGCTCGTGAA CTTCCAAAAC ATTGGCGGCG TCTCTCCGTT CTAAGCGGGT TGACATTTAG CTAGTAGCAG CGCTTTGTTT TGTATGTAAA CACAACAAAA GCATCATCAA AAA
|
Protein sequence | MFARTLNTVT LASAPARATK RCAKRATVVV RAEGETEAEA APAPVKKAAR KTDDLAPLYV LGNSDQSLSY LDGSLPGDYG FDPLGLSDPE GAGGFINPQW LAYSEVIHGR WAMLGVAGMV APEVLGGLGI IPQETGLVWY KAGMIPAQGT YDYWANPFTI FWINAALMNF AELRRAQDYW NPGSMGKQEL IGWEKMLGGS GEPAYPGGFF NIMGQGKSDM AKMRVKEIKN GRLAMMACFA CGAQAVMTGE GPVKNLIDHV SDPFGHNLLV NFQNIGGVSP F
|
| |