Gene Information |
Locus tag | OSTLU_24045 |
Symbol | lhca4 |
ID | 5000142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 280656 |
End bp | 281836 |
Gene Length | 1181 bp |
Protein Length | 203 aa |
Translation table | |
GC content | 69% |
IMG OID | 640415563 |
Product | Photosystem I light harvesting complex, chlorophyll a/b binding |
Protein accession | XP_001416356 |
Protein GI | 145343493 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0272669 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCTCTTCCA CCGCCGTCGC CAGGTGCGCG CTCGACGCGA CGTCGCGCGC TCGGAGACGC GCCGCGAACG GATCGCGCGC GTCGTATCGC GATCGTATCG TATCGCGCGA GGGAGATACG GCGACGGGCT CGCGCGCGCG CGGACGCGGC GCGCGCGCGC GGACGCGCGT CTTCGAACGG GGATTTCGAT TCGAGGGAAT TTCGCGCGTC GCGCGCGGCG ATCGAGCGGC GCGGAAGCGC GCGCGAGGGC GCGGGGGGGG CGGACGTTCG AGCGGCGATC GGGCGCGGCG GGACGGGGCG GGACGCGCGC GGCGACGCGC GCGGGGCTTC GGACTCGACG CCGTCCGCGG CGCCGCCGCG GGCGCGCGAG CGACGCGAGG GAGCGACGGC GCGCGGCGAG GCGGCGCGGC GAGACGCGCG GCGCGAGGGA GGGATGGGAT GGACGGGACG CGAACGCGAC GAACGGGCGA CTGACGACGA ACGCGCGATT TCGCGAATGC AGTTTGCGCG CGGCGAAGGC GCAAAAGCGT CAAAGCTTGA AGACGCGCGC GAGCGCCGCG ACGCAACCGA TGTGGATGCC GGGTGCGACC GCGCCGGCGC ACTTGAAGAA CGAGTTGCCG GGTGATTACG GCTTCGATCC GCTCGAACTC GGCAAGGATC CGGAGGATTT GAAGTGGTAC GTGCAAGCCG AGCTTCAACA CGGCCGCTGG GCGATGCTCG GCGTCGCCGG CGCCGCGGCG CCGGAGATCC TCACCAACAT GGGCATCAGC AACTTGCCGA ACTGGCACGA GGCCCCGTCG TACGACGGCT ACTTCACCGA CGCCACGACT TTGTTCTGGG TGCAAATGCT CATGATGAAC TGGGCCGAAG TGCGCAGATG GCAAGACATT CGCAAGCCGG GTTCGGTGAG CGAAGATCCG ACGCCGTTCT CCAACGCCAA GCTCCCGGCT GGCGTTGTGG GTTACCCGGG TGGCATCTTC GACCCGCTCG GCTACGCCAA GGGTGACTTG AAGACCTTGA AGGCGAAGGA GATCGCCAAC GGTCGTCTCG CGATGGTCGC CTTCGCCGGC ATCATGGTGC AATACGACCA CACCGGCGTC GGCCCGGTGG CCAACTTGGT GTCCCACATG GCGGATCCCG CGCACAACAA CGTGTTCGCC GCCAAGTTCA TCGGCTTCTA A
Protein sequence | MWMPGATAPA HLKNELPGDY GFDPLELGKD PEDLKWYVQA ELQHGRWAML GVAGAAAPEI LTNMGISNLP NWHEAPSYDG YFTDATTLFW VQMLMMNWAE VRRWQDIRKP GSVSEDPTPF SNAKLPAGVV GYPGGIFDPL GYAKGDLKTL KAKEIANGRL AMVAFAGIMV QYDHTGVGPV ANLVSHMADP AHNNVFAAKF IGF
| |
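
The derived fields in this record can be cross-checked against the raw values. The sketch below is a minimal illustration, not part of the database itself: the function names `gene_length` and `residue_count` are hypothetical, and the code assumes that Start bp and End bp are 1-based, inclusive coordinates (an assumption that is consistent with the reported Gene Length). It recomputes the gene length from the coordinates and counts protein residues after removing the spaces that group the sequence into blocks of ten.

```python
# Illustrative consistency check for the record above.
# gene_length and residue_count are hypothetical helper names,
# not functions from any database or library API.


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def residue_count(grouped_sequence: str) -> int:
    """Count residues, ignoring the whitespace used to group them."""
    return len("".join(grouped_sequence.split()))


# Protein sequence copied from the "Protein sequence" row of the record.
PROTEIN = (
    "MWMPGATAPA HLKNELPGDY GFDPLELGKD PEDLKWYVQA ELQHGRWAML "
    "GVAGAAAPEI LTNMGISNLP NWHEAPSYDG YFTDATTLFW VQMLMMNWAE "
    "VRRWQDIRKP GSVSEDPTPF SNAKLPAGVV GYPGGIFDPL GYAKGDLKTL "
    "KAKEIANGRL AMVAFAGIMV QYDHTGVGPV ANLVSHMADP AHNNVFAAKF IGF"
)

if __name__ == "__main__":
    # Start bp and End bp taken from the Gene Information rows.
    print(gene_length(280656, 281836))  # 1181, matches "Gene Length"
    print(residue_count(PROTEIN))       # 203, matches "Protein Length"
```

Both values agree with the Gene Length and Protein Length rows, which suggests the coordinates and the protein sequence were extracted intact.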