Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_41212 |
Symbol | |
ID | 5002508 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | + |
Start bp | 658510 |
End bp | 660210 |
Gene Length | 1701 bp |
Protein Length | 555 aa |
Translation table | |
GC content | 60% |
IMG OID | 640417929 |
Product | predicted protein |
Protein accession | XP_001418296 |
Protein GI | 145347692 |
COG category | [C] Energy production and conversion |
COG ID | [COG0281] Malic enzyme |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0240958 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.449043 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGCCGT GGACGCGACA GGTGGTGTCT GGGGTGGAAC TGCTGCGGTC GGGGAAGTAT AACAAAGGGA TGAGCTTCAC GCGGGACGAA CGCGATAGGT TGAACTTGCG CGGGTTGTTG CCGCCGGCGG TGTTCGATCA GAGCGTGCAG GTCGAGCGCG TGATCGAGCG CTTGCGTCGG GTGACGAACG ACGTCGAAAA ACATGCGTGG CTCGCGTCGC TGTACGAACG AAACGAGCGG CTGTTTTATC GCGTGGTGAA GGACCATTTG GAGGAGTTGT TGCCGATTTT ATCGGCGCCG ACGGTTTGGC AAGTGTGCGC CGAGTTTGGT TTGATGTACA GACGACCGCG CGGGTTGTAC ATCTCCATCA AAGACCGCGG GTCCGTGTAT AGGCTGTTGA AAAATTGGCC CGTGCGCGAC GTCAAGGCGA TCGTGCTCAC GGACGGGCAA CGCGTCACGG GACTCGGCGA TTTGGGCGTC CAAGGCATGG GCACGGCGGT GGGCAAGTCG ACGTTATTCA CCGCGCTCGG CGGCCTCGAC CCGGCGGACG TGTTGCCAAT TTGTATCGAT GTGGGTACGG ACAACCAGGC GTTGCTCGAG GATAAGTTTT ACATCGGTCT GAGGCAAAAG CGAACCGGCG GCGAGGAATA CGACGACTTG CTCGACGAAG TCGTCTACGG GTTGAAGCGG CGTTTCGGCC CGCGCGTGTT GCTATGCTTC GAAGAATTTT CCAACAAGAA CGCAAAGCGA TTGCTTGATC GATACAACAG CAATTCTGTC GCGTATTGCG ATGATTTGCA GGGCATCGCT GCGACGACGT TGGCGGCGAT TATCTCGGCG CTGCCGCAAA TGGGCGGATC ATTGAGAGAG CAGCGGTTTC TTTTCGCCGG CGCCGGGGAG ACCGGTGCAC ACACCGCAGA TTTATTGGCG ACGTATATTT CGCAACAACA CGGCATATCG TTGCCCGAGG CGCGAGAGAA CATTTATTTC ATAGATAGAA AAGGTTTGGT GACGCGCGAT CGGGCGCAAC GCGAGGACGA TTTGGAAATT CACAAACTAC CCTACGCGCA CGACATGGAA GGTGCGGCGT CCGTGCGCGA ATCCGTCGAG CTCATCAAAC CGACGGCGTT GATTGGCGTC CGTCGACACC GATTTTCCTT CTTCGAAGGC TCAGTGCTGA AGGATGAAAA GCTCTTCACC GAGGACGTGT TGCGCGCGAT GGCGAAACAT AGCGAGAAAC CGCTCATCAT GGCGCTTTCG CGACCGAGCG CGCTGCGCGA GTGCACGGCC AAGGAAGCGT ACGAGGCGAC GAACGGGAAG TGCATCTTCG TCGGCGGGTG CAAGTCGACG CCGTTCGAGT ACGCGGGTAG AGAAATCGCG CCGTCCGAAT GCAGCACCGA ATACGTCTTT CCGGGATTGG GACTCGGCTT GACCATCGCC GAAGGCACGC GCGTGCGAGA CTCTTTACTC ATGGAAGCCG CCGAAGTCGT GGCCAACAGC GCCACTCCGG GCGATATCGC TCGCGGCGCC GTGTTCCCGC GAAAGCGCCA CATTCCCGAC GTCTCCGCGC GCGTCGCCGC GCGCGTCGCC GGTAAGGCGT TCGCGAGCGG TTTGTCCGCG CTCCCGGGCA AGCCCATGGA CTGGCTTCGC TTGGCGAAAT CGTGGATGTT CGACCCGACG TATCGCCCCT ACACGCCTTG A
|
Protein sequence | MVPWTRQVVS GVELLRSGKY NKGMSFTRDE RDRLNLRGLL PPAVFDQSVQ VERVIERLRR VTNDVEKHAW LASLYERNER LFYRVVKDHL EELLPILSAP TVWQVCAEFG LMYRRPRGLY ISIKDRGSVY RLLKNWPVRD VKAIVLTDGQ RVTGLGDLGV QGMGTAVGKS TLFTALGGLD PADVLPICID VGTDNQALLE DKFYIGLRQK RTGGEEYDDL LDEVVYGLKR RFGPRVLLCF EEFSNKNAKR LLDRYNSNSV AYCDDLQGIA ATTLAAIISA LPQMGGSLRE QRFLFAGAGE TGAHTADLLA TYISQQHGIS LPEARENIYF IDRKGLVTRD RAQREDDLEI HKLPYAHDME GAASVRESVE LIKPTALIGV LLKDEKLFTE DVLRAMAKHS EKPLIMALSR PSALRECTAK EAYEATNGKC IFVGGCKSTP FEYAGREIAP SECSTEYVFP GLGLGLTIAE GTRVRDSLLM EAAEVVANSA TPGDIARGAV FPRKRHIPDV SARVAARVAG KAFASGLSAL PGKPMDWLRL AKSWMFDPTY RPYTP
|
| |