Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31760 |
Symbol | |
ID | 5002103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 488624 |
End bp | 489889 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | |
GC content | 63% |
IMG OID | 640417524 |
Product | predicted protein |
Protein accession | XP_001418025 |
Protein GI | 145347119 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | [TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.264983 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0187143 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGCGC TGTCGCCGAC GATGGAGCGC GGTGGGATCG CGCGGTGGCA TCGCGCGATC GGCGATGAAA TCAAAGCCGG CGACGCGATC GCGGACGTGG AGACGGATAA GGCGACGATG GCGATGGAAG CGACGGATGA TGGGTACCTG GCGGCGATCC TGGTGCCCGA GGGAGCGACG GACGTGGAGG TGGGGACGCC GGTGTGCGTG ATGTGCGAGG AGGCGAGCGC GGTGGCGGCG TTTAAGGATT ATAAAGCGAC GGAGACGGTG ACGACGGAAC CGGCGAAGAG CGCGGTGGAG ACGGCGGTGA CGATGCCGGT GGTGAGGGCG TCGACGCGCG CGACGGCGCG GATGAGCGCG CGCGCGAGCG GGGAACGGGT GTTCGCGTCG CCGCTGGCGA GACGGTTGGC CGAGGAACGA GGCGTGCGAT TGGAGACCGT GAGCGGGAGT GGGCCGAACG GGCGCGTGAT CGCCGAGGAC GTGCTCACGG CGCGCGCGTC GAGCGCGAGT GAGGCGGTGA CGCACACGGT TGTGGCGGAA CATCCGTTGT CCAAGTTTTT CCCGGATTTC GAAGACGTGA GCGTGAGCGC CATCAAGCGC GTCACCGCGG AGCGCTTGAC GGAAAGCAAG CAGCAACTGC CGCATTTCTA TCTCACCGTA GACGTGCGCT TGGATAACAT GATGGGGATT CGTGAAACGT TGAACAAGCA GCTCGCTGAT GATAAGGCTG CGGAAGGGGC GAAGATTAGC GTGAACGATT TCATCGTCAA GGCGAGCGCC AAGGCGTTGC TCGCGGTGCC GGACGTGAAC GCCAGCTGGT TGGGCGATAA GATTCGCAAG TACAAAAAAG CTGATATTTC TGTAGCGGTG CAGACCGAGC GCGGGCTGAT GGTGCCCATC GTGCGATCTG CGTGCTGCCT GGGGTTGAAG TCTATCAGCG CCGAAGTCAA GTCGCTCGCC GGCCGCGCGC GCAGCGGCTC TCTCACGCCC CAGGACATGA CTGGAGGCAC GTTTACCATT AGCAACCTCG GCATGTTCGG CGTGAAGAAC TTCGCTGCGA TCGTAAACCC GCCGCAAGCG GCGATCCTCG CCGTCGGCGG AGCTAGAAAG GAAGTCGTCA AGAACGCAGA GGGCGGCTAC GAGGAAGTCT TGGTAATGAG CGCGACTTTG AGCTGCGATC ATCGCGTCGT GGATGGAGCC GTGGGGGCGC AGTGGCTTCA ATCCTTCAAG TGTTATTTAG AAGATCCGAT GACGATGTTA CTCTAG
|
Protein sequence | MPALSPTMER GGIARWHRAI GDEIKAGDAI ADVETDKATM AMEATDDGYL AAILVPEGAT DVEVGTPVCV MCEEASAVAA FKDYKATETV TTEPAKSAVE TAVTMPVVRA STRATARMSA RASGERVFAS PLARRLAEER GVRLETVSGS GPNGRVIAED VLTARASSAS EAVTHTVVAE HPLSKFFPDF EDVSVSAIKR VTAERLTESK QQLPHFYLTV DVRLDNMMGI RETLNKQLAD DKAAEGAKIS VNDFIVKASA KALLAVPDVN ASWLGDKIRK YKKADISVAV QTERGLMVPI VRSACCLGLK SISAEVKSLA GRARSGSLTP QDMTGGTFTI SNLGMFGVKN FAAIVNPPQA AILAVGGARK EVVKNAEGGY EEVLVMSATL SCDHRVVDGA VGAQWLQSFK CYLEDPMTML L
|
| |