Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_37802 |
Symbol | |
ID | 5006004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | - |
Start bp | 79451 |
End bp | 80590 |
Gene Length | 1140 bp |
Protein Length | 380 aa |
Translation table | |
GC content | 61% |
IMG OID | 640421425 |
Product | predicted protein |
Protein accession | XP_001421964 |
Protein GI | 145355429 |
COG category | [C] Energy production and conversion |
COG ID | [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0314386 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGCGA CCGCGACGAG CGCGCACGTC GCGCGAGGCG CGAAGACGGC GAGACGAGGG ACGGCGGCGC GCGCGGGAGC GACGAAGACG AAGGGACCGC CGGCGAATTT GGAGCGGTTT CGCATCGAGA CGCCGTGGGA TAAGGCGAAG CAAGTGTTGA AGGAAGAGTT TGGGGCGACG GACGAGGAGT TGAAGCGATG CGAGGACTTG AGTGATGAGG ATTTGAATAA GGCGTATTAC ACGATGCAGT TGTGCCGGGA CTTTGAAAAC GAGTGCAACC AAGCGTACAT GGCGGGTAAG ATTCGCGGTT TCATGCACCT TGACAACGGT CAAGAGTCGA TTCCGGCGCT CCTGGCAGAC GCGATCCGTA AGGATGATTT GAAGCACTCC TACTACCGCG ACCATTGCCA CGCCTTGGCG TGCGGCGTGG ATAGCGGCGC CGTCATGGCG GAGTTGTTCG GTAAGGACGG TGGCACGTGC CGAGGTACCG GGGGGTCGAT GCACGTGTAC GACATGGATA CGAACTTTCA AGGTGGATGG GCGCTCGTCG CGGAGCAATT GCCGTACGCG GTCGGTGCGG CGCGATCCAT CGTTCTCGAC AAGATGCTCG GGCGCGATGA CGCGCACGAC CGCGTCACCG TCGTCTTCGT CGGCGAAGGT GGCGCCCAAA ACGGGCGCAT GGCGGAGTGC TTGAACGCGG CGGCGAAGGA GAACTTGCCG ATTCTTTTCT TGGTGATCGA TAACGGTCGC GCGATCAACA CCTTCACCAA GGATGTCGCG ACGAACCAAG AGGTTTTCAA CCAAGGCAAG CACTACGGCG TTCCGGGCGT GCTCGTCGAT GGTCAAAACG TGCAAGATGT CTTGCGCGTC GGTCGCGCAG CGATCAAGCA CGTGCGCACG AAAGGTCCGG CGATCTTGCA AGTGCACACT TTCCGATTCA ACGGGCACTC TCCGGCTGAT CCGGAGCACG AGCGCAACCG CAAGGATGAA AAGCGTTGGG CGCGCGCCGA GTGCGACCCG ATTAAGATTT TCGAAGAGTC TGCCGACGCC AAGCGCATCG ACCTCGGGGC ACAGACTGCC AAGGCTAAGG AAGAAGTTCA GCGCGCTTTG GCGTTCGCCG ATGCTTCTCC GCCGCCGCCC
|
Protein sequence | MRATATSAHV ARGAKTARRG TAARAGATKT KGPPANLERF RIETPWDKAK QVLKEEFGAT DEELKRCEDL SDEDLNKAYY TMQLCRDFEN ECNQAYMAGK IRGFMHLDNG QESIPALLAD AIRKDDLKHS YYRDHCHALA CGVDSGAVMA ELFGKDGGTC RGTGGSMHVY DMDTNFQGGW ALVAEQLPYA VGAARSIVLD KMLGRDDAHD RVTVVFVGEG GAQNGRMAEC LNAAAKENLP ILFLVIDNGR AINTFTKDVA TNQEVFNQGK HYGVPGVLVD GQNVQDVLRV GRAAIKHVRT KGPAILQVHT FRFNGHSPAD PEHERNRKDE KRWARAECDP IKIFEESADA KRIDLGAQTA KAKEEVQRAL AFADASPPPP
|
| |