Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_49109 |
Symbol | |
ID | 5001075 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 366527 |
End bp | 369817 |
Gene Length | 3291 bp |
Protein Length | 994 aa |
Translation table | |
GC content | 53% |
IMG OID | 640416496 |
Product | predicted protein |
Protein accession | XP_001416923 |
Protein GI | 145344821 |
COG category | [C] Energy production and conversion |
COG ID | [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes |
TIGRFAM ID | [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.808055 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.27646 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAAG CGGTGTTCGC GCGATCGAAG GCGAGCGCGA GCGTGGCGCC GCCGAAACCG ACGCCGAATC GAGAGATGAG GGATGACTTT TTAAACGCCT CGAGCGCGGC GTACTTGGAG GCGATGGAGG ACGAGTACAG GAAGGATCCG AAGAGCGTAC CGGAATCGTG GGCGTCGTTG TTGAGACAGA TGGGTACGTA GCGATCGTCG ATGGATGCGC GAGACGGCGA GGATGAGGTT TTGAAGGCGA ACTGGAACGG CGCGCGATTC AATTCCCTCA ATTCGGTTGG CGGACCAATC CAAATTCGGA GCGAACGAAG ACTGACGAGA ATTCCGTGCA TTTTTTCGCG TAAACAGATG CGGGCGTGAG TGGAGCTGAA ATATCTGAAA TGTACACCGC GATGTCGACG GGTACGGCGC CTATGGCTGT GGGGCGACCG TTGGACGCGC AAACGATTCA GGAATCCATG CGGTTGATGA TGTTGATCCG CTCGTATCAA ATTAGCGGAC ATAGCATCGC CAATCTCGAT CCATTGGCGC TTGATGAGCG TGAGATGCCA ATCTCTCTCG ATCCAAGCCT GTACGGGTTC ACGGAAGACG ATATGGATCG AGATTTCTTT ATTGGAACTT GGAAAATGAA GGGCTTCTTG AGCGAAGACC GTCCGGTGCA GACGTTGCGT CAAATCTTGA CCAGGCTCAA AGAGACGTAC TGCGGCACGG TGGGCTACGA GTACATGCAT ATTCAAGATC GTGAGCAATG TAACTGGCTT CGAGCGAAAA TCGAGACCGA GCGCAAGAAG CAGTATTCGC CGGAACGGAA ACAAATTATT TTGGACCGCT TAGCTTGGGG CGAACTTTTC GAAGGTTTCT TGTCCAACAA GTACAGCGCA GCGAAGCGAT TCGGTTTGGA AGGTTGTGAA AGCCTCGTGC CTGGATTTAA GGAAGCGATC GATAAGGCGG CCGAGATGGG GGTTGAAGCT ATCACGATCG GTATGCCGCA CCGCGGTCGT TTAAATGTGC TCGCGAACGT CGTGCGCAAA CCGCTTCAAA CGATTTTCAA CGAGTTTAAG GGTGGTCCAA AACTCGTGGA CGAGTTACCG AACACGGAAT CGCAGTATAC TGGATCAGGC GATGTGAAAT ATCATCTCGG GACTTCGTTC GATCGTCCGA CGTTACGCGG CGGTCAAATC CATCTTTCCT TGGTTGCGAA TCCATCGCAT CTTGAGGCGG TAAACACGGT GGTTACTGGA AAGACTCGTG CGAAGCAGTT CTACACGAAA GATCCGAACG GCGATCGTTC TATGCCTATC TTGCTGCACG GTGACGGTGC TTTCAGTGGT CAAGGGATTG TGTACGAGAC GCTTGACATG AGCAAATTGC CCGAGTACAG CGTCGGTGGG ACTTTGCACA TCGTCGTGAA CAACCAAGTC GCCTTTACGA CTGATCCAAA GTACTCACGC TCGAGCGCTT ACTGCACCGA TGTAGCTAAG GGCATGGAAG TTCCAGTGTT TCACGTTAAC GGTGATGACG TGGAAGCGGT GGCGTGGGTC ATGGAACTCG CCACCGAATG GAGAATGAAA TGGAAGACGG ACGCCGTAGT CGACATCGTG TGCTATCGCA AATACGGTCA CAACGAGATT GATGAACCCA TGTTCACGCA ACCTCTTATG TACAAGGTGA TTCAGCAACA TCCCAGTGTT TTGACGAAAT ATAGCGCCAA ACTCGTCAAC GAAGGCATCA TCACCCCGGA AGACTTCGTC AGCATGAAGG AAAAGATCAA TAACATTATG GAAGAGGAGT TCACGGCGTC AAAGGACTAT GTTCCGAAGC AACGTGACTG GCTCGCATCG CATTGGCAAG GTTTCAAGAG CCCGGATCAG CTTTCGCGCA TCGCAGACAC TGGCTTACCG ATGGATCACA TAAAGAATCT CGGACAACTT ATCACCGCGA TTCCTGCGGG ATTTACACCA CATCGAGTCG TGAAGCGCGT TTATGAAAAT CGTCGCGCCA TGATTGAAAA CGGCAACGGT ATCGATTGGG CCATGGGCGA AGCTTTGGCG TTCGCTTCTT TACTCGATGA AGGCAACCAT GTGCGTTTGT CTGGACAAGA TGTCGAGCGT GGCACTTTTT CTCATCGTCA CGCGCTGATT CATGATCAAA TCACGGGCGA GCGTTTCATT CCTTTGCGAA ACGTCTACAG CGGCAACCCG GGTCGAGGTC AGAACTTCTT CACGGTTTGC AACTCTTCTT TGTCCGAGTA CGGTGTGCTC GGTTTCGAAC TTGGCTACTC TCTTGAGCAT CCCAACTGTT TGATCTTATG GGAAGCGCAG TTTGGTGATT TCTCGAATAC GGCTCAAGTC ATCATCGATC AATTTATCAG CAGCGGTGAG GCAAAGTGGT TACGTCAAAG CGGTCTCACG CTACTTCTTC CGCATGGTTA CGACGGACAA GGTCCCGAGC ACAGTAGCGC TCGACTTGAA CGTTTCTTGC AAATGGCTGA CGAAGATCCG ACGCAGATCC CTGAGATGGA AATGGAACGC CGCACGCAGT TGCAAGAATG CAACTGGCAA ATTTGCAACG TCACCACGCC CGCAAACTAT TTCCACATGT TGCGCCGACA GGTTCATCGT GAATTCCGCA AGCCGCTCGT CGTGATGAGC CCGAAGAATC TATTGCGTCA TCCGAAAGCG GTCTCCGATA TCAGCGAGTT CGATAACAGC GACGACAACG ATTCACTTCA AGGCGTTCGC TTCAAGCGAC TTATCATGGA CAAGACTTCG AAGTCGCGCA GCATGGACTC TCCTGCGGAG AATGAGGTGG AAAGAGTCAT CTTCTGCTCC GGAAAGGTTT ACTACGATCT CGACGACGAA CGCGATGCGG CGAAGAACAT CGACAGGGTG AAAATCTGCC GCATCGAACA ACTCGCGCCG TTCCCGTGGG ATCTCGTCAA GCGCGAATTG AAGCGTTACC CGAATGCCGA AGTTGTTTGG TGCCAAGAGG AACCGATGAA CATGGGCGCG TGGTGGCACG TCCAACCGAG AATGAGTACG TTGTTCAAAG ACCTCGGTCG ATCGGGCGAA ACGCGCTACG CTGGTCGCAA ACCCGCGTCT TCGCCCGCAA CCGGGTACGC CGCCGTGCAC GCGCAAGAGC AAGCCCAACT CGTCGCCGAC GCCATTCGGT AAACTTCCAC CCGCGCCGTG TACAACGGCA AATTGACAAT CGAATCGAGC GCATCGCTCG CGCGCGTCTT AGTCTCAACC GCACGCGTTC GCGATCATTT ACCAACTCAA T
|
Protein sequence | MAEAVFARSK ASASVAPPKP TPNREMRDDF LNASSAAYLE AMEDEYRKDP KSVPESWAGA EISEMYTAMS TGTAPMAVGR PLDAQTIQES MRLMMLIRSY QISGHSIANL DPLALDEREM PISLDPSLYG FTEDDMDRDF FIGTWKMKGF LSEDRPVQTL RQILTRLKET YCGTVGYEYM HIQDREQCNW LRAKIETERK KQYSPERKQI ILDRLAWGEL FEGFLSNKYS AAKRFGLEGC ESLVPGFKEA IDKAAEMGVE AITIGMPHRG RLNVLANVVR KPLQTIFNEF KGGPKLVDEL PNTESQYTGS GDVKYHLGTS FDRPTLRGGQ IHLSLVANPS HLEAVNTVVT GKTRAKQFYT KDPNGDRSMP ILLHGDGAFS GQGIVYETLD MSKLPEYSVG GTLHIVVNNQ VAFTTDPKYS RSSAYCTDVA KGMEVPVFHV NGDDVEAVAW VMELATEWRM KWKTDAVVDI VCYRKYGHNE IDEPMFTQPL MYKVIQQHPS VLTKYSAKLV NEGIITPEDF VSMKEKINNI MEEEFTASKD YVPKQRDWLA SHWQGFKSPD QLSRIADTGL PMDHIKNLGQ LITAIPAGFT PHRVVKRVYE NRRAMIENGN GIDWAMGEAL AFASLLDEGN HVRLSGQDVE RGTFSHRHAL IHDQITGERF IPLRNVYSGN PGRGQNFFTV CNSSLSEYGV LGFELGYSLE HPNCLILWEA QFGDFSNTAQ VIIDQFISSG EAKWLRQSGL TLLLPHGYDG QGPEHSSARL ERFLQMADED PTQIPEMEME RRTQLQECNW QICNVTTPAN YFHMLRRQVH REFRKPLVVM SPKNLLRHPK AVSDISEFDN SDDNDSLQGV RFKRLIMDKT SKSRSMDSPA ENEVERVIFC SGKVYYDLDD ERDAAKNIDR VKICRIEQLA PFPWDLVKRE LKRYPNAEVV WCQEEPMNMG AWWHVQPRMS TLFKDLGRSG ETRYAGRKPA SSPATGYAAV HAQEQAQLVA DAIR
|
| |