Gene Information |
Locus tag | OSTLU_42872 |
Symbol | |
ID | 5003438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 62629 |
End bp | 64140 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | |
GC content | 62% |
IMG OID | 640418859 |
Product | predicted protein |
Protein accession | XP_001419272 |
Protein GI | 145349712 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01780] succinate-semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.701812 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGACCG CCGGTGAAGA CGACGACGCG CGTCGAAGCG CGCTCGCCGC GGCGCTGAAA CTGCGAAACG CAAATTTGCT TCGTTTCGCG GGCGCGAGCG CGACGGACGC GTCGCGAGGC GCGTCGCGCG CGAGCGAAAC GTTCGAGGTG AAGAACCCGG CTACAAACGC GACGCTCGCG ACGGTGCTCT CCACGCCGCG CGCCGCGATT TCCGACGTCT TGAGGCGAAG CGAAGACGCA CAGAAAGTGT GGGCGACGGA ATTCAGCGCG CACGCGCGAG GAAAGGTGAT CAGACGATGG TTTGAGCTCG TCGAGGCGAA CGCCGAGGAC TTGGCGAGAA TCATGACGGC GGAACAAGGC AAACCTTTGA TAGAGTCCCG GGGAGAGGTG GCGTACGCGG CATCGTTTTT AGAGTGGTTC GCGGAAGAGG GGAAGCGCGT GTACGGTGAT GTCGTGCCTT CGTCGTCGAC GGGGACGAGA ATCATGGTTG TGAAGCAACC AGTCGGCGTG ACGGCGGCGA TTACACCGTG GAATTTTCCT TTGGCGATGA TCACGAGGAA AGCTGGTGCG GCGCTCGCGG CGGGGTGTTC GATGGTCGTC AAGCCGAGTG AAGAAACGCC GCTGAGCGCG TTTGCGCTCG GGGTGCTCGC GGAGCAAGCT GGGTGCCCGG ATGGTGTTTT ACAATTCATC GTGGGCGATC CGAGCGCGAT AGGCGCGGCG CTGTGCGAGT CTCCCGTCGT TCGAAAAATT ACCTTCACCG GAAGCACGCG CGTGGGAAAG TTATTGATGA AGCAGAGTGC GGACACCGTG AAGCGCGTAA GCATGGAGTT GGGTGGTAAC GCGCCGTTTG TGGTGTGCGC CGACGCCGAC GTGGACGCCG CCGTTCAGGG CGCGATGGCG AGCAAGTTTC GTAACGCGGG TCAAACGTGC GTATGCGCGC AACGTTTCAT CGTGCACGCG TCCGTGGAGG CTGAATTCGT GCAAAAGCTC GCGGACGCTG CGAGCGCGCT CGTTATGGGC GATGGCTTGG AAAATGAAGA CGCCACGCAA GGCCCGTTGA TCAACGCCGC GCACGCCGAA AAAGTCGATT CGCACGTTCG CGACGCGATG AGCAAAGGCG CGGTGTGTCA CACCGGTGGC AAGCGCGCGC ACGGTAGCTT TTATGAACCG ACCGTGTTGT CCAAGTGCAC GGAAGACATG CTGGTGATGC GCGAAGAAAC GTTCGGACCT GTCGCCGCGG TGACGACGTT CGTCGACGAC GCCGAAGCCA TTCGCATCGC CAACGCCACC ACCGCCGGTT TAGCGTCGTA CGTGTACACC TCTGACGTCA AGCGTACGTT TTACTTTAGC GAAAAGCTTG ACTTTGGTAT CGTGGGCGTG AACACGGGCG CCATATCCAC CGCCCAAGCT CCGTTCGGCG GGACGAAGGA GAGCGGGATC GGTCGCGAGG GCGGCAAGGA CGGCGTTCAC GAGTACGTCG AGCAGAAATA CGTCTGCGTC GGCGGCCTTT AG
|
Protein sequence | MSTAGEDDDA RRSALAAALK LRNANLLRFA GASATDASRG ASRASETFEV KNPATNATLA TVLSTPRAAI SDVLRRSEDA QKVWATEFSA HARGKVIRRW FELVEANAED LARIMTAEQG KPLIESRGEV AYAASFLEWF AEEGKRVYGD VVPSSSTGTR IMVVKQPVGV TAAITPWNFP LAMITRKAGA ALAAGCSMVV KPSEETPLSA FALGVLAEQA GCPDGVLQFI VGDPSAIGAA LCESPVVRKI TFTGSTRVGK LLMKQSADTV KRVSMELGGN APFVVCADAD VDAAVQGAMA SKFRNAGQTC VCAQRFIVHA SVEAEFVQKL ADAASALVMG DGLENEDATQ GPLINAAHAE KVDSHVRDAM SKGAVCHTGG KRAHGSFYEP TVLSKCTEDM LVMREETFGP VAAVTTFVDD AEAIRIANAT TAGLASYVYT SDVKRTFYFS EKLDFGIVGV NTGAISTAQA PFGGTKESGI GREGGKDGVH EYVEQKYVCV GGL
|
| |
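The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is a minimal illustration, assuming 1-based inclusive coordinates, an unspliced CDS (as the record states), and a gene length that includes the terminal stop codon (the gene sequence above ends in TAG). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Residue count for an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, by default, that the
    last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


def gc_percent(seq):
    """Percentage of G and C among A/C/G/T bases; spaces and newlines are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record: Start bp 62629, End bp 64140.
length = gene_length(62629, 64140)
print(length)                  # 1512, matching "Gene Length"
print(protein_length(length))  # 503, matching "Protein Length"
```

Passing the gene sequence string from the Sequence table to `gc_percent` would give a value to compare with the reported GC content; the space-separated 10-base blocks do not need to be stripped first.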