Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_29222 |
Symbol | |
ID | 4999995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | - |
Start bp | 1023582 |
End bp | 1025147 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | |
GC content | 59% |
IMG OID | 640415416 |
Product | predicted protein |
Protein accession | XP_001416007 |
Protein GI | 145341832 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1233] Phytoene dehydrogenase and related proteins |
TIGRFAM ID | [TIGR02734] phytoene desaturase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.821914 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTCGC GGCTGGCGAA CGCCGGGTTT GAGGTGACGC TTTTGGAGAA GAACGGCGAC GTGGGGGGAC GGTGTCGAAG CGAGACGTTC GCGGGGGGGG GGGAAGGGTA CAGGTTTGAC ACCGGACCGT CGTTGATGCT GCTTCCGGAG CGGTACATGG AGCAGTTCAC GTCGGTGGGG GAGAAGATGG AGGATTACAT GGACGTCGAA CGCGTGGACC CGGCGTATCG CGCGCACTTT GGCGATCACA CGACGCTCGA TTTGTTGTAC GACATCGAGG CGATGCGTAA GCAGTTAGAT GAAGTCGAGT TCGGGGCGGG AGGGAGATAC ATCGATTGGC TCGGACGCGC GCGGGCGAGT TTGGATTACG GCGTCGCGGC GTTCATCGAG CGAGACGCGA ACTCTATTCT GGATTTTGTG GATCTCTCTC GCGTCGGTCC GCTCGCGCTC GCGGTGAACC CGATCGATTT GTTGCTTCCG CAGTTTAACC AAATGGCCAA GTATTTCAAG GATGAGCGCT TGCGCGCGCT CTTCTCGTAC CAAGAGTTGT ACGTCGGCTT GAGCCCGTAC AACGCGCCGG GCGTCTTCTC TTTGCTCGCC GCCACCGAGC TCACGGACGG GGTGTGGTAC CCCAAGGGCG GGTTCACCAA AGTGCGCGAA AGCTTCAACG CGCTAGCGGA GAAGAAGGGC GCCAAGGTTC GCCTCAACAG CGAAGTCGCC GAGATTCTCA CCGAGGAAGT GAGCGGTTTG GCGGACGCCA AGGGCGGCTC GCACACGGGA CGAAAAGTCA CCGGCGTGCG CTTGGCTTCG GGCGAGATAA TGAACGCAGA CGTCGTCGTC GCTAATCCGG ATTTACCGTG CGTGTGGGAT CAAATGCTCG ACTCGGTGGG CGAGCCGGCG AAGAAGGAGA GCGAAAAGTG GGAAAACGCC GATTATTCGT GCTCCGTGCT CGAGTTCAAC TGGTGCTTGA AGAAGGAAAT CCCTGGATTG TTGCACCACA ACGTTTTTCT ATCTGGTGAC TACAAGGGAA GCTGGGAGCG TCCGGCGACG GTGGATGACT TTGCAGCGCC GCGTCAACAC AACTTTTACT GCCACAACCC GGTGTACACG GATAAGACGT GCGCGCCGGA GGGCGGTGCC TCGCTGATGA TTTTGCTCCC CATTGCGAAC ATTAAAGAGC AAGAAAATAT TTGCAAGAAA CGGGGCGTTC CGGTGCCGAG TGAGACCGAA CTCGTCGAGG CTGGCCGACA AGCCATCTTG CGACGCTTCA AGGAAGCGGG TCACGGCGAC CTCACAGAAA TCATCGCGCA CGAATCCGTC ACCGTGCCCT CGGAATGGCG CGACAAGTTC AACATTAAGA ACGGCGCTGT CTTTGGTTTG TCTCACGGGT TGTTGCAGCT CGCCGCGTTC CGACCGCCGG TGCGCACGGG CATCAAGCAG CTTGACTCGC CCGTCGTCGA CGGCTTGCAT TTCGTAGGCG CGTCGACGCG ACCGGGTAAC GGCGTTCCGC TCGTGCTCAT GGGCGTCAAG GTTGTCGCCG AACAAATCAT GAAAGAAGCG GCTTAA
|
Protein sequence | MASRLANAGF EVTLLEKNGD VGGRCRSETF AGGGEGYRFD TGPSLMLLPE RYMEQFTSVG EKMEDYMDVE RVDPAYRAHF GDHTTLDLLY DIEAMRKQLD EVEFGAGGRY IDWLGRARAS LDYGVAAFIE RDANSILDFV DLSRVGPLAL AVNPIDLLLP QFNQMAKYFK DERLRALFSY QELYVGLSPY NAPGVFSLLA ATELTDGVWY PKGGFTKVRE SFNALAEKKG AKVRLNSEVA EILTEEVSGL ADAKGGSHTG RKVTGVRLAS GEIMNADVVV ANPDLPCVWD QMLDSVGEPA KKESEKWENA DYSCSVLEFN WCLKKEIPGL LHHNVFLSGD YKGSWERPAT VDDFAAPRQH NFYCHNPVYT DKTCAPEGGA SLMILLPIAN IKEQENICKK RGVPVPSETE LVEAGRQAIL RRFKEAGHGD LTEIIAHESV TVPSEWRDKF NIKNGAVFGL SHGLLQLAAF RPPVRTGIKQ LDSPVVDGLH FVGASTRPGN GVPLVLMGVK VVAEQIMKEA A
|
| |