Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_119472 |
Symbol | Ccd-Arp |
ID | 5000320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 404514 |
End bp | 407258 |
Gene Length | 2745 bp |
Protein Length | 914 aa |
Translation table | |
GC content | 47% |
IMG OID | 640415741 |
Product | Carotenoid dioxygenase plastidic fusion protein, putative |
Protein accession | XP_001416379 |
Protein GI | 145343543 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0452333 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCTCAC TATCGGCACA CAACGCTCCT GTGCGCTACT ATAAAACATA CCGCCGCAAG AGCTGTCTTC TTAGTTTGCA GACAAGGCGT GTGTCCTCTA TACTTCGAGT ACAAACGGAA ACAGATTTGT CTACGAGAAC GATCCAAGAA ATAGATTCAA TTCAGCCTTC GAGTGATGAG GATACAGGCT ATATTTCTGC CGCCGCTTCT ACGTCGTATC AACTTCTAGA TCCAGTGGAA TACGGCGAAC ATGGTGCCGC TTTTCTTTCT GGAAACTATT ATCCGGTAGG ACAGGAAGTC ACGGCCTGTT GCGATTATGA TGCGCAAGCT GGTGATGATT CTTCGACGAT GAAAATTAAC GGTAAAATTC CAAGCGATTT TCCACTCGGT CAGTACGCGT ACGTAGGCCC CAATCCAAAG TTTGCTGTTG ATCACTACAA GCGTTGGGGT GCAGGTCCAG GGCAAGTAGA TTTTGGTTTA GGATCAGGCT GGCATCATTG GTTCGAGGGT GATGGTATGA TCTACGCTGT AGACTTTTGC ACTGGCAACA GAGTTAAATT TAGAAATCGT TTTATTCGGA CGTCTAGCTG GAAGCGTGAG ATTGAAAGCG GAAAACGCAT ATTTAGACCA CTCATGAATG CTGACGGTGG AACATTTCTG GCAAATGCAC TTTCCAACCT CATCACGGGT GGAATGTTTA TGAAAGACAG TGCAAATACG GCGCTTCTCC ATTTTGCTGG TAGGACGTTC GCGCTACAGG ATACCTGTCC GCCTTGGGAA CTTGATCCAG ATACCCTCGA AACAATTTCA GCATGTACGT TTGATGGCAA ACTACCTCCT TACGTTCCTT TCACTGCGCA TCCAAAAGTG TTACCGTCAA CCGGTGAACT TATTTTCTTT GGTTTCAATC CAGTCTCACC GCCACATTGC TCTGTGGGTA CGTTGAAGCC AGATGGTACA ATCTCGAGCA TCAACTCGCT TTGGTCTATC CCGTTTGTTG GTAGCATTTT CATGCACGAT TTCTGTGTAA CTGAGCATTT TACGGTTCTT TTTGAAGGCA GCATGGATAT CAAGCCTGTT CGACAGTTGC TAGGGAAACA CCCTCTTCAG TATAACGAAG ACAAGCGGGC GCGTTTTGGA GTGTTGAATA GAGATGCACA GTCAACAAGC GTGACTTGGT TTAATTGTAG CTCGTCACAA ATGGTATACC ACTTCATCAA TTCGTGGGAG GAAAAAAATG AAAGGAATGA GCAGTTGATA GTCATTACTG GAATTCGTGA AGACGGCTTT TTCCAAGAGG CGATGCGCGC TAACGGTTCA AGTGAGTGGA TCAAAGAGGC CGTGCGAGTT GGCAAGATTC CAAAAGTTCA TGAATGGAGA ATAAATCTTG TCACTGGAAA GGTCACAGAA AGATATCTAT TTAGTATTCC TTTAGAAACC CCACGCATCA ACGACGCATT TGTTGGAAGG CGAAACAGGT ACGCCTACGC CGGGCGAGTC CTTTTGTCAG AGTTGGAATC GACCACTCAG CTCAAGTTTG ACGCAGTCGT GAAAATTGAT ATGCAAGAGC AGAAGACGGT CACTTATGAG CATGGTCCAA ACCGTTACGG TATGGAAGCC CAGTTTGTTG GAAGACCAGG AGGTGTAGAT GAAGATGATG GCTGGTTAGT TATGTACGTT CATGATGAGA GCTATACAGC AACAGACGTT AATGGGCGCA CCGAGTGCGT CATAATTGAT GCACAAGATG TAGAGGCTGG TCCCGTAGCA ACAATTCTTC TGCCCGAAAG AGTGCCATAT GGGGCTCATT GCATGTGGCG GGCGGATCTA AAATACCCAG GTGCAACATT TGAAGACACA GCAGTAGCAT TTGAAGCACT CAGTGCGAAT TTTCAAGTTA ATGAGCCAAA GATGTTTGCC TTCGAACAGT CGCAGCGAGG AGAGCTTCTC GAGGCTGTTT GGAAAGGGCT CTCTCGAGTT GCTTTGGGTC TGTTTGTACA CGGTTGGTCC CCACGCATCG CGCGTGAAAA CGTTACAGAG TACGCTTTCG TGAGAGGCGC TGGACTACGG TTTTTGGAGG TCAGCAAGCT TGGTAGCCAC CGTCTGAGAG AGGCAGTCAA AGAACGTGCC GACGCTACGC CAGCTCCACC AACGCTGACA CTTTATGACG TCGAGGATGA CGACTCCTGT CGGCTTGTAC GAGAGGTTTT GAGTATTTGT GACGTGAGTT ACTTGTGCAA ACCGTGTCCT ACCGCGTCGT GTTCGAATAG CTCCGAGTTA GCGATTCTAC AGGGTGTTGA ACTTGGGTCT GAAGTAGTCC CTTTCCTACG AGATGATCGA AACGACGACA TCGCTGTGAA AGGTGCTGAC GGCATCATTC AGTATCTCTA TCAGGAGTAT CTGGATGGCG AAGAACCAGC TTCTCTTGTC TCATTGTTTC AGAGGTTCGC TCAAGCGTCA AAGATCAATG ATGCTTCGCA TCGGCGTACT TCACGTGCAG GTGAGCAGCC TTTGATTTTT TGGGGTTACG AAGCTTCTCC ATTTTGCGCT CTAGTGCGAA AAGCTCTGAA CGAGCGGGGA ATTTCTTATG TTTTTCGTCC ATGTGCACGC GGTAGCCCAA GAAGGAGTTT GCTTCTCAAT CGCACCGGTA TATTTCAAGT ACCTTATCTT GAAGATCCGA ACACCGGGAT CTCGTTATTC GAGAGTGTCG ATATCATCAG CTATCTGAAG AAGACCTACG ACTAG
|
Protein sequence | MGSLSAHNAP VRYYKTYRRK SCLLSLQTRR VSSILRVQTE TDLSTRTIQE IDSIQPSSDE DTGYISAAAS TSYQLLDPVE YGEHGAAFLS GNYYPVGQEV TACCDYDAQA GDDSSTMKIN GKIPSDFPLG QYAYVGPNPK FAVDHYKRWG AGPGQVDFGL GSGWHHWFEG DGMIYAVDFC TGNRVKFRNR FIRTSSWKRE IESGKRIFRP LMNADGGTFL ANALSNLITG GMFMKDSANT ALLHFAGRTF ALQDTCPPWE LDPDTLETIS ACTFDGKLPP YVPFTAHPKV LPSTGELIFF GFNPVSPPHC SVGTLKPDGT ISSINSLWSI PFVGSIFMHD FCVTEHFTVL FEGSMDIKPV RQLLGKHPLQ YNEDKRARFG VLNRDAQSTS VTWFNCSSSQ MVYHFINSWE EKNERNEQLI VITGIREDGF FQEAMRANGS SEWIKEAVRV GKIPKVHEWR INLVTGKVTE RYLFSIPLET PRINDAFVGR RNRYAYAGRV LLSELESTTQ LKFDAVVKID MQEQKTVTYE HGPNRYGMEA QFVGRPGGVD EDDGWLVMYV HDESYTATDV NGRTECVIID AQDVEAGPVA TILLPERVPY GAHCMWRADL KYPGATFEDT AVAFEALSAN FQVNEPKMFA FEQSQRGELL EAVWKGLSRV ALGLFVHGWS PRIARENVTE YAFVRGAGLR FLEVSKLGSH RLREAVKERA DATPAPPTLT LYDVEDDDSC RLVREVLSIC DVSYLCKPCP TASCSNSSEL AILQGVELGS EVVPFLRDDR NDDIAVKGAD GIIQYLYQEY LDGEEPASLV SLFQRFAQAS KINDASHRRT SRAGEQPLIF WGYEASPFCA LVRKALNERG ISYVFRPCAR GSPRRSLLLN RTGIFQVPYL EDPNTGISLF ESVDIISYLK KTYD
|
| |