Gene OSTLU_119472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_119472 
SymbolCcd-Arp 
ID5000320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp404514 
End bp407258 
Gene Length2745 bp 
Protein Length914 aa 
Translation table 
GC content47% 
IMG OID640415741 
ProductCarotenoid dioxygenase plastidic fusion protein, putative 
Protein accessionXP_001416379 
Protein GI145343543 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0452333 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCTCAC TATCGGCACA CAACGCTCCT GTGCGCTACT ATAAAACATA CCGCCGCAAG 
AGCTGTCTTC TTAGTTTGCA GACAAGGCGT GTGTCCTCTA TACTTCGAGT ACAAACGGAA
ACAGATTTGT CTACGAGAAC GATCCAAGAA ATAGATTCAA TTCAGCCTTC GAGTGATGAG
GATACAGGCT ATATTTCTGC CGCCGCTTCT ACGTCGTATC AACTTCTAGA TCCAGTGGAA
TACGGCGAAC ATGGTGCCGC TTTTCTTTCT GGAAACTATT ATCCGGTAGG ACAGGAAGTC
ACGGCCTGTT GCGATTATGA TGCGCAAGCT GGTGATGATT CTTCGACGAT GAAAATTAAC
GGTAAAATTC CAAGCGATTT TCCACTCGGT CAGTACGCGT ACGTAGGCCC CAATCCAAAG
TTTGCTGTTG ATCACTACAA GCGTTGGGGT GCAGGTCCAG GGCAAGTAGA TTTTGGTTTA
GGATCAGGCT GGCATCATTG GTTCGAGGGT GATGGTATGA TCTACGCTGT AGACTTTTGC
ACTGGCAACA GAGTTAAATT TAGAAATCGT TTTATTCGGA CGTCTAGCTG GAAGCGTGAG
ATTGAAAGCG GAAAACGCAT ATTTAGACCA CTCATGAATG CTGACGGTGG AACATTTCTG
GCAAATGCAC TTTCCAACCT CATCACGGGT GGAATGTTTA TGAAAGACAG TGCAAATACG
GCGCTTCTCC ATTTTGCTGG TAGGACGTTC GCGCTACAGG ATACCTGTCC GCCTTGGGAA
CTTGATCCAG ATACCCTCGA AACAATTTCA GCATGTACGT TTGATGGCAA ACTACCTCCT
TACGTTCCTT TCACTGCGCA TCCAAAAGTG TTACCGTCAA CCGGTGAACT TATTTTCTTT
GGTTTCAATC CAGTCTCACC GCCACATTGC TCTGTGGGTA CGTTGAAGCC AGATGGTACA
ATCTCGAGCA TCAACTCGCT TTGGTCTATC CCGTTTGTTG GTAGCATTTT CATGCACGAT
TTCTGTGTAA CTGAGCATTT TACGGTTCTT TTTGAAGGCA GCATGGATAT CAAGCCTGTT
CGACAGTTGC TAGGGAAACA CCCTCTTCAG TATAACGAAG ACAAGCGGGC GCGTTTTGGA
GTGTTGAATA GAGATGCACA GTCAACAAGC GTGACTTGGT TTAATTGTAG CTCGTCACAA
ATGGTATACC ACTTCATCAA TTCGTGGGAG GAAAAAAATG AAAGGAATGA GCAGTTGATA
GTCATTACTG GAATTCGTGA AGACGGCTTT TTCCAAGAGG CGATGCGCGC TAACGGTTCA
AGTGAGTGGA TCAAAGAGGC CGTGCGAGTT GGCAAGATTC CAAAAGTTCA TGAATGGAGA
ATAAATCTTG TCACTGGAAA GGTCACAGAA AGATATCTAT TTAGTATTCC TTTAGAAACC
CCACGCATCA ACGACGCATT TGTTGGAAGG CGAAACAGGT ACGCCTACGC CGGGCGAGTC
CTTTTGTCAG AGTTGGAATC GACCACTCAG CTCAAGTTTG ACGCAGTCGT GAAAATTGAT
ATGCAAGAGC AGAAGACGGT CACTTATGAG CATGGTCCAA ACCGTTACGG TATGGAAGCC
CAGTTTGTTG GAAGACCAGG AGGTGTAGAT GAAGATGATG GCTGGTTAGT TATGTACGTT
CATGATGAGA GCTATACAGC AACAGACGTT AATGGGCGCA CCGAGTGCGT CATAATTGAT
GCACAAGATG TAGAGGCTGG TCCCGTAGCA ACAATTCTTC TGCCCGAAAG AGTGCCATAT
GGGGCTCATT GCATGTGGCG GGCGGATCTA AAATACCCAG GTGCAACATT TGAAGACACA
GCAGTAGCAT TTGAAGCACT CAGTGCGAAT TTTCAAGTTA ATGAGCCAAA GATGTTTGCC
TTCGAACAGT CGCAGCGAGG AGAGCTTCTC GAGGCTGTTT GGAAAGGGCT CTCTCGAGTT
GCTTTGGGTC TGTTTGTACA CGGTTGGTCC CCACGCATCG CGCGTGAAAA CGTTACAGAG
TACGCTTTCG TGAGAGGCGC TGGACTACGG TTTTTGGAGG TCAGCAAGCT TGGTAGCCAC
CGTCTGAGAG AGGCAGTCAA AGAACGTGCC GACGCTACGC CAGCTCCACC AACGCTGACA
CTTTATGACG TCGAGGATGA CGACTCCTGT CGGCTTGTAC GAGAGGTTTT GAGTATTTGT
GACGTGAGTT ACTTGTGCAA ACCGTGTCCT ACCGCGTCGT GTTCGAATAG CTCCGAGTTA
GCGATTCTAC AGGGTGTTGA ACTTGGGTCT GAAGTAGTCC CTTTCCTACG AGATGATCGA
AACGACGACA TCGCTGTGAA AGGTGCTGAC GGCATCATTC AGTATCTCTA TCAGGAGTAT
CTGGATGGCG AAGAACCAGC TTCTCTTGTC TCATTGTTTC AGAGGTTCGC TCAAGCGTCA
AAGATCAATG ATGCTTCGCA TCGGCGTACT TCACGTGCAG GTGAGCAGCC TTTGATTTTT
TGGGGTTACG AAGCTTCTCC ATTTTGCGCT CTAGTGCGAA AAGCTCTGAA CGAGCGGGGA
ATTTCTTATG TTTTTCGTCC ATGTGCACGC GGTAGCCCAA GAAGGAGTTT GCTTCTCAAT
CGCACCGGTA TATTTCAAGT ACCTTATCTT GAAGATCCGA ACACCGGGAT CTCGTTATTC
GAGAGTGTCG ATATCATCAG CTATCTGAAG AAGACCTACG ACTAG
 
Protein sequence
MGSLSAHNAP VRYYKTYRRK SCLLSLQTRR VSSILRVQTE TDLSTRTIQE IDSIQPSSDE 
DTGYISAAAS TSYQLLDPVE YGEHGAAFLS GNYYPVGQEV TACCDYDAQA GDDSSTMKIN
GKIPSDFPLG QYAYVGPNPK FAVDHYKRWG AGPGQVDFGL GSGWHHWFEG DGMIYAVDFC
TGNRVKFRNR FIRTSSWKRE IESGKRIFRP LMNADGGTFL ANALSNLITG GMFMKDSANT
ALLHFAGRTF ALQDTCPPWE LDPDTLETIS ACTFDGKLPP YVPFTAHPKV LPSTGELIFF
GFNPVSPPHC SVGTLKPDGT ISSINSLWSI PFVGSIFMHD FCVTEHFTVL FEGSMDIKPV
RQLLGKHPLQ YNEDKRARFG VLNRDAQSTS VTWFNCSSSQ MVYHFINSWE EKNERNEQLI
VITGIREDGF FQEAMRANGS SEWIKEAVRV GKIPKVHEWR INLVTGKVTE RYLFSIPLET
PRINDAFVGR RNRYAYAGRV LLSELESTTQ LKFDAVVKID MQEQKTVTYE HGPNRYGMEA
QFVGRPGGVD EDDGWLVMYV HDESYTATDV NGRTECVIID AQDVEAGPVA TILLPERVPY
GAHCMWRADL KYPGATFEDT AVAFEALSAN FQVNEPKMFA FEQSQRGELL EAVWKGLSRV
ALGLFVHGWS PRIARENVTE YAFVRGAGLR FLEVSKLGSH RLREAVKERA DATPAPPTLT
LYDVEDDDSC RLVREVLSIC DVSYLCKPCP TASCSNSSEL AILQGVELGS EVVPFLRDDR
NDDIAVKGAD GIIQYLYQEY LDGEEPASLV SLFQRFAQAS KINDASHRRT SRAGEQPLIF
WGYEASPFCA LVRKALNERG ISYVFRPCAR GSPRRSLLLN RTGIFQVPYL EDPNTGISLF
ESVDIISYLK KTYD