Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_30717 |
Symbol | |
ID | 5000764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 639526 |
End bp | 641346 |
Gene Length | 1821 bp |
Protein Length | 533 aa |
Translation table | |
GC content | 57% |
IMG OID | 640416185 |
Product | predicted protein |
Protein accession | XP_001416719 |
Protein GI | 145344396 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.990452 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTCGGACGCG CGCGTCGACG ACGACGCGGA CGATGACGAC GGCGACGATG ATCTCGACCG CGCGCGCGCC CGCGTCGGCG ACGCGACGAC GCGCGCGCGG CGTCGCGACG ACGACGGCGA CGCGACGCGC GGTGGCGACC GCGGCGGCGA ACGAAAAGGC GAGCGTGAAC GCGGCGACGG CTGAGCGCGA CGCGACGACG ACGATGGAAG CGAAGAACTG GGATCTTCCG CGCGCGCCGT ACGAGGCCGG GAACGCACCG GCGAATGGGA CGTGGGCGCC GACGAATCAA ACGATTAAAG ATTGGCGCTC GGTGTATCGC AACACGACGC TGAGCGTCAC GGGCGAGGGC GAACCGAAGG CGACGATCGT CGGGGAGATC CCGAAAGAGT TGAAAGGGAC GCTGTTTCGC AATGGACCGG GGTTGTTGGA GATTTACGGG AAGAAGCTGA ATCAATTCTT CGATGGCGAT GGGTTGGTGT ACGCGGCGGC GTTTGAGAAT GGCGACGTGA AATTCAGGCA CAATTTTGTC GGCACAAAGG GATTCACCGA AGAGCAAGCG GCGCAAAAGA TGATTTACAA GGGCGCTTTC GCGATTGGTA ACCCCAAGGG TGATTCGCTT TACAACCCTT TCGATTTCGA TGTTAAGAAC GTGGCTAACA CTGGCGTCGT CAACTGGGGC GGCGAGACGT ACGCGTTGTG GGAAGGTGGC AAACCGCACA AGATGCGCTC GGAGGATTTG AAGACTGTCG GCGAGGTTGA TGAAGTTCTC GGCACGAAGC TTCGTGTGCC GCAAATGGCC GCGCACTACA GAATTTGGGA AGGTGCAACC GAGGAAGAGA AACGCCTGAT CGGCTTTAGC ATCGAAAGCC AAGCGTCTTT TCCGCAAAAC GTCAACCGCG CCGCGTTTTA CGAGTGGGAC TCCCAAGGCA AGGCGACGGC TGTGAATAAC TTTGACGTCC CGAACGCCAC TTTTGGATTC TTTCACGATT GCCTCGTCAC GGAGAACTGG TACTTTTTGT TCCAAAATCC CACGAGTTTG AACCCGCAAA AGCTTCTCAC CGAGTACATG TTTGCCAAGT GCGCGTTGGC GGAGACGATT TGCATGGTGG ATGGAAAGAG CGCAATTTTG CACTGCTTGC CTCGCACGGA GAAGGCGCGT AAGATTGGAC AAAAGGCCAT CGAGCTCAAA CCGCAATTCT TCTTTCACCA CATCAACGCA TTCGAGCGCG AGAACGGCGA CGTCGTCTTG GACTCCTTGC CTTGGGTCTT TATGGACTTT TCGATGAATC TCGACTCGGT GAATCCGGCG TTTTTCAACG GTGGTTCCAG AGTCGAGTAC ACGCGATTCG TTGTCAATCC AGTGCGTGAG ACGTGCAAGC AGACGGTATT GAACAATCGT GTTGGGGAAT TTCCGACGAT GAACAACAAG TTTGTTGGTC GTAAATACAC GCACGCTTAC AGCGCGGGAA CCATCGTCAA GGATGACGTC AAGTGGGGCC CGAATCAGTG CTTGGTGAAA CATACCACGG ACCCGAACAG CGAGGCGCCG GCGCAAGAGC AAGTTTGGTA CTATGGCGAA CGGTGTTTTG TGCAGGAACC GATCTTTGCG CCACGTCCTG GAGCGACAAA GGAAGACGAG GGATGGATTC TGCAAGTCGT GAACGACGCC GGCGCAGAGA CGTCGACGCT CCACATCTTC GACGCGTTGG ACATCACGAA GGGACCGGTT GCAAGCGTGC TGTTCGAGGG CGAGCACTTG CCGCCAGGAC TCCACGGCAT GTGGGCGCAA GACGCTGTGT ATTAGGATTT AGCAGTACAC C
|
Protein sequence | MEAKNWDLPR APYEAGNAPA NGTWAPTNQT IKDWRSVYRN TTLSVTGEGE PKATIVGEIP KELKGTLFRN GPGLLEIYGK KLNQFFDGDG LVYAAAFENG DVKFRHNFVG TKGFTEEQAA QKMIYKGAFA IGNPKGDSLY NPFDFDVKNV ANTGVVNWGG ETYALWEGGK PHKMRSEDLK TVGEVDEVLG TKLRVPQMAA HYRIWEGATE EEKRLIGFSI ESQASFPQNV NRAAFYEWDS QGKATAVNNF DVPNATFGFF HDCLVTENWY FLFQNPTSLN PQKLLTEYMF AKCALAETIC MVDGKSAILH CLPRTEKARK IGQKAIELKP QFFFHHINAF ERENGDVVLD SLPWVFMDFS MNLDSVNPAF FNGGSRVEYT RFVVNPVRET CKQTVLNNRV GEFPTMNNKF VGRKYTHAYS AGTIVKDDVK WGPNQCLVKH TTDPNSEAPA QEQVWYYGER CFVQEPIFAP RPGATKEDEG WILQVVNDAG AETSTLHIFD ALDITKGPVA SVLFEGEHLP PGLHGMWAQD AVY
|
| |