Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_32217 |
Symbol | |
ID | 5002549 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | - |
Start bp | 472538 |
End bp | 474151 |
Gene Length | 1614 bp |
Protein Length | 504 aa |
Translation table | |
GC content | 62% |
IMG OID | 640417970 |
Product | predicted protein |
Protein accession | XP_001418493 |
Protein GI | 145348098 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1233] Phytoene dehydrogenase and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.340213 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0518556 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGCGT ATTACGGGAA GAAGGTGCGA GAGGGTGGGC GAACGCGCGA GACGCGCGAG ACGCGCGAGA CGCGCGAGAC GCGCGACTGA CGACGGCGAC GCGCGATCGG AACGAACGAT CAGGTGATCG TGGTGGAATC GCATTACGCC GTGGGTGGTG CCGCGCACGG GTTCGCGCGG AAGACGGCGC GAGGGGAATT TAAGTTCGAC ACCGGGCCGA GCTTTTTCGC CGGTTTGACG AAGCCGAACG CGTTGAATCC GCTGGCGAGC GTGTTTCGCT TGCTCGGGGA GGAGTTGGAG ACGGTGTCGT ACGACCCACT GGGGACGTTT CACATAGAAC CTGACGCTCC GGGTCTTCGC AGGCACGCGG ATCTGGCCAA GCTCGTGTCG GAGGTGAAGC GGTTTTCTCC GCGAGGCGCC GAAGAGCTCG AACGGGCGGT GCCGAAGATT CGGACGATGT ACGCCTCGCT CAGCGGTCTG CCGACCACGG CGCTTCGCGC GGATTGGAAG GTGGCTCTCA TGATCTTGAG TCGATACATG AAAGCCATGG CGGGCCTGGG TCCGTACTCG GGCGTCCTGC CGCAGCCGAC GGTGAAACTT TTAGATTTTC TGGATATTAA AGATCCTTGG ATGCGCTACT TGGCCGATTT AGAGTGTTTT TTGTTGAGCG GCGTGGACGC GAGCGGTACG GTGAGCGCTG AGTTTGCCGC CGTGTTCGGG GCGAGTGATA GTTTGGGCGT GAGCGAGTTT CCTCGCGGCG GCGCGGAGGA AATCGCCAAG GCGCTGCAGC GAGGCTTGGA AAAGTACGGC GGCGAGGTTC GTCTGAAGAC GCACGTGGAT GAGATCATCG TAGAAAACGG CACTGCGGTG GGCGTCAAGT TAGCGAACGG TAAGGGAGAA GTTCGTGCTC CGATCGTGCT CTCGAACGCG AGCGTGTGGG ACACGTACGG AACTCTCCTT CCGAAAGGCG CCGCGCCGGC GACCGAGACG CGTGAAGCGA TGGAGACGCC TTACAGCGAG TCGTTCATGC ACCTCCATCT CGGCATCGAG GGCGAGGGCT TAGACTTCAC CCACACCGGT GGTCATCACG TCGTGGTGTT GGACAAGAGT AAACCGATCT CGCAACCCGG GAACGTGTGC ATGATTTCAA TCGCGAGCGT CTGGGAGCCC GACATGGCGC CGGCTGGGTG CCACTGCGTG CACGCGTACA CCATGGAACC TTTCGAAGGC TGGGAAGAGC TCAAGGCGAA CGACAAACAA GCGTACGAAA AGCGCAAGAA GGAGGCTTCG GATAAGTTAT ACGTCGCTCT CGAGCGCGTG ATCCCCGACG TACGCGATCG CGTGCTCCTC GAACTCGTCG CCTCGCCGGC GACGCACAAA TCTTGGCTCC GTCGTCACAA AGGCACGTAC GGCGCCGCCA TTCGCGCCCC CGCGATGTTC CCCGGTCCCA CCGTCGCCGG CATCAAGAAC CTCTACCGCG TCGGCGATTC CGTCGCCCCC GGCGTCGGCG TCCCCGCCGC CGCCGGTTCC GGCGTCATTT GCGCCAACAC CCTCGCGTCC CTCGACGACC ACTTCGCCTG TCAGGACCGA ATGGATGCCT TAAACGCGCG ATGA
|
Protein sequence | MLAYYGKKVI VVESHYAVGG AAHGFARKTA RGEFKFDTGP SFFAGLTKPN ALNPLASVFR LLGEELETVS YDPLGTFHIE PDAPGLRRHA DLAKLVSEVK RFSPRGAEEL ERAVPKIRTM YASLSGLPTT ALRADWKVAL MILSRYMKAM AGLGPYSGVL PQPTVKLLDF LDIKDPWMRY LADLECFLLS GVDASGTVSA EFAAVFGASD SLGVSEFPRG GAEEIAKALQ RGLEKYGGEV RLKTHVDEII VENGTAVGVK LANGKGEVRA PIVLSNASVW DTYGTLLPKG AAPATETREA METPYSESFM HLHLGIEGEG LDFTHTGGHH VVVLDKSKPI SQPGNVCMIS IASVWEPDMA PAGCHCVHAY TMEPFEGWEE LKANDKQAYE KRKKEASDKL YVALERVIPD VRDRVLLELV ASPATHKSWL RRHKGTYGAA IRAPAMFPGP TVAGIKNLYR VGDSVAPGVG VPAAAGSGVI CANTLASLDD HFACQDRMDA LNAR
|
| |