Gene Information |
Locus tag | OSTLU_44390 |
Symbol | |
ID | 5004325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | - |
Start bp | 142951 |
End bp | 144252 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | |
GC content | 59% |
IMG OID | 640419746 |
Product | predicted protein |
Protein accession | XP_001420426 |
Protein GI | 145352164 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0516105 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCCCG ACGGCGTCGC GAGCGCTTCG GCGACGTCGA TCGCGGCGAG CGAAAACGGC GAGGCGGAGG AGTTCAATTT ATTGCGGTCG TGCTACGTGT GCAAGTCGCG ATTTCGAAAG ATGCATCACT TTTACGCAGC GTTGTGTCCG GAGTGTGCCG AGTTGAATTG GGAAAAGCGG TCGCAGACGG CGGACATGCG AGGGAAGTTT TGCGTGGTGA CGGGCGCGCG CGTGAAGATT GGGTATCGCA TCGCGTTGAA ACTTCTTCGC GCGGGCGCGA ACGTGATCGC GACGACGCGT TTTCCCGTGG ACGCGCTGAA GCGCTTCGAA GGAGAAGATG ACGCGGGACA ATGGATCGGT CGACTGCAGA TACTCGCCAT GGATTTGCGC GACTTGCCCG CTTTGGAAAA GCTCTGCGCG CATTTGCTCG CGACGCTGCC TCGATTGGAC GTCCTCGTGA ACAACGCGTG TCAGACCGTG CGGCGACCTC CAGCGTATTA CAAGCACCTT CTCCAAGCCG AAGCGCGCTC GGGGTTTACT CGCTCGCTCG CTGGCGCGAA CGCTCGGTGC GACTTTTTAC CGAGCGCCGA CGCCGAGGCG CCGTCGCCCA CGGCGCTGAG CGATTGGCGA CAGGGCGATA AGTTTAAAGA AGCTGGATGG ATGGCGCCCT CAGCCGCAAT GTCGCAGCTC CAACTGATCG CTTCCGACGC CGATGAGTCC AACGAACACT TCCCTGAAGG CGCGCTCGAT GTCAACGGGC AGCAGGTGGA TTTAAGGACG AAAAACTCGT GGACGATGAA GCTTGGCGAA ATTGAGACTC CAGAGTTGCT CGAAGTGCTC GCGGTGAACG CCGCCGCACC TTTCGTGCTG AACGGCAAGC TTCGTCCATT GATGGCCAAG ACCGTCGCTG TCGATAAAGA GGCTGGGGTC GAGTACGCCG CGGCGTTTAT CATCAATGTT TCCGCTATGG AAGGCCAATT CGCTCGGGCG AAACTGAGTA CGCACTCGCA CACCAACATG GCGAAGGCGG CGTTGAACAT GATGAGCGCG ACGTCGGGGA AAGACTTCGC GGAAGATGGC ATCTATATGA ACTCAATCGA TACCGGTTGG ATTAACGACG AGAATCCACT GGAAAAAGCG GCGAGATTGG CGAAAGAGAA GGGTTTCCAG ACGCCCATCG ATGAGGAAGA CGCCGCGGCG CGCGTTCTCG CGCCGGTGTT TGAAGGTTTC AGCGAACCGC CGCAAGCGTG GCCGCCTGTG TTCGGCAAGT TCCTCAAGGA TTACCGCGAA ACATTCTGGT AA
|
Protein sequence | MIPDGVASAS ATSIAASENG EAEEFNLLRS CYVCKSRFRK MHHFYAALCP ECAELNWEKR SQTADMRGKF CVVTGARVKI GYRIALKLLR AGANVIATTR FPVDALKRFE GEDDAGQWIG RLQILAMDLR DLPALEKLCA HLLATLPRLD VLVNNACQTV RRPPAYYKHL LQAEARSGFT RSLAGANARC DFLPSADAEA PSPTALSDWR QGDKFKEAGW MAPSAAMSQL QLIASDADES NEHFPEGALD VNGQQVDLRT KNSWTMKLGE IETPELLEVL AVNAAAPFVL NGKLRPLMAK TVAVDKEAGV EYAAAFIINV SAMEGQFARA KLSTHSHTNM AKAALNMMSA TSGKDFAEDG IYMNSIDTGW INDENPLEKA ARLAKEKGFQ TPIDEEDAAA RVLAPVFEGF SEPPQAWPPV FGKFLKDYRE TFW
|
| |
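The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the source record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon (the record lists "Is gene spliced | No").

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS ending in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G/C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(seq.split()).upper()
    return (bases.count("G") + bases.count("C")) / len(bases)


# Values copied from the Start bp and End bp fields of the record.
length_bp = gene_length(142951, 144252)
print(length_bp)                  # 1302, matching "Gene Length"
print(protein_length(length_bp))  # 433, matching "Protein Length"
```

Passing the text of the Gene sequence field to `gc_fraction` would give a value to compare against the reported GC content. The strand field does not matter for that check, since a sequence and its reverse complement have the same GC fraction.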