Gene Information |
Locus tag | OSTLU_89603 |
Symbol | |
ID | 5006420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009373 |
Strand | - |
Start bp | 7762 |
End bp | 8808 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | |
GC content | 71% |
IMG OID | 640421841 |
Product | predicted protein |
Protein accession | XP_001422410 |
Protein GI | 145356381 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 76 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCGC CGAACGCGTC GGCCGCCGAC GCGCTGACCG AGCTCGAACT GCGCGTGTAC GACCGGCAGA TTCGCGTGTG GGGCGTGGAG ACGCAGCGCC GACTCGGTCG CGCGTCCGTG CTCGCGTGCG CGGGGGCGAC GACGACGCGC GCGACGACGA CGACGCGCGT CGGCGCGCTC GCGGAGACGC TCAAGAACGT CGCGCTCGCG GGCGTCGGAC GCGCGGTGAT CAGGGACGAC GCGGGCGAGC GCGCGGAGGC GTCGCGCGGC GAGGATGGGA ATTTTTTAAA CGCGGCGTCG ACGCGCGACG ACGACGCGGA CGACGTCTCG GTCTCGCGCG CCGAGGCGAT GGCGACGACG CTGCGAGAGA TGAACGCGTT CGGTGAATTC GAGGCGTCGA CGCCGAACGG GCGCGCGCTC GCGGACGACG CGGAAGCCTT GGACGGGATC GAGGGGTTCG ACGCCGTCGT CGTCGCGGAG ATGGGATTGG AGCGCGCGAT GCGCGTGAAC GAGGCGTGCA GGCGACACGG GAAGCCGTTT TTCGCCGCGT TTAGCGGGGC GTCAGCGGCG TGGTTCTTCG CCGATCTCGG CGACGCGTTC GAGTACGCGG AGGGAGACGA AGTAAAAATC GCGCCTCGAG GCGCGACGCT GCGACGAGCG CTCGACGCCG CCGAGGCGGA TTTCGGGCGC GTTAAGCGGC GGTCGCCGCG CATGCCGCTC GCCGTGCGCG TCGTCGCCGA GTTCGAGCGC GCGCACGGGC GCGCGCCGAC GATGGAGGAT TGGGACGCCC TGGACGCGCT GCGCGTCGAG TTGCCGACGC GATTCGGCGC GAGCGCCGAC TGCGTCGACG CCGAGCACGT GCGCGCTTTG GTGTCGGGAG AGCGCGAATT TCCCGCGATA AACGCCATCG TCGGCGGGGT GCTGGCGCAA GAGATTTTGA AATCCATCAG CCGCAAGGGC GCGCCGTGCG TCAATCTGTT CACGTTCGAC GTCGCGAGCG GGCAAGGCGC GACGTACGAC TTGGGCGGCG GCGAAACGGC GCGCTAG
Protein sequence | MPAPNASAAD ALTELELRVY DRQIRVWGVE TQRRLGRASV LACAGATTTR ATTTTRVGAL AETLKNVALA GVGRAVIRDD AGERAEASRG EDGNFLNAAS TRDDDADDVS VSRAEAMATT LREMNAFGEF EASTPNGRAL ADDAEALDGI EGFDAVVVAE MGLERAMRVN EACRRHGKPF FAAFSGASAA WFFADLGDAF EYAEGDEVKI APRGATLRRA LDAAEADFGR VKRRSPRMPL AVRVVAEFER AHGRAPTMED WDALDALRVE LPTRFGASAD CVDAEHVRAL VSGEREFPAI NAIVGGVLAQ EILKSISRKG APCVNLFTFD VASGQGATYD LGGGETAR
| |
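The length fields in this record can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon; the function names are illustrative and are not part of any database API.

```python
# Sketch: consistency checks for the fields in the record above.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the final codon of the CDS is a stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(7762, 8808)
    print(length)                  # 1047, matches Gene Length
    print(protein_length(length))  # 348, matches Protein Length
```

To check the GC content field, pass the full Gene sequence value to gc_percent; the space-separated 10-base blocks can be pasted as they are, since whitespace is stripped before counting.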