Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_13090 |
Symbol | PYR4 |
ID | 5004800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | - |
Start bp | 47925 |
End bp | 49079 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | |
GC content | 65% |
IMG OID | 640420221 |
Product | dihydroorotate dehydrogenase |
Protein accession | XP_001420736 |
Protein GI | 145352825 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01036] dihydroorotate dehydrogenase, subfamily 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00338155 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCGGCGG TGCGATTCGC GGACGGAGAC TGGGCGGAGG ACGCGGAGTT CGCGGCGTAC GGCGCGGCGA CGCCGGCGCT GCGAGCGCTC GACGCGGAGA CGGCGCACGA CGTCGCGGTG GCGGCGCTCG CGCTGGGACT GGGACCGAGA CGAAGGCGGC GAGACGGGGA GGCGCTGCGC GTGGAGGCGC TCGGGACGAC GTTTTCGAAC CCGATCGGTC TGGCGGCGGG ATTCGATAAG GACGCGAGGG CGTTCGAGGC GTTGCTGAGG GTGGGATTCG GGTTCGTGGA GATCGGGAGC GTGACGCCGA AGCCGCAGCC GGGGAATCCG AAACCGCGCG CGTTCCGCCT GCGCGAACAC GGGGCGGTGA TCAATCGGTA TGGGTTCAAC AGCCAGGGAC ACGAGAGCGC GAGGACGCGA TTGGCGAGGC GGCGCGACGC CGTCGCCGCC GAGGGCGACG ACGCGACGGC GGAGCCGCGC GGGGTGCTCG GCGTAAATCT CGGGAAGAAT AAGCTCACTC CTGAAGACAA CGCGGCGGAT GATTACGTCT TGGGGGTGGA GAATATCGGG GAATTCGGCG ATTACATCGT CGTGAACATT TCCTCGCCGA ACACGCCGGG TTTGAGAAAC TTGCAAGGTC GCAAGCATTT GAGCGGGTTG CTGCGCAAGG TTTTGGACGC GCGCGACAAA AATCCGGGCA CCGCGAAGAC GCCGGTGTTG GTGAAAATCG CCCCCGATCT CACCGACGCC GCGTTGAGGG ATATCGCGAG CGTGGTGAAG AGCGAAAAGG TCGACGGAGT CATCGTGAGC AACACCACCA TCGCGCGACC GGACGCCATC AAAGCGCACG CGCACGGCGA CGAAGCGGGC GGTCTGAGCG GTAAACCGCT CATGGAGCCG AGCACTAAGG TGTTGCACGA CCTGTACAAG CTCACGGGCG GCAAAATCAC CTTGGTGGGA TGCGGCGGCA TCGCCAGCGG CGAAGACGCG TACGCAAAGA TTCGCGCGGG CGCGTCGTTG GTGCAGTTGT ACACCGCGTT TGCCTTCGAA GGTCCGCCTT TGATACCTAG AATCAAGCGA GAACTCGAGG AGTGCTTGGC GCGCGACGGT TTCAAGAGTG TGCAAGACGC CATCGGCGCG GCGCACCGCA AGTAG
|
Protein sequence | MAAVRFADGD WAEDAEFAAY GAATPALRAL DAETAHDVAV AALALGLGPR RRRRDGEALR VEALGTTFSN PIGLAAGFDK DARAFEALLR VGFGFVEIGS VTPKPQPGNP KPRAFRLREH GAVINRYGFN SQGHESARTR LARRRDAVAA EGDDATAEPR GVLGVNLGKN KLTPEDNAAD DYVLGVENIG EFGDYIVVNI SSPNTPGLRN LQGRKHLSGL LRKVLDARDK NPGTAKTPVL VKIAPDLTDA ALRDIASVVK SEKVDGVIVS NTTIARPDAI KAHAHGDEAG GLSGKPLMEP STKVLHDLYK LTGGKITLVG CGGIASGEDA YAKIRAGASL VQLYTAFAFE GPPLIPRIKR ELEECLARDG FKSVQDAIGA AHRK
|
| |