Gene OSTLU_13090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_13090 
SymbolPYR4 
ID5004800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp47925 
End bp49079 
Gene Length1155 bp 
Protein Length384 aa 
Translation table 
GC content65% 
IMG OID640420221 
Productdihydroorotate dehydrogenase 
Protein accessionXP_001420736 
Protein GI145352825 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00338155 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCGGCGG TGCGATTCGC GGACGGAGAC TGGGCGGAGG ACGCGGAGTT CGCGGCGTAC 
GGCGCGGCGA CGCCGGCGCT GCGAGCGCTC GACGCGGAGA CGGCGCACGA CGTCGCGGTG
GCGGCGCTCG CGCTGGGACT GGGACCGAGA CGAAGGCGGC GAGACGGGGA GGCGCTGCGC
GTGGAGGCGC TCGGGACGAC GTTTTCGAAC CCGATCGGTC TGGCGGCGGG ATTCGATAAG
GACGCGAGGG CGTTCGAGGC GTTGCTGAGG GTGGGATTCG GGTTCGTGGA GATCGGGAGC
GTGACGCCGA AGCCGCAGCC GGGGAATCCG AAACCGCGCG CGTTCCGCCT GCGCGAACAC
GGGGCGGTGA TCAATCGGTA TGGGTTCAAC AGCCAGGGAC ACGAGAGCGC GAGGACGCGA
TTGGCGAGGC GGCGCGACGC CGTCGCCGCC GAGGGCGACG ACGCGACGGC GGAGCCGCGC
GGGGTGCTCG GCGTAAATCT CGGGAAGAAT AAGCTCACTC CTGAAGACAA CGCGGCGGAT
GATTACGTCT TGGGGGTGGA GAATATCGGG GAATTCGGCG ATTACATCGT CGTGAACATT
TCCTCGCCGA ACACGCCGGG TTTGAGAAAC TTGCAAGGTC GCAAGCATTT GAGCGGGTTG
CTGCGCAAGG TTTTGGACGC GCGCGACAAA AATCCGGGCA CCGCGAAGAC GCCGGTGTTG
GTGAAAATCG CCCCCGATCT CACCGACGCC GCGTTGAGGG ATATCGCGAG CGTGGTGAAG
AGCGAAAAGG TCGACGGAGT CATCGTGAGC AACACCACCA TCGCGCGACC GGACGCCATC
AAAGCGCACG CGCACGGCGA CGAAGCGGGC GGTCTGAGCG GTAAACCGCT CATGGAGCCG
AGCACTAAGG TGTTGCACGA CCTGTACAAG CTCACGGGCG GCAAAATCAC CTTGGTGGGA
TGCGGCGGCA TCGCCAGCGG CGAAGACGCG TACGCAAAGA TTCGCGCGGG CGCGTCGTTG
GTGCAGTTGT ACACCGCGTT TGCCTTCGAA GGTCCGCCTT TGATACCTAG AATCAAGCGA
GAACTCGAGG AGTGCTTGGC GCGCGACGGT TTCAAGAGTG TGCAAGACGC CATCGGCGCG
GCGCACCGCA AGTAG
 
Protein sequence
MAAVRFADGD WAEDAEFAAY GAATPALRAL DAETAHDVAV AALALGLGPR RRRRDGEALR 
VEALGTTFSN PIGLAAGFDK DARAFEALLR VGFGFVEIGS VTPKPQPGNP KPRAFRLREH
GAVINRYGFN SQGHESARTR LARRRDAVAA EGDDATAEPR GVLGVNLGKN KLTPEDNAAD
DYVLGVENIG EFGDYIVVNI SSPNTPGLRN LQGRKHLSGL LRKVLDARDK NPGTAKTPVL
VKIAPDLTDA ALRDIASVVK SEKVDGVIVS NTTIARPDAI KAHAHGDEAG GLSGKPLMEP
STKVLHDLYK LTGGKITLVG CGGIASGEDA YAKIRAGASL VQLYTAFAFE GPPLIPRIKR
ELEECLARDG FKSVQDAIGA AHRK