Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_45212 |
Symbol | |
ID | 5000703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 616398 |
End bp | 617831 |
Gene Length | 1434 bp |
Protein Length | 427 aa |
Translation table | |
GC content | 57% |
IMG OID | 640416124 |
Product | predicted protein |
Protein accession | XP_001416712 |
Protein GI | 145344381 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.249426 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGACGACGCG CGCGCGCGAC GAGACGCTCG AAAGCGCGCG CGAACGCGTC GAAATATATC GCCGCCATTC GCTCGACGCC ACCGAATCGC GCGCGAGAGA CGGCGACGAC GGCGATAAGA TAAAGCGTCA ACGATGGCGA CCGTCCCGAG TAAACGAAAG TTGGTCGGGT GCGCGAACTT CGTGCGATCG AACCCGCTGA GCGACGCGTT CGAGTGTGAA AAGTTTGACC ACATCGAGTT TTGGTGCGGG GATGCGACGA ACGCGGCGGC GAGGTTCGGG GTTGGCTTAG GCATGGGGCT GCGATGCAAG AGCGACGCGA CCACGGGGAA CGGGACGTAC GCGTCGTACG CGATGAAGTC GAACGATCTG ACGTTCGTGT TCACCGCACC GTACGGAGTC GAGAGCGGAG GTAGTCGAGG GGAAGCGCCG CATCCGGGAC ACGAGGGACG GGCGATGATG CGATTTTTTG AGAAGCACGG GCTGGCGGCG CGCGCGGTGG GCGTGCGAGT CAAAGACGCG CGCGCGGCGT ATGAGGAGGC AGTGAAACGT GGTGCGCGTG GCGTGCTGGC GCCGACGGTT TTGACACACA CAGTAGACGA CGGATGTGCG AAGGGTGGAC AAGTCATCGC GGAGATTGAG CTATATGGCG ATGTCGTCTT GCGCTTCGTC AACGCGACGG ATGGATTTGA CGGAGACTTT CTGTGCAATT ATTCGGCGAC GCGCGATGCG CCAGATGTGT CGTATGGGTT GCAGCGCCTC GATCACGCCG TCGGTAACGT GCACGATTTG ATCGAAACCG TGGATTATAT CACCAAAGTC ACGGGCTTTC ACGAGTTTGC TGAGTTCACG GCGGAGGACA TCGGAACGAT CGATAGCGGG TTGAATAGCA TGGTGTTGGC AAACAATAAC GAGTACGTGT TATTGCCTGT GAACGAGCCG ACGTTCGGGA CGAAGCGGAA GAGTCAAATC CAAACATATC TTGAGCAAAA CAATGGCCCT GGGTTGCAGC ACTTGGCGTT GAAAACGGAT GACATCTTTG CGACGGTGCG AGAAATGCGC AAGTACTCGC ACTTGCGAGG CGGATTCGAC TTTCAAGCGC CGGCAAGCGA TGACTATTAC AAGCAACTCA AGGCGAAAAT CGGCGATGCT TTGAACGATG AGCAGTACGC GCTTGTCGAA GAGTTGGGTT TGCTCGTCGA TAAGGACGAC CAGGGCGTAT TGATTCAGGT CTTCACGAAG CCCGTGGGCG ACCGGCCGAC GTTATTTTTA GAAATCATCC AGCGCATAGG CTGCATGCGT AGAAAAGCGG ACTCGGAATC ATTTGAGCAA GCAGCCGGAT GCGGTGGGTT CGGCAAGGGT AATTTCTCCG AACTGTTCAA ATCTATTGAA GCGTACGAAG CGACGCTTCA AATTTAGCCA GGCATTGTTA TCAA
|
Protein sequence | MATVPSKRKL VGCANFVRSN PLSDAFECEK FDHIEFWCGD ATNAAARFGV GLGMGLRCKS DATTGNGTYA SYAMKSNDLT FVFTAPYGVE SGGSRGEAPH PGHEGRAMMR FFEKHGLAAR AVGVRVKDAR AAYEEAVKRG ARGVLAPTVL THTVDDGCAK GGQVIAEIEL YGDVVLRFVN ATDGFDGDFL CNYSATRDAP DVSYGLQRLD HAVGNVHDLI ETVDYITKVT GFHEFAEFTA EDIGTIDSGL NSMVLANNNE YVLLPVNEPT FGTKRKSQIQ TYLEQNNGPG LQHLALKTDD IFATVREMRK YSHLRGGFDF QAPASDDYYK QLKAKIGDAL NDEQYALVEE LGLLVDKDDQ GVLIQVFTKP VGDRPTLFLE IIQRIGCMRR KADSESFEQA AGCGGFGKGN FSELFKSIEA YEATLQI
|
| |