Gene Information |
Locus tag | OSTLU_18726 |
Symbol | |
ID | 5006303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009371 |
Strand | + |
Start bp | 257061 |
End bp | 257882 |
Gene Length | 822 bp |
Protein Length | 273 aa |
Translation table | |
GC content | 67% |
IMG OID | 640421724 |
Product | predicted protein |
Protein accession | XP_001422140 |
Protein GI | 145355806 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01993] pyrimidine 5'-nucleotidase |
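The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(257061, 257882)
print(length, protein_length_aa(length))  # prints "822 273"
```

Under these assumptions the result agrees with the Gene Length (822 bp) and Protein Length (273 aa) fields.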
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.245332 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGCG ACGGCGCGGG CGACGGCGCG TCGCGCGCGA GCGCGCGCGC GTACGTGTTC GATCTCGACG GCACGCTGTA CGCGATCGCG AACGGGTACG AGGCGGCGTG TCGACGACGC GTGTACGAAT TCATGGCGAC GCGGTGCGCG GGCGTCGACG ACGTCGCGGA GGCGCGCGTC GTGTGGGAAA AATGGTTCAA GAGGTATAAT CAAACGCTGC GAGCGCTGCG ACACGGCGCG GGGTACGAGT TCGACGCGGC GGAGTATTGG CGATTCACGC GCGGCGACGC GCGCGAGCAC CTCGCGCCGA GCGCGGACGT GCGCGCGTTC GTCGAATCGT TACCGGGCGG GCGAGAGAAT AAGTACGTGT TCACGAATTG CAACGAGACG CAGGCGCTGG AGGCGCTGGA GGCGCTCGGG TTGCGAGACT GTTTCGCGGA CCGCGTCTTC GGCGCGGGAG GCATGGGAGA GTGCTGCAAA CCCGAACGCG AGGCGTTTGA AAAGTTCTTC GCGTTTTGCG GCGTCGACGT CGCGGACGCG TCGGAGTGCG TGTTTTTCGA GGACTCGCTC AAAAACTTGC GCGCGGCGAA GGAGATTTTC GGGATGACGA CCGTGCTCGT CGCCGGCGAG ACGTTTTACG AGGAGACGCG CGACGCGAAG ACCGACGTCG CGGACGCGGC GCCGTCTTAT GTCGATCTCG TGCCGTCTTA CGTCGATCTC GTCGTTCACG GCGGCGAATT GACCGAGCGC GCCGTTTCTC GCGCGCTGCG CGACGTTCCG AGCGCCGCGC GCTGTCGCGT CGCGCTCGGT CTGGACGCCT GA
|
Protein sequence | MARDGAGDGA SRASARAYVF DLDGTLYAIA NGYEAACRRR VYEFMATRCA GVDDVAEARV VWEKWFKRYN QTLRALRHGA GYEFDAAEYW RFTRGDAREH LAPSADVRAF VESLPGGREN KYVFTNCNET QALEALEALG LRDCFADRVF GAGGMGECCK PEREAFEKFF AFCGVDVADA SECVFFEDSL KNLRAAKEIF GMTTVLVAGE TFYEETRDAK TDVADAAPSY VDLVPSYVDL VVHGGELTER AVSRALRDVP SAARCRVALG LDA
|
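The gene and protein sequences above are printed in blocks of 10 characters. The sketch below shows one way to strip that spacing, translate the coding sequence, and compute a GC fraction. Because the Translation table field is empty in this record, the standard genetic code (NCBI table 1) is assumed here; the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Standard genetic code (assumed), in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon or the last full codon.

    Raises KeyError if a codon contains characters other than A, C, G, T.
    """
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks of the gene sequence from this record.
print(translate("ATGGCGCGCG ACGGCGCGGG"))  # prints "MARDGA"
```

The six residues produced from the first two blocks match the start of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.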