Gene Information |
Locus tag | OSTLU_27888 |
Symbol | |
ID | 5005723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009369 |
Strand | - |
Start bp | 377515 |
End bp | 378636 |
Gene Length | 1122 bp |
Protein Length | 297 aa |
Translation table | |
GC content | 64% |
IMG OID | 640421144 |
Product | predicted protein |
Protein accession | XP_001421786 |
Protein GI | 145355054 |
COG category | [R] General function prediction only |
COG ID | [COG0637] Predicted phosphatase/phosphohexomutase |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0126093 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.151084 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCGCGACGCG GACGCGACGA CGGCGACGCG CTCTCGGATT CGCGAGCGGA CGCGCGTCGA ACGCGCTCGG GCGGCGTTTC GGACGCGCGA ACGCGCGACG CCGGCGCGAC GACGAAAAAG ACGCGTCATC GATAAGAGGG ACGGATTCGA TGGCGATGAC GGCGATGACG ACGTCGACGC GCGTTGAACG CGGCGCGAGG ACGACGGCGG CGAAGAAGAC GACGAAGACG ACGCGACGCG CGACGACGAC GCGCGCGAAC GCGGGATCGA ACACGAAGAA CGCGCGATTC GCGCTTTTGT TTGATTGCGA CGGCGTCATC GTGGAGACGG AGGAGCTGCA CCGGTTGGCG TACAACGGCG CGTTCGAGGC GTTTGATTTG AAAATCGACG GCGAGGGCGT GGAGTGGGTG GTGAAATATT ACGACGTGCT GCAAAATACC GTGGGCGGGG GGAAGCCGAA GATGCGATGG CATTTTAACG AGGATAAAAA GGCGTGGCCG ACGTCGACGA TGTTCGCGGA GGCGCCGTCG AGCGACGCGG ATCGAGACGC GCTCATCGAC GCGTTGCAGG ACAAGAAGAC GGAAATTTAT AAAAAGATCG TCGAAGAAGT GGCTGTGGCG CGACCTGGGG TGCTCGCGCT CATGGACGAG GCGATCGCCG ATCCCTCCAT CGCGGTGGGG ATTTGCAGCG CGGCGACCAA GGCGGGCTTC GAAAAGGTTG TCAACTCCGT CGTCGGCGTC GAGCGTTTGA GCAAGCTCGA CGTGCTCATG GCTGGTGACG ACGTCACGCG CAAGAAGCCC GATCCCTTGA TCTACAACCT CGCCAGAGAC AAGGTTGGCT TGCCCGCGAG CAAGTGCCTC GTCGTGGAAG ATTCCATCGT CGGCTTGCGC GCCGCCGTCG GCGCCGACAT GGCCTGTCTC ATCACCCCGT GCGGCAGCAA CATCGGCGCC GATTTCATGG GCGAAGGCGC GAGCAAGGTC GTCAACGATC TCGGCGCGGT CAAGCTCGCG ATGTTGTTCC CCGAAGGCGC GGAGACGCCC GCGTTCGACG GCTTGTAGTC GCCGACCGAC CGACCGCGCG TCTCTCTCGC GCCAGTCCCT CGCTCGCGCC CACGCGCGTT CG
|
Protein sequence | MTTSTRVERG ARTTAAKKTT KTTRRATTTR ANAGSNTKNA RFALLFDCDG VIVETEELHR LAYNGAFEAF DLKIDGEGVE WVVKYYDVLQ NTVGGGKPKM RWHFNEDKKA WPTSTMFAEA PSSDADRDAL IDALQDKKTE IYKKIVEEVA VARPGVLALM DEAIADPSIA VGICSAATKA GFEKVVNSVV GVERLSKLDV LMAGDDVTRK KPDPLIYNLA RDKVGLPASK CLVVEDSIVG LRAAVGADMA CLITPCGSNI GADFMGEGAS KVVNDLGAVK LAMLFPEGAE TPAFDGL
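The length and sequence fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not the database's own method. The helper names (`clean`, `span_length`, `gc_percent`, `translate`) are hypothetical, and because the Translation table field is empty in this record, the standard genetic code is assumed. The coordinates give 378636 - 377515 + 1 = 1122 bp, matching Gene Length. A 297 aa protein needs only 3 x 297 + 3 = 894 bp including the stop codon, so the listed gene sequence appears to include untranslated flanking sequence around the coding region.

```python
from itertools import product

# Standard genetic code (assumption: the record's Translation table field is empty).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spacing used in the record's 10-letter blocks and uppercase."""
    return "".join(seq.split()).upper()


def span_length(start_bp, end_bp):
    """Length of an inclusive, 1-based coordinate span."""
    return end_bp - start_bp + 1


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate from the first base, in frame, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Coordinates taken from the Start bp and End bp fields.
print(span_length(377515, 378636))  # 1122

# The ten codons starting at the ATG that opens the apparent coding region
# in the gene sequence field.
print(translate("ATGACGACGT CGACGCGCGT TGAACGCGGC"))  # MTTSTRVERG
```

The translated fragment matches the first ten residues of the protein sequence field. Passing the full gene sequence to `gc_percent` gives a value to compare against the reported GC content.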