Gene Information |
Locus tag | OSTLU_92236 |
Symbol | |
ID | 4999633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 1089980 |
End bp | 1090846 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | |
GC content | 60% |
IMG OID | 640415054 |
Product | predicted protein |
Protein accession | XP_001415685 |
Protein GI | 145341167 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
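The length fields above follow from the coordinates. The sketch below checks that arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the unspliced CDS is a stop codon; the record does not state either convention. The function names are illustrative and are not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 1089980
END_BP = 1090846


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 867 288
```

Both values agree with the Gene Length (867 bp) and Protein Length (288 aa) rows.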
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000528701 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCGCG AAGGCGATCC GAACGGCGCC TCGCGCGCGC CCCTGCGCGC GGTTTTCTTC GACTTTGACG ACACACTCGC GGAAACCACC CTGGCCGATC GCGTCGCGTA TCGCGAATGC GCGATTCGCA TGGAGACCGT ATACGGGCTG TCGAAAAAGC GACAAGATGA AGTGATCGCT GCGTACAAGC GGCGGCTCGC GGAGCGTCCC TGGAACGACG AGTTTGCGCA CGTGTGGACG CACCGCGAGC GGCTGTGGGC GGAAGCGTTC GGCGACGACG ACAGAGGGCT CGCAATGCGG CACGACGTGA ACAGCACATT TAGGGACTGT CGCTTGGAAC AGCTACGTTT AAACAGTTCT GTGTGCGGTG GCATCGAGAA GTTGCGCGCG AAGAATGTGC ACGTCGTCAT CATCACGAAT GGCCACCACG TCGTGCAGCG AGAGAAGCTC GCCGCGTGCG GGATATACGA AGTAGTGAAG TTGGAAAACA TCCTCGTCGG TGGCGAAGAA GTTCTCGCCG GTCGCGACGA GAAACCAGAG GCGTCCATCT TTCACGAGGC GTGCAAACGC GTCGACGTGG TACCAGACGA AGTTATGCAC GTAGGCGACT CGTGGACCGC CGACATGGTC GGCGCCGAAA ACGCTGGTCT GCGTTGGAGA GTGTGGGTGT CCCAACGTCC CGACGACGAG AAGTGCGAGA GCGAACAGGA ACTGTCATCG TCGAAGCGAG CGAAAAAGGT CGACGCCGTT CCGCGCGTAG AAAACATCAA AGAATTTTTC GAGCTCCTGG ACGAGTGGTT GGACGAAGAC GGCACGCTGC CCGCATCCAA CATTCTCTTG AAGACGCGGA GCCGTAGCGA GCGTTAG
|
Protein sequence | MSREGDPNGA SRAPLRAVFF DFDDTLAETT LADRVAYREC AIRMETVYGL SKKRQDEVIA AYKRRLAERP WNDEFAHVWT HRERLWAEAF GDDDRGLAMR HDVNSTFRDC RLEQLRLNSS VCGGIEKLRA KNVHVVIITN GHHVVQREKL AACGIYEVVK LENILVGGEE VLAGRDEKPE ASIFHEACKR VDVVPDEVMH VGDSWTADMV GAENAGLRWR VWVSQRPDDE KCESEQELSS SKRAKKVDAV PRVENIKEFF ELLDEWLDED GTLPASNILL KTRSRSER
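The gene and protein sequences above can be cross-checked with a short script. This is a minimal sketch. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field in this record is empty. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Assumption: standard genetic code (NCBI translation table 1),
# amino acids listed in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" for codons with ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGAGTCGCG AAGGCGATCC GAACGGCGCC"))  # MSREGDPNGA
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing the block spaces) checks the two fields against each other. `gc_content` can be compared in the same way with the GC content row.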