Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_50249 |
Symbol | |
ID | 5003434 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 208071 |
End bp | 209755 |
Gene Length | 1685 bp |
Protein Length | 495 aa |
Translation table | |
GC content | 54% |
IMG OID | 640418855 |
Product | predicted protein |
Protein accession | XP_001419309 |
Protein GI | 145349786 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.430177 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.311045 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGATG AATTGTCGCA CTATTTTGTG AATTCTTCGC ATAATACGTA CTTGGACGGC GGACAGTTGT TCTCGAGGTC GACGAGTTCG GCCATCGCGT GGGCGCTCGA ACGCGGGTGC AGAGTCGTCG AGCTGGACTG TTACGACGGA GGGTCGAAGG GGCCGATCAT CACGCACGGA GGGACGGCGG TGCGCCCGAT GTTGTTTAAG GACGCGATCG CGGTTATCAA CGACAGGGCA CACGTAGCTA GCGAGTATCC GGTGATTGTG ACTTTGGAGA ATCACGCGAG TCGGGAGACG CGCGCGGTGA TGGCGAAAAT CATGCGACAC ACGTTCGGCG ATAAACTCTG GACGCCGCCG TCGAAAGGCG AAGGCGAAGG CGAAGAAGAA TCATACGTGT TGGACCGCTG GCCATCGCCG GCCGAGTTGA AGGGTAAAGT GATAATTCGG GACAAGGTGA AGCACAAGCA AGACGAAGTC AAGAACGCAT CGTTCGGCGT CGGAAAAGTG TTCGAGGCGA TATCCTCGAG GGGCAAGTCT AAGTCTACTC GGAAGCTTCT ACAGGAGAGC ACCAAAGCTA CACTTTCAAC TGGGAAAAAC AAGCTCTCGG TGGCGAACTC CGTGGTCAAC CTCTCCAAAA GTGCGCCTGC AGGCGTTCCT GAAGACTCCG ATGGGACGAG CGAGGACGAC GGCGGCGAGG ACGACGAGGA CATCAAGGCA CTCGTCTCAT TGCGAAATTT GAAGTTTCAT GGTTTCAAGG AAGCGAAAGA TCTCGGTACA AAGTTTTCTT GCAGTTGGAG CGAGAACAAG GCCAAGAAAT TAGTCGAAAA GTCGAGCCAA AAGGATTTGC TCGAATTCAC CAAGGCGCAT TTGCTTCGCA CGTACCCGGG CGGTCAACGC ATCATGAGTA ACAATTACGA TCCCTCCGAC GCGTGGTCCA TAGGCGCATC GCTCGTCGCG CTCAACTTTC AGGCGCAAGA CAGATATATG TGGGTGAACC AAGCCAAGTT TGCGGTCAAC GGTGGGTGCG GTTACGTGAA AAAGCCCGAC TATTTAATCA ACCCGTCGGT TCAAAGACCG ACCAAGCCTA GAATTCTGCG CATACACGTC TTCTGCGGAC TAGGTTGGGA AAATTTCAAG GATGCCGATT TCATGTCGGC ACCGGATACG TTCATGAAGA TTTCCCTCTT CGGTTGCGTC GCCGATCGCT TGTCCGCGAC TTCCAGAGGG AATTCAATGA GAACGTCGGT GTACTCCAAG GCACGAGTCG GTCCGTGCGC TCAACCCATT TGGAACGAGC ACTTTGACTT GGAAATTCGC GAGCCCGAAC TCACGGTGCT ACAAATCCAA GCCATGGACA AAGATGGCGC GCGCGATGAG TTCCTCGCAC ACTACGACGT CGCCGTTAGC GCTTTGCGCG AAGGCGTCCG CATTGTACCG TTGCTCGCGC GCGACGACGA ATACGTACAC GACAGCAAGT CCTGCGCTGG CGTTTTGTGC AAGTTTGAGT GGCTCGATGA GAAAAGCTCG TCCAACGATG CTCTGCCAGC TCGAACGAAC GAGTCAAAAG AGACGACCAA CATCTCCGAA GATGCGTAAC AGACTAATAT GTGGCTGTGG ACGTCGCCGA CGTCTCAGCG TCTCACACTT CTTACTTTTC TATGTGTAGA ACATTCGCGT TGTAA
|
Protein sequence | MTDELSHYFV NSSHNTYLDG GQLFSRSTSS AIAWALERGC RVVELDCYDG GSKGPIITHG GTAVRPMLFK DAIAVINDRA HVASEYPVIV TLENHASRET RAVMAKIMRH TFGDKLWTPP SKGEGEGEEE SYVLDRWPSP AELKGKVIIR DKSTKATLST GKNKLSVANS VVNLSKSAPA GVPEDSDGTS EDDGGEDDED IKALVSLRNL KFHGFKEAKD LGTKFSCSWS ENKAKKLVEK SSQKDLLEFT KAHLLRTYPG GQRIMSNNYD PSDAWSIGAS LVALNFQAQD RYMWVNQAKF AVNGGCGYVK KPDYLINPSV QRPTKPRILR IHVFCGLGWE NFKDADFMSA PDTFMKISLF GCVADRLSAT SRGNSMRTSV YSKARVGPCA QPIWNEHFDL EIREPELTVL QIQAMDKDGA RDEFLAHYDV AVSALREGVR IVPLLARDDE YVHDSKSCAG VLCKFEWLDE KSSSNDALPA RTNESKETTN ISEDA
|
| |