Gene Information |
Locus tag | OSTLU_94867 |
Symbol | |
ID | 5004010 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | + |
Start bp | 325251 |
End bp | 327020 |
Gene Length | 1770 bp |
Protein Length | 575 aa |
Translation table | |
GC content | 57% |
IMG OID | 640419431 |
Product | predicted protein |
Protein accession | XP_001419971 |
Protein GI | 145351197 |
COG category | [J] Translation, ribosomal structure and biogenesis; [K] Transcription; [L] Replication, recombination and repair |
COG ID | [COG0513] Superfamily II DNA and RNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0308188 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0300986 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCAAGC GACAATACAT GGGCGGAGAC GCGAAGTCGA AAAAGGTGAA GACAGCAAAT AGAGGAAAGT TTGTGTTCGA TTGGCGGAAA GAGGAAGACA CGTCGAGAGA TTTGAATCCA CTGTACGATC GGCCGCACGA AGTGGCGCCG ATGTTCGGTC GAGGGATGAT CGGGGGCGTG GATCGAAGGG AGCAGGCGCG CTCGAACGCC GAGCGCGAGC GCGAACTCAT CGTCAAGTCG AGGAAGGATC TCGGATCGAA GGACGCGGCG GGTGATGTGC GAAAGATGGA AGTTGAGCGC GAGCGTAAAC GTAAGGATGT CGAGGCGCGC GAGTTGAAAC GAACTTTCAA GGAGCACTGG AGTGATAAAA AGTTGGAAGA CATGACAGAG CGCGATTGGC GCATTTTTCG AGAAGACTTT AACATTTCGT ACAAAGGCGG CAAGTTACCG CTTCCCATGC GCGCGTGGAA AGAGTGCACG AGCTTGCCAC AAGAGATATT GCGCGCTATC GCGCAGGTTG GGTACGAAAA GCCGTCGCCG ATTCAAATGG CTAGCATTCC GATCGGTTTG CTGAAGAGAG ACGTCATCGG CATCGCCGAG ACGGGTTCGG GTAAGACGTG CGCGTTCGTC GTCCCCATGC TCGCGCACAT CATGCAGCTT CCGAAAATGA CGGACGAAAT TGCCGCGCAC GGGCCGTACG CCCTGATCAT GGCCCCTACG CGCGAGTTGG CGCAACAGAT TGAGGAAGAG ACTCTCAAGT TTGCGCAGTA TTTGGACTAT CGCGTCGGCT TGGTCGTCGG CGGTCAATCG ATCGAAGACC AAGGTTTTAA ACTTCGCAAA GGGGTGGAGA TATTAGTCGG TACGCCCGGT CGTATCATAG ATGTCATTGA GCGCCGATAC ACCGTGCTCA GTCAGTGCAA CTACATCGTG CTCGACGAAG CCGATCGCAT GATCGACATG GGTTTCGAAC CGCAAGTCGT GGCGGTGATG GAGGCGATGG GATCGGGTAA CTTGAAACCC GAGGACGAGG CGGAAGAGCT CGACGGCCAG GCGCTCGAGC AAGGTGGGCC GACGTCGTCA AAGTACCGAA CGACGTACAT GTTTTCCGCC ACCATGCCTC CGAGCGTGGA GCGTCTGGCG AGAAGTTATT TGCGCAATCC CGCGGTGGTC ACCATCGGCA GCGCCGGGAA GACGTCCGAT TTGATCAAGC AAGAGATTAT TTGGGTGTCG AGAAACGAGC GCGACTCCAA ATTTGAGCTC GTGTTATCGC GACATCCCAA CACGCAAGCC ATCGTGTTCG TGAACGCCAA ACGCTCGGTG GACGCCGTGG CGAATCTGTG CTACCGTCTC GGGTACTCGT GCGCGTCCAT ACACGGCGGC AAATCGCAAG ACCAACGCGA GGAGTCTTTG CGCGGGTTCA AGGCTGGGGA TTACGACATC TTGGTCGCCA CCGATGTCGC CGGTCGCGGG ATCGACGTCA AGGGCATCGA TCTCGTCGTC AATTACGAGT TGCCGCACAC GATTGAAAAT TACACCCATC GCATCGGGCG CACCGGTCGC GCCGGTCGCA AGGGCACCGC CGTGAGCTTC CTCACGAGCG ACGATCGCGA CATCATGTAC GAGCTCAAAG AACTTCTCAT CGAGAGCAAG AACCACGTCC CAGATGCGCT GGCAAACCAC GAAGCGGCGC GCGTAAAGCC TCAGCGCGAC GACAGAGGCA GACGCATGAA CCGCGAAGAC ATTCGAGGGC AAGAAGCCAT CATCTACTGA
|
Protein sequence | MLKRQYMGGD AKSKKVKTAN RGKFVFDWRK EEDTSRDLNP LYDRPHEVAP MFGRGMIGGV DRREQARSNA ERERELIVKS RKDLGSKDAA GDVRKMEVER ERKRKDVEAR ELKRTFKEHW SDKKLEDMTE RDWRIFREDF NISYKGGKLP LPMRAWKECT SLPQEILRAI AQVGYEKPSP IQMASIPIGL LKRDVIGIAE TGSGKTCAFV VPMLAHIMQL PKMTDEIAAH GPYALIMAPT RELAQQIEEE TLKFAQYLDY RVGLVVGGQS IEDQGFKLRK GVEILVGTPG RIIDVIERRY TVLSQCNYIV LDEADRMIDM GFEPQVVAVM EAMGSGNLKP EDEAEELDGQ ALEQGGPTSS NVERLARSYL RNPAVVTIGS AGKTSDLIKQ EIIWVSRNER DSKFELVLSR HPNTQAIVFV NAKRSVDAVA NLCYRLGYSC ASIHGGKSQD QREESLRGFK AGDYDILVAT DVAGRGIDVK GIDLVVNYEL PHTIENYTHR IGRTGRAGRK GTAVSFLTSD DRDIMYELKE LLIESKNHVP DALANHEAAR VKPQRDDRGR RMNREDIRGQ EAIIY
|
| |
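The derived fields in this record (Gene Length, GC content) can be recomputed from the coordinates and the sequence. The following is a minimal sketch, assuming 1-based inclusive coordinates and a sequence printed in space-separated blocks of ten bases, as it appears above. The function names `clean_sequence`, `gene_length`, and `gc_percent` are hypothetical helpers introduced for illustration; they are not part of the database that produced this record.

```python
def clean_sequence(grouped: str) -> str:
    """Strip the whitespace used to group bases into blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_percent(grouped: str) -> float:
    """Percentage of G and C bases in a (possibly space-grouped) nucleotide sequence."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp rows of this record.
    print(gene_length(325251, 327020))  # 1770, matching the Gene Length row
    # Paste the full Gene sequence value here to compare against the GC content row.
    print(round(gc_percent("ATGCTCAAGC GACAATACAT GGGCGGAGAC"), 1))
```

With the record's own coordinates, `gene_length` reproduces the 1770 bp listed under Gene Length. The GC content row can be checked the same way by passing the full Gene sequence value to `gc_percent` and rounding to the nearest percent.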