Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_40984 |
Symbol | |
ID | 5002439 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | - |
Start bp | 119497 |
End bp | 122085 |
Gene Length | 2589 bp |
Protein Length | 815 aa |
Translation table | |
GC content | 58% |
IMG OID | 640417860 |
Product | predicted protein |
Protein accession | XP_001418379 |
Protein GI | 145347862 |
COG category | [R] General function prediction only |
COG ID | [COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.954801 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.413505 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGTGG ACACATTCGT GGATCATCTC AAAACGACCT TGGCGTCGAG CGGGCAAATC GTGCACGTGG AGAACATTCC CGCGAAACCG GCGGCGCGTA CGCCTCTCGC GTCAAAGTTG AGCGACGTCA CGCTCGCGGC CTTCGCCAGT GCGGGTATTG ATATCGAAAG AAAATTGTTC AGTCACCAGG CGCGCGCGAT TGAAACCGTG CTCGACGAGC GCGAACCGCG AAAACACGTC ATCGTCGCCT CTGGTACGGC TTCTGGTAAA TCGGTGTGCT ATAACGTGCC AACGATTGAA ACATTACTCG CGGACCCAAC GGCGACGGCT TTGTACATGT TTCCCACCAA GGCTCTCGCG CAAGATCAGC TACGAGCGCT ACGCAACATA CTGTCGAATA TTCCGAGGAG CGAAGAGACG GTGTTTGAGA TCGGGATGTA CGACGGGGAC GTTCGCGAGG ACGCACGCAC GGAAGTCCGC GAAAACGCAC GTTTAATCAT CACTAATCCA GACATGTTGC ACGTAAGCAT GTTACCGTCA CACAAGGCGT GGGCGCGCGT ATTATCTGGG TTACGATATG TCGTCGTCGA CGAGGCACAC GCATATTCGG GTGTCTTTGG TTCGCACGTC GCGCTCATAA TCAGACGTTT GCGACGGTTG TGCTCGGAAC TCTACGGATC GTCGCCACAG TTCATAATCA GTTCGGCGAC GGTGGCCAAC CCGCTCGAGC ACGCGCGAGA TTTGATTGGA TACGACGGCG TGCACCCAGA GCGCGATGTC ATCGAGGCGG TAACAAACGA TGGCGCGCCT CGCGGCGTGA AGACATTTTT ACTGTGGAAC CCGATGCTAA AACCCGGGCA ATCGAAGCAA ACGAACACCG AACGCAAATT TAGACGCGAG ACTGTGATCG AGCGCGGCAA GGCGGCGCTC GCAAGAAGAC TCCGCGAGCA GCATGGTGCT AATGAAAATG TAGAAGATGA AAAGCACGAC GAAAACGATG CAAAGGATGG CGCAAGAACG TCTCCAGTGG TAGAAATTTC ACAGCTCCTT GCCGAGTGCG TGCAGCACAA CCTACGCTGT CTCGCCTTTT GCAAGACGCG GAAGCTCTGT GAGCTCGTGC TCGTGTACAC GCGAGAGATT TTGCGCTCCT CGGCGCCACA TTTGGCCGAC AAAGTCGCGT CGTATAGAGG TGGATACGAA GCGATTGAGC GACGAGCGAT TGAAAAGGAG CTATTCTCGG GTGTCCTTCT CGGCGTTGCC ACGACGAACG CGCTCGAACT CGGCATCGAC GTCGGTTCGC TTGACGTGAC GCTGCACCTC GGGTTTCCAG GCTCCGTCGC ATCACTTTGG CAGCAAGCTG GTCGTGCTGG TCGGCGTGAA GGGCACGCAC TTTCAGTCTA CGTCGCGTTT GATGGCCCTT TAGACCAGCA TTTCATGCGA CAACCGCGCG CGCTCTTTGA TGCCCCGCTG GAGATGAGTT ACGTGTGGGC CGAGAATCCA ACCATCGTAG AGCAACACCT GGCGTGCGCG GCGTACGAGC GCCCTTTGTT CGCGAATCCC GACGTCGACG AGGTATACTT TGGCCCAAAG ACGCGGGAAA TCGCTGGAAA GCTCGCACGC AACAAGCTCA TGCGAGATGC GCGATCTTTA GTCGCGTGGG ATCTAAACGC CGCCATCGCG CAACCGCTTC TCGCGGCGAC GGACAAGACG CCCGCGCTCG ACGTGAGCGT GCGCACCATC GAAGAGGAAC GCTACGAAGT CATAGATATC GGTGGAGCTC GTGAAAAAGT CATCGCGAGT ATCGAGGCGT CCAAGGCATT CTTTGAAGTG TACGAAGGGG CGGTGTACAC GCACCAAGGG CGCACCTTGC TGTGCACCAA ACTCGATATC CCGCGCCGTC GAGCATTCGT GCGTATGGCG GATGTGAAAT ATTTCACTCG CGTGAAACAT GAGACGACGT GTGCGGTGCC CGGTGGCATG CGTGCGTACG AAGAGACGAA AGATGCGCTG TCGATCAAAT GCGATCAAGT CGACATCCGA ACGACTTTCA CGGGTTTCTC GCGAGTCGCT CGCGGATCGC AAGCAAAGTT TGATCACGAG GCGTTCCCTC CCAGAACGAC CGAATTCCGA ACCGTCGGCA CGTGGCTTCG TATTCCCGAC GACGTCGTCA CCGACGCATC CGACGCCGGA ATTGACCTTC GCGCCGGCGT GCACGCCGCG TCGCACGCCT TGCTCAACGC GCTTCCCCTA AGCGTTCCAT GCGGCGACGC CGACGTCGGC TGCGAGTGCT TCTCCGCTGA CCTGTACCGA CACACCTCGC GATACGCTCC GTTTCGGTTC TTGATCTACG ACCGCCACCT CGGCGGCGTC GGCATCGCCA AGCGCGCGTC CGCGCTCTTC TCCGACCTCG CTCGCGTCGC CGTGGCTCTC GTCGAGCACT GCGCGTGCGC CCGCGCCGAC GGCTGTCCTC GATGCGTCCA GCGCCTGTCG TGCGACGCGC ACAACGCGCA CATCAACAAA AAAGCCGCGC TGTGGCTGCT ACGTCGCGTG TTCGCGAACG ATTTGAAACC TAGCGATTGT GTAGAATAG
|
Protein sequence | MDVDTFVDHL KTTLASSGQI VHVENIPAKP AARTPLASKL SDVTLAAFAS AGIDIERKLF SHQARAIETV LDEREPRKHV IVASGTASGK SVCYNVPTIE TLLADPTATA LYMFPTKALA QDQLRALRNI LSNIPRSEET VFEIGMYDGD VREDARTEVR ENARLIITNP DMLHVSMLPS HKAWARVLSG LRYVVVDEAH AYSGVFGSHV ALIIRRLRRL CSELYGSSPQ FIISSATVAN PLEHARDLIG YDGVHPERDV IEAVTNDGAP RGVKTFLLWN PMLKPGQSKQ TNTERKFRRE TLLAECVQHN LRCLAFCKTR KLCELVLVYT REILRSSAPH LADKVASYRG GYEAIERRAI EKELFSGVLL GVATTNALEL GIDVGSLDVT LHLGFPGSVA SLWQQAGRAG RREGHALSVY VAFDGPLDQH FMRQPRALFD APLEMSYVWA ENPTIVEQHL ACAAYERPLF ANPDVDEVYF GPKTREIAGK LARNKLMRDA RSLVAWDLNA AIAQPLLAAT DKTPALDVSV RTIEEERYEV IDIGGAREKV IASIEASKAF FEVYEGAVYT HQGRTLLCTK LDIPRRRAFV RMADVKYFTR VKHETTCAVP GGMRAYEETK DALSIKCDQV DIRTTFTGFS RVARGSQAKF DHEAFPPRTT EFRTVGTWLR IPDDVVTDAS DAGIDLRAGV HAASHALLNA LPLSVPCGDA DVGCECFSAD LYRHTSRYAP FRFLIYDRHL GGVGIAKRAS ALFSDLARVA VALVEHCACA RADGCPRCVQ RLSCDAHNAH INKKAALWLL RRVFANDLKP SDCVE
|
| |