Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_49799 |
Symbol | |
ID | 5002588 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | + |
Start bp | 491901 |
End bp | 494783 |
Gene Length | 2883 bp |
Protein Length | 933 aa |
Translation table | |
GC content | 54% |
IMG OID | 640418009 |
Product | predicted protein |
Protein accession | XP_001418258 |
Protein GI | 145347614 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4581] Superfamily II RNA helicase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.238463 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGAG ACGAGGGAAG GATCGAGGTC CCCGAAGTCC CGGCGAAGTC GTACGCGTTC GAGCTGGACA CGTTTCAGCA AAAGGCGGTG GAGTGCTTGG AGCGGGGGGA GTCGGTGCTG GTGAGCGCGC ACACGTCGGC GGGGAAGACG GTGGTGGCGG AATACGCGAT CGCGATGGCG ATACGAGACG GACAGCGCGT GGTGTACACG TCACCGTTGA AGGCGCTGAG TAATCAAAAG TATCGCGAGC TGAAGGAGGA GTTTGAGGAC GTGGGATTGA TGACGGGGGA CGTGGTGATA AATCCGAGCG CGTCGTGCCT GGTGATGACG ACGGAGGTTT TGCGATCGAT GCTGTATCGG GGAGGGGAGG TGATGCGCGA GGTTGGGTGG GTGATTTATG ATGAGATTCA TTACATGCGG GACAGCGAGC GAGGGGTGGT TTGGGAGGAG TCCATCGTGT TGTTGCCGGA CATGGTCAAG TACGTGTTTC TATCGGCGAC GATTCCGAAC GCTCGAGAGT TCGCGGAGTG GGTGTGCAAG ACGCATAATC AACCGTGTCA CATCGTGTAC ACTGACTTTC GTCCGACGCC GTTGGAGCAT TACGTCTTCC CGGCGAACGG AGAGGGGATA TTTTTAGTCA TGGATCGGCA GTCCAAGTTC AGGGACAGCA ATTTCGAGCA AGCCGTCACC GTCATCGCCG ACGGCGGCGG CGCGGCGGCG GCACGCGTGG CCAATCGAGC GCGAGGTGAT GATGGAAAAA AGGAGGCAGT TAACCAGGAC ATCTTTAAAA TCATCCGTAT GGTCGTCGAG CGCAACTACG ATCCTGTGAT TGTGTTCGCG TTCAACAAAC ACGAGTGCGA AAAGATGGCT AACTCATTAC ACAAAGTTGA TTTGTGCGAC GAAGACGAGA AAAAGCTCAT CGACACGATT TATTGGAACG CCATGGACTC GTTGTCGGAC GAAGACAAAC GATTGCCTCA AGTCGCAAAT TTGCCAAACC TTTTACGACG AGGTTTGGGT GTGCACCACT CGGGACTGCT GCCGATTTTG AAGGAGGTGA TCGAGATTTT ATTCCAGGAA GGCTTGATCA AAGTGCTCTT CGCCACGGAG ACCATGTCGG TGGGACTGAA CATGCCGGCT CGCACGGTCG TGTTTTGCTC TCCTCGAAAG TTCGACGGCG CTGGTTTTCG TTGGATCACG TCTGGCGAGT ACATTCAAAT GTCTGGTCGA GCCGGTCGTC GCGGCAAGGA CGACCGCGGT TTGGTAATCT TGATGATGGA TGAGCGTATG GATCCGCCGG TGGCGAAGAA CATGTTACAC GGGCAATCTG ATACTCTTGA TAGCGCATTT CACTTGAACT ACGCAATGAT TTTGAACCTG ATGCGCGTCG AAGGTGCTGA GCCAGAGTCG CTCATTCAAT CGTCGTTTGC GCAATTCCAA AACGATCGCG CGTTGCCGGG TCTGGAGGCG AAGATTGTCG AGATACAGAA GGATCGGGAC GCGGTGAAGA TTCACGACGA AGACAGCGTC GACGAGTATG TCAAGCTCAA AGACGGCCTC GATGCCATGA TTCGAGAACG TCGTGTCGTC ACCAACACTC CAACGCACGC GGTGCCGTTT TTGCAGCCTG GTCGATTGGT GCGCGTGTGC ACAAAATCGC CGTCGATTTC GTCTACGTAT GACGAGGAAG ACGATTCGAT TAGAATTCCC GTACCAGGAA CGGAGCCGGG TGAAGAGGAC GTCGTTTGGG GCATGATTGT GTCTTTCGAG CGCATCGGTG GCGGTGGAAA ATCTGGGAAA GCGGCGTACG GGGTCGACGT TCTGGTGCGC ACGCGCGAGA ATAGTGACGG TAAGACTCCG TTATCATCAA AGAGTAAAAA CGATAGATAC GAGTTTTTGA ATGCGAACGA GGAGGACGAT TCGTCGGAGC CGCGGGTGAT TCGAGTGCCC CTCGAGCAGC TCGATGTATT GAGCAGCGTT CGCGTATACT TGCCGAAAGA CTTGCATCCA CGCGAGGCGA GAGATCAGTG CATTAGCAGT GTGGGGGAGG TCATCAAGCG GTTCCCTGAT GGCGTGCCGG TTTTGGATGC CACGCGAGAT CTGAAGATCG ATAGTGAAAA CTTCTCCAAG CTCTTAAAGC GAATCGACGG CATCAAGTCG ATGATGAAGA AGCACCCCGT CGCCTCAAGT GAAAGGCTCG TCGAACAGCT CTCAGCGCAC AAGAGAAAAC GTGAGCTCTC CATCGCACTG AAGCAGGCGA AGAAAAACGC AAAGGCTGCC GCTGGACTGA TCATGCGCAA TGAGCTGAAG CAGATGCGTC GCGTGCTCAA ACGACTCGGG CACACGAGCG CTGAGGGCGT GGTACAAACG AAGGGAAGAG TGGCGTGCGA ACTCGCCTCG GTCGACGAGC TCGTCACCGC GGAGCTCATC TTCAACGGTA TGTTCAAAGA AGTCGATGTT GATATGCTCG TCGCTTTGGT TTCGTGCTTG GTGTGGCGCG AGAAGTCGCG CAACACACCC AAGCTTAGCG AAGAAACCGC GGAAGTGTTT TCGCGCCTGA AGGATGTTGC GCGCAAAGTC GGGAAACAAA TGATGGAGTG TAGGATGAGC GTGGACGTCG AAGAGTACGT AGAGGGCTTT AGGAGCGAGC TCATGGAAAT CATGCTCGCG TGGTGCAAAG GGAATAAATT TGCAGAGATT ATGAAAATGA CAGATTTGTT CGAAGGTTCC ATAGTGCGCG CCATTCGGCG TGTCGAGGAG GTTTTGCGCC AACTGTCTGA CGCGTGTCGG GTCATAGGCG AGACTGAACT TCAAGAAAAG TTCACAATCG CGAGCGAAAA AGTGAAACGC GACATAGTGT TCGTCGCGAG CTTGTTCCTT TAG
|
Protein sequence | MTRDEGRIEV PEVPAKSYAF ELDTFQQKAV ECLERGESVL VSAHTSAGKT VVAEYAIAMA IRDGQRVVYT SPLKALSNQK YRELKEEFED VGLMTGDVVI NPSASCLVMT TEVLRSMLYR GGEVMREVGW VIYDEIHYMR DSERGVVWEE SIVLLPDMVK YVFLSATIPN AREFAEWVCK THNQPCHIVY TDFRPTPLEH YVFPANGEGI FLVMDRQSKF RDSNFEQAVT VIADGGGAAA ARVANRARGD DGKKEAVNQD IFKIIRMVVE RNYDPVIVFA FNKHECEKMA NSLHKVDLCD EDEKKLIDTI YWNAMDSLSD EDKRLPQVAN LPNLLRRGLG VHHSGLLPIL KEVIEILFQE GLIKVLFATE TMSVGLNMPA RTVVFCSPRK FDGAGFRWIT SGEYIQMSGR AGRRGKDDRG LVILMMDERM DPPVAKNMLH GQSDTLDSAF HLNYAMILNL MRVEGAEPES LIQSSFAQFQ NDRALPGLEA KIVEIQKDRD AVKIHDEDSV DEYVKLKDGL DAMIRERRVV TNTPTHAVPF LQPGRLVRPG EEDVVWGMIV SFERIGGGGK SGKAAYGVDV LVRTRENSDG KTPLSSKSKN DRYEFLNANE EDDSSEPRVI RVPLEQLDVL SSVRVYLPKD LHPREARDQC ISSVGEVIKR FPDGVPVLDA TRDLKIDSEN FSKLLKRIDG IKSMMKKHPV ASSERLVEQL SAHKRKRELS IALKQAKKNA KAAAGLIMRN ELKQMRRVLK RLGHTSAEGV VQTKGRVACE LASVDELVTA ELIFNGMFKE VDVDMLVALV SCLVWREKSR NTPKLSEETA EVFSRLKDVA RKVGKQMMEC RMSVDVEEYV EGFRSELMEI MLAWCKGNKF AEIMKMTDLF EGSIVRAIRR VEEVLRQLSD ACRVIGETEL QEKFTIASEK VKRDIVFVAS LFL
|
| |