Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_50709 |
Symbol | |
ID | 5004243 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | + |
Start bp | 18547 |
End bp | 21889 |
Gene Length | 3343 bp |
Protein Length | 1055 aa |
Translation table | |
GC content | 52% |
IMG OID | 640419664 |
Product | predicted protein |
Protein accession | XP_001419877 |
Protein GI | 145351001 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4581] Superfamily II RNA helicase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.918381 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGTCG CGAGCGTCGC GCCGTTGCGT ACGTCGTTTC GTGGCGTGAA CGCGAACGGG AGCGACGCCA CCGGGGAGCG TCGTCGCCTC CTCTCGTTTC GTCGCGCTCC ATCGCGCGCG CGCCGCGATG AAGCGGTAAA CGCTCGCGTT CGAGCGAGTG GGACGTCTGT AGAACCGAGT GTGGATGAAG ACGATGAAGA AGTCGACGTC ACGGTGACGT CTACGTCGGA CCGGTTGGAG GCTCTTTTAA ACAAAGCTTC CACGTCGCCG GTGAACGTGC ATGACATAGA ACAGTTTTAC CCGTACGAAT TAGATGGATT TCAAGTCGAA GCCACTGAAT TATTGCTACG CGGGTCATCG GTCGTGGTGT CAGCGCCGAC GGGGAGCGGG AAAACTTTAG TCGGTGAGAC GGCGATTTTA ACTGCGCTGG CGCGCGGAGA AAAGGCGATA TACACGACGC CGTTGAAGGC GTTATCGAAT CAAAAGCTCA GAGAGTTTCA AAAGATTTTC GGTAAAAGAC GGTGTGGGTT AAAGACTGGT GATGTGGACA TCAACGGCGA TGCTGACGTC ATGATTATGA CGACAGAAAT TTTGAGGAAT ATGCTCTATT CAAGCGCTGC AGGTGGCCGC GACGACGAGC GACTCGCCGA CGTGAGTATC ATCGTTCTCG ATGAGGTGCA CTATCTCGCA GATAGATCCC GAGGAACAGT GTGGGAAGAA ACAATTATTT ACTGTCCTTC TCGCATTCAA CTTTTGTGTC TGTCCGCGAC AGTCGGTAAT CCAGAGGATT TGTCGGGCTG GATTGAGGAA GTTCACGGCG AATGCGAAAC CGTCGTGTCA AGTTACAGAC CTGTTCCCCT CACTTGGCAA TACAGCATGA AGCCGTCGCG CATGTATCCA GGGTTGGGAC CTTTGATGAA TTTCAAATCC ACGAAAATTC ACCACGATCT ACGACCTTTT ACGCGCGAAG GTCTTCAACA AGGATCGTAC GGTAACAACG ATTGGGCACC GGATGCGCAA AGAGGCGCGA AAGAGTCTGA GCGCGTTCTC AGAAGGCGCT TCGTGCCTCA CGTCGAAACC ACGGTACAGC AGCTCATCGC GAGCGATATG ATTCCCGCCG TGTGGTTTAT TTTTAGTCGT AAAGGTTGCG ACCAATCTGT CGATTATCTC GTGCAGGCCG GCGGAAATCT CGTGACGAGC AAGGAACGGC GAGAGATTGA TGATGCGTTG AAAGAATTTT CGGAGAAGAA CAAATCTGCT GTGCGAGCAA GCATGGTTGA GCCTCTAAGA CGTGGCATAG CGTCACATCA CGCCGGGTTG CTTCCAGCGT GGAAAGGTCT AGTAGAGAAA TTGTTTCAGC GAGGACTAAT CAAGGTGGTT TTCGCTACGG AAACACTCGC CGCAGGAGTG AACATGCCGG CACGATGTTC CGTGTTAAGT GCGCTTTCCA AGCGAGATGA TCAAGGTCCA CGACTACTAA CTTCCAATGA GTTCATGCAA ATGGCTGGTC GAGCGGGCCG TCGTGGGTTT GACACCGTTG GCCACGTCGT GTGCTGTCAG TCACCATTTG AGGGCCCGGA CGAAGCTTTC GAGCTCGTTC TTGCGCCACC TGAAAACTTG AAGTCGCAAT TTTCTATTTC TTACGGTATG GTTTTGAATT TACTCCAAGG TAGAACGCTC GATCAAGTCA AGGGAATCGT GGAGAGAAGT TTTGGCAACT ACCTTGGTGG TAAGGCGCGT TCGATGCGCG AGCGCGAACT TCTTCGTGTC AATGATCAGA TCAGAAAACT GGTGAGCGAG ATGGAGACAC TCGATGATGA TGAAGAAGCC GCGGAGTGGA GACGTTTCGT GAAGCTGGAT GAACGGCTAC ATGAAGAAAA GCGCTTGCTA AAAATCTTGA TTCGGCAATT AGCTGAAATG CGGGCAATTG AAACGCGGGA TCAACTACAG TTCGAGCTCG AACAGACGGG TGCTCCAGTC ATCGTTACGA TTGATATTGG TGACAACGTT CTCAAACGGC GCAAAGAACG ACGATCGGCT ACGATCGCTC TATTTGAAGA TGATTCTTTG AAGATGGAGG GTCAAGACTT TGCGGGCGAA TGGAGATTGC AAGATATTGG TGGCGACGAT GCTCCCGGTT TGGACGAGTT ATTCGGCGAT TCCGACGATG AAAGTCAAAA CGATGATTTT TTGCAGTCTT TTGACGACGA CGACGGTAAG TTCCCCAGAG GTCTCATCAC CGCCGCAATC GTCGAGGCTG TTCCGGCGAT GAAAATTGCG GCGACTGCGA GCACAATCGG TAAGCCATAT CCGATGGGCG AATTCACCGC CCTAGGAAAT GATGGTGTCT GGTACCGTTT GTACTCCGAT CGAGTAAAAT CTATAAGCCT CGGCGCCGAT GCGGTGCGCT TGGAAAGTTT TGGCGACATC GGCGTGCCGC CGGCTTCGAG CAGTCTTCGA TGGATCCGCG CGAGCGGCGG AGGTTTGTGG AAGGCGGATG TGTCTAAAAA GACCAAGTTG GTCGCTAATG GGATACCAAC TAACTTGAAC GACTTTGAAA TGATCGTCGA ATCGTCGGAT TCGATGGAAT TTATCGACGC GCAGAAGTTA CAGATTCAAA AGACGCGCGA GGAGATAAAC GGCTTGAAAA ACATCGCCAC CTTGCGCCGC GCTGCGAAGC AGCAAAAGCG GGCGGAGACG AAACTGAAAA AGCTAAAAGA AAAACGTGAT GGAATTGAAA AACGTATCAA AGAGTATTCT GCCGCGGGCT GGGACGATTT CTTGAGAGTT GTCGATATTC TTGTCGAGTG CGGCGCGATC GAGAGAGACA CCCTAAAATT GTTGGAGTTT GGTGAGACCT GTGCTGACTT GAGAGGGGAA AATGAATTAT GGCTTGGCAT GGCTATGTCT TCGCCGAGTA TCGAGAATTT GGACGCCGCG ACTCTTGCAG GTTTTGCAGG GGCGCTCTGT ATGGACAACC GTCCGGCTAC ATGCTACTAC GGCGCTTCGC AACACCTCGT CGAAGTGCTT GAAGAGCTCG AACCGGAGAT GGGCGACCTT CAGTACTTGC AACAATCTTC TCGAATCGAC ATGCCTCTGA GCTTGAGTTT CGAGATCGCG GCGTTGGTAG AGTCATGGGC ATCGGGAACG TCGTGGGACC AGATACGCCG TGATACTTCC TTAGATGAGG GAGACATCGC TAGATTGTTT CGACGAACTG CAGAACTTCT TGCGCAAATT CCGCGCACCG CACATCTACC GGAGAGTCTC AAAGCGACTG CAAAGAAGGC GAACGATGTC GTCAATAGAC CTCCGATTAG TGATCTTTCT TGATCATTAT CAT
|
Protein sequence | MPVASVAPLR TSFRGVNANG SDATGERRRL LSFRRAPSRA RRDEAVNARV RASGTSVEPS VDEDDEEVDV TVTSTSDRLE ALLNKASTSP VNVHDIEQFY PYELDGFQVE ATELLLRGSS VVVSAPTGSG KTLVGETAIL TALARGEKAI YTTPLKALSN QKLREFQKIF GKRRCGLKTG DVDINGDADV MIMTTEILRN MLYSSAAGGR DDERLADVSI IVLDEVHYLA DRSRGTVWEE TIIYCPSRIQ LLCLSATVGN PEDLSGWIEE VHGECETVVS SYRPVPLTWQ YSMKPSRMYP GLGPLMNFKS TKIHHDLRPF TREGLQQGSY GNNDWAPDAQ RGAKESERVL RRRFVPHVET TVQQLIASDM IPAVWFIFSR KGCDQSVDYL VQAGGNLVTS KERREIDDAL KEFSEKNKSA VRASMVEPLR RGIASHHAGL LPAWKGLVEK LFQRGLIKVV FATETLAAGV NMPARCSVLS ALSKRDDQGP RLLTSNEFMQ MAGRAGRRGF DTVGHVVCCQ SPFEGPDEAF ELVLAPPENL KSQFSISYGM VLNLLQGRTL DQVKGIVERS FGNYLGGKAR SMRERELLRV NDQIRKLVSE METLDDDEEA AEWRRFVKLD ERLHEEKRLL KILIRQLAEM RAIETRDQLQ FELEQTGAPV ILFGDSDDES QNDDFLQSFD DDDGKFPRGL ITAAIVEAVP AMKIAATAST IGKPYPMGEF TALGNDGVWY RLYSDRVKSI SLGADAVRLE SFGDIGVPPA SSSLRWIRAS GGGLWKADVS KKTKLVANGI PTNLNDFEMI VESSDSMEFI DAQKLQIQKT REEINGLKNI ATLRRAAKQQ KRAETKLKKL KEKRDGIEKR IKEYSAAGWD DFLRVVDILV ECGAIERDTL KLLEFGETCA DLRGENELWL GMAMSSPSIE NLDAATLAGF AGALCMDNRP ATCYYGASQH LVEVLEELEP EMGDLQYLQQ SSRIDMPLSL SFEIAALVES WASGTSWDQI RRDTSLDEGD IARLFRRTAE LLAQIPRTAH LPESLKATAK KANDVVNRPP ISDLS
|
| |