Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_42421 |
Symbol | |
ID | 5003109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 543937 |
End bp | 546906 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | |
GC content | 54% |
IMG OID | 640418530 |
Product | predicted protein |
Protein accession | XP_001419195 |
Protein GI | 145349553 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1643] HrpA-like helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0589659 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.298974 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAACG CAAACAAGTT TGTTACCGAA GGTACCGCCG TCGCGGCCGA AGATGACGAC GCGCCAGCCC CGGATGACGA CGCGGAGAAG GTTTTGGATC GCGCGTGGTA CGACGACGAC GAGGGCGGCG GCGCGCACGG CGACGCGCAC AACCCTTTTA ACACGAACGC ACGGGATGAG GCGCGTTACG CGAACAAAGA ACAAGAATAC GCGAAAAGGT TGACTCGACG CGACGGGTCG CTCATGTCCA TGGCTGCGTC GCGACGCGTC AGTCAACTCA ACGCCGATTC AAATCAATGG GAAGAAAATC GTATGATGAC GTCCGGTGTG ATTCGCACCA AGGAAATTGA TTTGGATTTT GATGACATGG AAGAAAACCG CGCGGTTTTG CTCGTGCACG ACACGAAACC GCCATTCTTA GACGGCCGTA TGGTGTTCAC GAAGCAGCAA GAGACTGTCG TACCGGTGAA GGACGTCACG AGCGACATGG CGCAAATCGC GCGCAAAGGA AGCGCGTTGG TGAAGGAAGT GCGTACGAAG CGAGAGGAGA ACAAAGGTCG GGATCGATTT TGGGAAATGA AAGGGTCGAA GATGGGATCG ATCACGGGTA CGACACAAGC TGAAAACAAG GAAGCCGCGG AAAACGCGCA AGCGGCGAAA GGTCGCGATG ACGACAGACC AGACGTCGTC GGCGCGGACG GCGAAATCGA TTTCAAGGCT GGCGCCAAGT TTGCCGAGCA CATGAAAGGT TCGAAGGCGA GCGCACAAAG CGAGTTCGCG AAGACGAAGA CGATCAAAGA ACAGCGTGAG TTCTTACCTG TGTACGGTTG TCGCGAAGAC TTGATGCATG TCATTCGCGA AAATCAAATC GTAGTCGTCG TCGGCGAAAC CGGAAGCGGT AAGACGACGC AAATGACGCA ATACATGCAC GAGGAAGGTT ACTCCACATT CGGGATGGTC GGTTGCACTC AACCCCGTCG TGTAGCTGCA ATGAGCGTCG CGAAGCGTGT GAGCGAGGAA ATGGGCTGTG AACTAGGTAA GGAAGTCGGT TACGCCATTC GATTCGAGGA CTGCACGGGG CCTGATACGA TTATCAAGTA CATGACGGAT GGCGTGCTTC TTCGAGAAAC TTTGCGCGAA CCTGATCTTA ACATGTACAG CTGTATCATC ATGGACGAAG CGCACGAACG ATCGTTACAC ACTGACGTTC TATTCGGTAT TCTGAAGAAA GTTGTCGCGC GCCGTCGCGA TTTCAAGCTC ATCGTCACGT CGGCGACGTT GAACGCAGAA AAGTTTAGTA ACTTCTTTGG ATCGGTGCCG GTTTTCCACA TTCCTGGTCG CACGTTCCCG GTCGATATTC TGTACTCCAA GACACCCGTG GAGGATTACG TCGAAGCTGC GGTGAAGCAA GCGCTCACTG TGCATCTCTC GTCGGGACCG GGTGACATTT TGATCTTCAT GACGGGTCAA GAAGAAATCG AGACGGTGAC GTACACGTTG GAAGAGCGCG TCGAGCAGTT GATGAGCGAA GGCACGTGTC CACCGCTGAA CGTTTTACCA ATCTACTCAC AACTCCCGAG CGATTTGCAG GCGAAGATTT TTCAAGACGC AGAGGATGGT AACCGAAAGT GCATCGTCAG TACGAACATC GCGGAGACGT CGCTCACGCT CGACGGCGTC ATGTACGTCA TCGACAGTGG TTATTGCAAA CTTTCAGTGT TTAATCCTCG AATGGGTATG AATGCTTTGC AAGTTTTCCC TTGCGCGCAA GCTGCGGTGA ATCAACGCAG CGGCCGCGCC GGTCGTACTG GACCAGGGAC GTGCTATCGC CTGTACACGG AGATGGCGTT CAAGCACGAA ATGCTCGTCT CGACGGTTCC CGAGATTCAA CGCACCAACT TGGGTAACGT CGTGTTACTT TTGAAGTCGC TCAACGTGGA TAACTTGTTA GATTTTGACT TCATGGATCC TCCTCCCCAA GAAAATATCT TGAACAGCAT GTATTCCCTG TGGATTTTAG GCGCGCTCGA CAACACTGGC GGGCTCACGA AACTCGGCTC GAAGATGGTT GAGTTTCCCG TCGACCCGCC GCTGGCGCAG ATGCTCATCA AAGCGGAAGA AACGGGCTGC TCGAACGAAA TGCTCACCGT CGTCGCGATG TTATCGGTTC CGTCAGTGTG GTTCAGGCCG AAGGATCGAG AGGAAGAATC CGACGCCGCG CGCGAAAAGT TCTTCGTTCC CGAAAGCGAC CACTTGACGT TGCTCAACGT GTACCAGCAA TGGAAAAATA ACGGGTACAG GAACGATTGG TGCAACAAGC ATTTCATTCA GGGCAAAGGT CTGAAGAAAG GTAGAGAGGT GCGCGCGCAA TTGATGGATA TCATGAAGCA ACAGAAAATC CCGCTCGTGA GCTGTGGGCA AGATTGGGAC GTCTGCCGTC GATCCATCGC CGCCGCGTAC TTTCATCAAG CGGCGCGTTT GAAAGGCGTC GGTGAGTATG TCAATGCTCG CAATGGTATG CCTTGCCACC TTCATCCGAG CTCAGCGCTT TATGGTCTGG GTTACACTCC TGATTACGTC GTATACCACG AACTCATCAT GACATCGAAA GAATACATGC AATGCGTCAC CGCCGTCGAA CCGCACTGGC TCGCCGAATT CGGACCGATG TTTTTCACGC TCAAGGAGAG CCATTCGAGC ATGTTGAAAT CAAAGGCGAA GCGCAAAGAG GACAAGGCGA AGATGGAGGC TGAAATGCAA GCTAAACGCG ATGAGGAAGC ACAGCTGCAA GAAGCGCAGC GCACGCGAGA AGAAGATCGC CGCGCGAGAC AAAGGAGTCA AATCGTGACG CCGGGGCAAC GCAGCGCGGC GACGACGCCG CGCGTAGATT ACGGAACGTC GCGTCCCCCA TCGAGCGTTC GGCGCGGTGC GGGTGGCCGG ACGCCGGGAA GAAAACGCTT TGGACTATAG
|
Protein sequence | MANANKFVTE GTAVAAEDDD APAPDDDAEK VLDRAWYDDD EGGGAHGDAH NPFNTNARDE ARYANKEQEY AKRLTRRDGS LMSMAASRRV SQLNADSNQW EENRMMTSGV IRTKEIDLDF DDMEENRAVL LVHDTKPPFL DGRMVFTKQQ ETVVPVKDVT SDMAQIARKG SALVKEVRTK REENKGRDRF WEMKGSKMGS ITGTTQAENK EAAENAQAAK GRDDDRPDVV GADGEIDFKA GAKFAEHMKG SKASAQSEFA KTKTIKEQRE FLPVYGCRED LMHVIRENQI VVVVGETGSG KTTQMTQYMH EEGYSTFGMV GCTQPRRVAA MSVAKRVSEE MGCELGKEVG YAIRFEDCTG PDTIIKYMTD GVLLRETLRE PDLNMYSCII MDEAHERSLH TDVLFGILKK VVARRRDFKL IVTSATLNAE KFSNFFGSVP VFHIPGRTFP VDILYSKTPV EDYVEAAVKQ ALTVHLSSGP GDILIFMTGQ EEIETVTYTL EERVEQLMSE GTCPPLNVLP IYSQLPSDLQ AKIFQDAEDG NRKCIVSTNI AETSLTLDGV MYVIDSGYCK LSVFNPRMGM NALQVFPCAQ AAVNQRSGRA GRTGPGTCYR LYTEMAFKHE MLVSTVPEIQ RTNLGNVVLL LKSLNVDNLL DFDFMDPPPQ ENILNSMYSL WILGALDNTG GLTKLGSKMV EFPVDPPLAQ MLIKAEETGC SNEMLTVVAM LSVPSVWFRP KDREEESDAA REKFFVPESD HLTLLNVYQQ WKNNGYRNDW CNKHFIQGKG LKKGREVRAQ LMDIMKQQKI PLVSCGQDWD VCRRSIAAAY FHQAARLKGV GEYVNARNGM PCHLHPSSAL YGLGYTPDYV VYHELIMTSK EYMQCVTAVE PHWLAEFGPM FFTLKESHSS MLKSKAKRKE DKAKMEAEMQ AKRDEEAQLQ EAQRTREEDR RARQRSQIVT PGQRSAATTP RVDYGTSRPP SSVRRGAGGR TPGRKRFGL
|
| |