Gene Information |
Locus tag | OSTLU_34459 |
Symbol | |
ID | 5000647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 767665 |
End bp | 770475 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | |
GC content | 54% |
IMG OID | 640416068 |
Product | predicted protein |
Protein accession | XP_001416755 |
Protein GI | 145344470 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1643] HrpA-like helicases |
TIGRFAM ID | |
|
|
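The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS includes its stop codon (the gene sequence in this record ends in TAG). The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends in a stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the final codon is the stop codon
    return gene_len, protein_len


# Values taken from the record above.
print(cds_lengths(767665, 770475))  # (2811, 936)
```

Under these assumptions the result matches the listed Gene Length (2811 bp) and Protein Length (936 aa).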
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.318695 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCCCG CGCTCGAGCC AGCGAAGAAG AAAGTTACAG GCGCGAAGGC GATTCCAGCA CCAAAAGGAG GCATTCTGGC TCCACCAACG CAGACCAAAG AAGAGCGCAA GAGAGAACTC GAGCGTCGAC GCGAACTTGC AGCCGAAGCC GAGAAGACGA GAGCGCGAGT CGAGGCGCAG ACAGAGGCGG AGGCACAGAC GATTGCCGAA GCGAAGACGA AGAAACTCGC CGAGGCAATC TTAAAGGAGG AAGAGAAGCG CGCGAACGAC CCGCGCGTGA TCGCCGAAAG CGAGCGACTG AAAACGGAGT TGACCAAATT CTGGAACGAT AAGAAGGACT CTCCGATACT TAAACAGCGT CAACGTCTTC CGGCGTGGGC AAAGCAACAA GAACTCATCG ACGCCGTCGA GCGCAACCAA GTGCTCATCG TCGCCGGCGA AACGGGTTGT GGTAAGACTA CACAGCTGCC GCAGTTCATT CTTGATAACG CTATCTGGCA AGGACGTGGA GCGATGACAA ACATGATTTG CACACAGCCG CGTCGAATCA GCGCCACGAG CGTGGCGAGT CGCGTGGCGA GTGAGCGAGG TGAACAAATT GGCAAAACAG TGGGCTATAA AATCCGTCTT GAAGGCTCGA TGAGCTCGAG CACGCGAATT TTGTTTTGCA CCACGGGCGT GTTACTGCGT CGTTTAACCG AAGACCCTTT ACTGAGCGGT ACGAGTCACG TGATAGTCGA CGAAGTCCAC GAGCGCTCGC TCGATTCTGA TTTCTTACTC GTCTTGTTGC GCGATATTTT GCCGCATCGA CCGACACTCA AAGTGGTGCT GATGTCAGCC ACGTTGAATG CGCTCGCGTT TGAAGACTAT TTCAAGGGCG TTTCTGCGGT GAGCAAAATC CCTGGGTTCA CTTATCCGGT GAACGAGCAT TATCTCGAGG ACATCTTACA AGTGACCGAG TATCAACCAA ACCCAGGGAC GGAGTATTTT AAAAAGGCGC CGCGACGCAG GGACAACTTT GACGCCTCTA GTCGACCGGT GTCAAGTAAG GATGGCGACA TACCGGATGA AGACTCGTTC AACATTACGT TGCGGGACAA GGGCTACGGT GACAACGTCG TTCGCGCTCT CCGAAACCTC GAACAGGGTT TGATCAATTA CGAGTTGATG ACGCTTTTAA TCTCGCACAT TTGCGAGTCG ATGGACGAAG GCGCAATACT GGTATTTATG CCCGGTTTGG CGGAGATCAC GAAGCTTTAC GAGGCGTGTG GGGCGAATCC TACGATCAAC GCGGCAACGT CTGGAGGAAA ATACCTGATT GCTTTGCACA GCACGCTGAG CACGGCCGAG CAGAGCATCG TATTCGATCA CGCCCCAGAC AGCGTGCGTA AGATAGTCAT CGCAACAAAT ATCGCAGAGA CGTCGATCAC GATCGACGAT GTCGTGTACG TAGTCGACAG TGGCAAGTGC AAAGAAAACG GCTACGATCC AAACACTCGA ATGCAGCTGC TTCTCGAGCA GTGGGTGTCT CGTGCGAGCG CTCGACAGCG TCGCGGTCGC GCGGGACGCG TGCAAGCCGG CCGATGCTTC CGAATGTACA CGCGTCACGT GCACGATACG GTCTTCGCTG AACACACGCT TCCTGAAATC AGACGCGTGC CTCTAGAAGG CTTGTGCTTG CAGATTCAGC TACAACGCAT GGCTGGGGGC ATCGCCGGAT TTCTCGGCAA GGCGTTGGAG CCTCCGAAGG TTGAGTCTGT GGAGGCGGCG GTGGCGTCTC TCAAACGCCT TGGTGCTTTG 
GATGAGCGTG AATGCCTGAC CCCACTCGGT CAACACTTGG CTACGTTACC CGTGGATGTT CGCGTGGGTA AGATGCTCCT CTACGGCTCC ATGCTTGGGT GCCTGGATCC CGTGCTCACG ATTGCTGCCG TCTTGAGCGG TCGTTCGCCG TTTGTGGCGC CGCTCGACAA ACGCGACGAA GCAGATCTCG CCAAAAAGCT CTTCGCCGAA GATCAATCGG ATCATCTCAC GATTCTGAAT GCCTATAATG GCTGGCAAGA TGCGAAGAAG CAAGGACGAT CGTCTGAGTT CGCGTTCACC CGCGAAAACT TTCTCTCGTG GAGAGCACTC GAGGGCATCG CAGATCTGCG GAACCAATTT ACACAACTTC TGAATGAATC AGGTTTCCTG GGATCGTCGT CGAAGAAGAA AGGTGGTGGA CGGTATCGCG GTCGCCAGCG CGGTAATGTC TTGGAAACCG ACGTAGATTG GATTCGAGCG AATCGAAACT CGGAGAATAA ACGACTGTTA AAGTCGGTTC TCGTGGCTGG GTTGTATCCA AACCTCATCA AAGTCGACCC CGGTTCTCGT CCAGATGCCC CACCCCGTCT GTCTTTCCTC GCCGAGAACG GGCGGACGGA AAAAATCCAA ATTCATCCAT CGAGCATCAA CTTCGAGGCG AAGAAGTTTA TCACCAAGTG GCTGGTGTAC CACGAGCGCG TGCAGACGAC GGCAATCTTC GTGCGCGATT GCACGGCGGT GACGCCTTAT CAACTTCTAT TGTTCGGGGG GAAAATCGAA GTGCAACACA CGCAAGGAAC GATAAGTATC GATCGCTGGG CGACGTTCCA AGCTCCAGCA AAAGTAGGAG TTTTGCTCAA AGAGATCCGA AACCAGCTTG ACCGCGTGCT GGCGCAAAAA ATTGAAAACG TCGGCAAGGA CGTCGGCGAA CTTTCCAACC CTCTCGTGCT CACGATTCTC GAGCTTCTCG ATTCGGAGAA GATCGCGAAG ATGTCCACAA AACCAAATTA G
|
Protein sequence | MAPALEPAKK KVTGAKAIPA PKGGILAPPT QTKEERKREL ERRRELAAEA EKTRARVEAQ TEAEAQTIAE AKTKKLAEAI LKEEEKRAND PRVIAESERL KTELTKFWND KKDSPILKQR QRLPAWAKQQ ELIDAVERNQ VLIVAGETGC GKTTQLPQFI LDNAIWQGRG AMTNMICTQP RRISATSVAS RVASERGEQI GKTVGYKIRL EGSMSSSTRI LFCTTGVLLR RLTEDPLLSG TSHVIVDEVH ERSLDSDFLL VLLRDILPHR PTLKVVLMSA TLNALAFEDY FKGVSAVSKI PGFTYPVNEH YLEDILQVTE YQPNPGTEYF KKAPRRRDNF DASSRPVSSK DGDIPDEDSF NITLRDKGYG DNVVRALRNL EQGLINYELM TLLISHICES MDEGAILVFM PGLAEITKLY EACGANPTIN AATSGGKYLI ALHSTLSTAE QSIVFDHAPD SVRKIVIATN IAETSITIDD VVYVVDSGKC KENGYDPNTR MQLLLEQWVS RASARQRRGR AGRVQAGRCF RMYTRHVHDT VFAEHTLPEI RRVPLEGLCL QIQLQRMAGG IAGFLGKALE PPKVESVEAA VASLKRLGAL DERECLTPLG QHLATLPVDV RVGKMLLYGS MLGCLDPVLT IAAVLSGRSP FVAPLDKRDE ADLAKKLFAE DQSDHLTILN AYNGWQDAKK QGRSSEFAFT RENFLSWRAL EGIADLRNQF TQLLNESGFL GSSSKKKGGG RYRGRQRGNV LETDVDWIRA NRNSENKRLL KSVLVAGLYP NLIKVDPGSR PDAPPRLSFL AENGRTEKIQ IHPSSINFEA KKFITKWLVY HERVQTTAIF VRDCTAVTPY QLLLFGGKIE VQHTQGTISI DRWATFQAPA KVGVLLKEIR NQLDRVLAQK IENVGKDVGE LSNPLVLTIL ELLDSEKIAK MSTKPN
|
| |
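The sequences above are displayed in space-separated blocks of 10 residues, so the whitespace has to be removed before any computation. As a minimal sketch, the helper below (a hypothetical name, `gc_percent`) strips the spacing and reports the GC percentage of a nucleotide string. It is shown on the first two blocks of the gene sequence only; it is not a claim about how the database derives its GC content field.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace (such as the 10-base block spacing) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence in this record.
first_blocks = "ATGGCGCCCG CGCTCGAGCC"
print(gc_percent(first_blocks))  # 80.0
```

Applying the same function to the full gene sequence, with both display lines joined, would give a value to compare against the GC content field.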