Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_42450 |
Symbol | |
ID | 5003260 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 187172 |
End bp | 190168 |
Gene Length | 2997 bp |
Protein Length | 975 aa |
Translation table | |
GC content | 56% |
IMG OID | 640418681 |
Product | predicted protein |
Protein accession | XP_001419304 |
Protein GI | 145349776 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1948] ERCC4-type nuclease |
TIGRFAM ID | [TIGR00596] DNA repair protein (rad1) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.488375 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCTCG TCCTCGCCGA CGCGAGTCCG AGCGAAGCGG TCGTCGACGG GGCCGACCTT TTGCCGTTCC AGCGCGAGAT CACCAAGGAA CTGCTCGCGC GCGATGGTTT TTGCGTTCTC GCCGAAGGAC TCGGTGCGAG CGCAGTCATC GCCGCGCTCG TCGCCGTCGA CGACGCGCTG TCTAAGACGC ACGTTCTAGG CGAACCACCC ATGGTGACGC TCATCGTCGG TGCGAGTGAG CACGCGAAAG TGAGCGTGAA GGAGCGTATG ACGGCGCTGT TTCCGCGCGC GGCGCCGCCG CTGGAGTTCA CCGCCGACTA CGCGGGCGAC AAGAGGAAAA AGTTTTACGA CGCGGGATGC GTGGCGTTCG TGACGACTCG AATCGCGAGC GTGGATTTGT TGAGTGGGAG GCTGGACGCG AAGCGAGTGC GAGGGATCAT AGTGTGCTCG GCGCATAGGA CGAACGAGAC GTCGGGGGAG GGGTTCGTCG TGCGGTTGTT TAGAGAGGGG AATAGGAAAG GATACGTGCG GGCGATCAGC GATCGCCCTG GGGATTTGAC GCGCGGCTTT AATAGCGTGG AACGATGTTT GAAGGCGCTG ATGCTCACGC GCGTGCATCT GTGGCCGCGA TTTCATTTGC GAGTGAAGGA CGATTTAGAC GCGCGTCCGC CAGAGGTGAT CGAATTGAGA CAGCCGATGA GCGAGAACGT GTTGAAGATT CAAGAAGCGA TCGTCAGCGT CATGGATTCG TGCATGGCTG AGTTGAAGAA AAGTCGCTAC ATCGACACGA GCGATTTAAC GCTCGAGAGT GGGTTGTTCA AGAGCTTCGA TTTGATTTTG CAGCGACAAC TCGACAAGGT GTGGCACATC GCGCCGAGAC GAGTGAAACA GATCGTGTAC GATTTAAAGA CGCTCCGCTT ACTCGCAGAT GCGTTGCTGA AGTACGATTC AGTCACGTTT TTGAAGTACT TGCAGGCGTT GCGAGCGAGT GAGTCACGTG AGAGCATGTG GATGTTCACA GAGGCCTCGC ACGCGATTTT CGAGTACGCG AAAAAGCGAG TTTACCTGCT CAAGCGTAAA GCCGCGGCGG CGCAACCGAA AGGCTTGGGC GCAAAGCGAC CGCTTCCACC ACAAATCACA GAGACGGACT TGATTCCGAT TTTAGAACCC ATGCCAAAAT GGACGCTCAT GGAAGAGATT TTAGATGAAA TAGACGAAGA ACGTCGGCAA GGCGGAGAGC TCCTCGCCGT GGCGGACTCC GAAACGGTCG TGGACCTGAC GTTTTCTCAG CCGTATGAGT CCCAAGAGCA CGGCACGCAT AGAATGTTAA AATACAAGCA AGGAGCGACG TTGATTGTGT GTAAAGAAGA GCACGTGGCG CGGCAACTGG AGTACTGCAT TCGTTACGGA ACGCCGGCGC TGATGAATGC GCATTGGGTC GATTACTTGT TTAGCCGGGG CGGGAAGAAC GTGGCGGCGC AAGTGACGAA GCGACAGACG GGATGGCGCG GTGCCGGGCG CGGTGGTGGC CGTGGTAGTG GCCGTGGTGG TGGGCGCGGT GCGGCGCAGA AGCCTCAGCG TGTGTATTCA AAACTCGAGC GCATTCAGGC GAGAATGGAG GGTCGAGAAA TCGACGAAGA TCCGGGGCCG GCGAAAGATG GGAAATCTGA CGAGAACAAA CTCTTAGCGG CGGCCGCAGC GGAGGCGAAG AAGACCCTAG TCGCTGCGAA GAAGGCCGAA GACGCGGATA AGAAGATGAA GGCTACGAAG GCGGCGAAAG AGTTAGATAT CAAAATAGAA ATGAAAGCGG AAGAGGACGC GGCGGCTAGC GACGACGAAG TCATCGTAGT CGGCGATACG CGCACGCGCT CGACCGTTAA GCGCGATACC GATAACATGT ACGTGTACGC GCACGAGCGC AAATTGAACC TGCTGAACCG CATTCAGCCT TCGTTCGTGG TGATGTATGA TCCAGACGCG TCATTCATCC GTGAACTCGA AGTCTATCAG GCGACGCGCC CGGACGTCCC GGTCAAGGTG TACTTTTTGG TGTACGACAC GTCATTAGAG GAGCAAAAGT ATCTCAGTAG CATCAAACGC GAGAGTGCGG CGTTCGAAAA CCTCATACGC ACGAAGCAGC ATATGGCTGT TCCCGCTGAA CAAGAGGGTT GGACCGATTC AGAGAATCCG TTGCCGTTGT CGTTGCCGAG CTCGACCGCT CGACATCGAA TCGAGGAGTC GCAAGAGGCG AGTACGCGTA AAGGAGGCAG ATCGCTCACT ATTCGTTCGT CTCTCGAAGT CATAGTGGAT ATGCGTGAGT TCATGTCTGC GCTCCCTTGC GTGTTGCATT CGGCAGGTTT CAAAGTGCGC CCGACGACGC TAGAAGTGGG CGATTACATC CTCTCGCCCG ATATGTGCGT CGAGCGCAAA GCCATTCCAG ACTTGATTCA GTCCTTCGCG TCTGGGCGTT TGATAGCACA AGTCGAAGCG ATGTGCAAAC ACTATAAGAC ACCGATTCTA CTCATCGAGT TTGACGGCTC AAAAGCGTTC GCTCTGCACG CAGAAGCCGA CCTTCCTCGT TTCGTCGGGC AGCAACATCT CATCACGAAG ATATGTATGC TCATCACACG TTTTCGAAAG TTGCGTCTTA TTTGGAGTCG ATCGATGCAC ATGACGGCTG AAATTTTCGC AGAGTTAAAA AGGCTTGAGC CTGAACCCTC ACTCGAAACT GCGCAGCGAA TAGGCGTTCC CGATGCCGAC GGTGACGTGC ACAAACTCGT AAAGGATAAC CTCAACGACG CTGCCGTCGA TTTGTTGCGC AGGCTACCGG GTATCACCGA CGGCAATTAC CGACGAGTCA TCGCACGAGT TGAAAGTATC GAAAAGATGT GCGACCTGAG AGAAGACGAA CTCGCGGATA TCCTTGGCGA CGCACGGCAA GCGAAGACGC TCCACACATT TTTACACGCG CCGTTTCCGA AAGAATTCAT GTTTTAG
|
Protein sequence | MSLVLADASP SEAVVDGADL LPFQREITKE LLARDGFCVL AEGLGASAVI AALVAVDDAL SKTHVLGEPP MVTLIVGASE HAKVSVKERM TALFPRAAPP LEFTADYAGD KRKKFYDAGC VAFVTTRIAS VDLLSGRLDA KRVRGIIVCS AHRTNETSGE GFVVRLFREG NRKGYVRAIS DRPGDLTRGF NSVERCLKAL MLTRVHLWPR FHLRVKDDLD ARPPEVIELR QPMSENVLKI QEAIVSVMDS CMAELKKSRY IDTSDLTLES GLFKSFDLIL QRQLDKVWHI APRRVKQIVY DLKTLRLLAD ALLKYDSVTF LKYLQALRAS ESRESMWMFT EASHAIFEYA KKRVYLLKRK AAAAQPKGLG AKRPLPPQIT ETDLIPILEP MPKWTLMEEI LDEIDEERRQ GGELLAVADS ETVQGATLIV CKEEHVARQL EYCIRYGTPA LMNAHWVDYL FSRGGKNVAA QVTKRQTGWR GAGRGGGRGS GRGGGRGAAQ KPQRVYSKLE RIQARMEGRE IDEDPGPAKD GKSDENKLLA AAAAEAKKTL VAAKKAEDAD KKMKATKAAK ELDIKIEMKA EEDAAASDDE VIVVGDTRTR STVKRDTDNM YVYAHERKLN LLNRIQPSFV VMYDPDASFI RELEVYQATR PDVPVKVYFL VYDTSLEEQK YLSSIKRESA AFENLIRTKQ HMAVPAEQEG WTDSENPLPL SLPSSTARHR IEESQEASTR KGGRSLTIRS SLEVIVDMRE FMSALPCVLH SAGFKVRPTT LEVGDYILSP DMCVERKAIP DLIQSFASGR LIAQVEAMCK HYKTPILLIE FDGSKAFALH AEADLPRFVG QQHLITKICM LITRFRKLRL IWSRSMHMTA EIFAELKRLE PEPSLETAQR IGVPDADGDV HKLVKDNLND AAVDLLRRLP GITDGNYRRV IARVESIEKM CDLREDELAD ILGDARQAKT LHTFLHAPFP KEFMF
|
| |