Gene Information |
Locus tag | OSTLU_38526 |
Symbol | |
ID | 5001719 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 628124 |
End bp | 631834 |
Gene Length | 3711 bp |
Protein Length | 1153 aa |
Translation table | |
GC content | 59% |
IMG OID | 640417140 |
Product | predicted protein |
Protein accession | XP_001417825 |
Protein GI | 145346706 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1643] HrpA-like helicases |
TIGRFAM ID | |
|
|
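The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence includes one stop codon. The function names `gene_length` and `coding_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of the gene on the replicon, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein (3 per residue, plus an optional stop codon)."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the record above.
start_bp, end_bp, protein_aa = 628124, 631834, 1153

span = gene_length(start_bp, end_bp)   # genomic span in bp
cds = coding_length(protein_aa)        # coding bases, stop codon included
print(span, cds, span - cds)
```

Under these assumptions the span matches the reported Gene Length of 3711 bp, and the bases left over after accounting for the coding sequence are consistent with the "Is gene spliced | Yes" field.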
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.168449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACGCT CGAACGGCGA CGGCGCGTCG GACGCGAGCG CCGCGCTCGT GTTCCTGCCC AAGCGCAAGG GCACGAGCGG GCGGGACGGC GCGAGCGCGA TCGAGGCGCG CGATCGGCGA CGCGCCGAGG CGCACGCCGC GCGCAGGGTG ACCAAGCTGA GCAATTCGCA AAAGCGCAAA CTGAAGAAAC TGGCTGAGGA GGAGCGGAAA CGAAGCGAAA GGGCGAGCGT GATGGCCATG CTGGAGGCAA ACGTGGCGGA TGAACGCGCG CTGGGGCTGA TGCGGGCGAC GACGTCGTTG GGGGCGAGAG AGACGGCGAA GGAGAAGATG CGGCGCGCGC TCAAGGCGGA ACGCGCGGGG GTGACGCTGG ACGACTTGGA CGACGCGCGG TTGACGAAGC GACCGAAGGC TGTGGGAAGA GAAGATGAAG AACAGAGCGA GGACGAGAGC GAGAGCGAAC GGGAAGCGGC GGTTGAGGTC GGAGGGGTGC GAGATGTCTC GACGGGGAAG ATGGAAGACG ACGTCGATTC GGAGGACGAC GAACGCGCGG AGGACGACGC GTTGACTCGA GCCGCCATTC GCGCGGCGGA GGCGGCGGGC ATGGCGTCAC TCGATGACGC CACGCGCGAA CTAGTGAGAT CATTGCGCGC GCGAGCTGGG GTCGGGGATG ACTCAGATAA GCCGAAATCG AACGAGTACA AAGGCTTGCA TCACGAAGTG TTTCGAGGAT GTTCCTTTGT CGTTCCCGTT CAACGGACCG GGAAGATTAA CGATTCGCGC GAGGGGCTTC CGATCGTTCA AGAGGAACAT GAAATCGTAG ACGCGATTAA TACCAATCCC GTGACCGTCA TCTGTGGCGC CACGGGTTGC GGTAAGACGA CGCAAGTGCC GCAGTTCTTG TACGAGGCTG GTTACGGCGA TCCGGATTGC GACAGTCACC CTGGGGCGGT GGCGGTGACA CAGCCTCGAC GTGTGGCGGT GACCTCCACG GCGCGTCGTG TGGCAGAAGA GCTGAACGTT CCTCTCGGTG GTGATGTCGG ATATCAAGTG CGTTACGACA AGAACGTCGG CGACAACCCG CGTATCAAAT TTATGACGGA TGGCATCTTG CTTCGAGAAG TACAGGCAGA CTTCCTTTTA CGCAAATATT CGGTGGTGAT CATCGATGAA GCGCACGAAC GGAGCGTAAA TACGGATATT CTGTTGGGTC TTCTCTCACG AATCGTTCCT CTTCGAGCCG CACTCGCGGC TGAAGGGAAA GCTGTGACGC CGCTTCGATT GGTCGTGATG TCCGCGACGT TGCGAGTCGA AGAGTTCGTG GAGAACAGGA AGCTATGTCC GACACCTCCA GCGTTGCTTC AAGTCGCCAC GCGTCAGTTT CCAGTCACTG TGCACTTTTC ACGGAAAACG GAACACGCTG ATTACGTCGG TGCGGCGACG AAGAAGGTAC TCGCGATCCA TCGCAAGCTT CCACCTGGCG GCATTCTCGT CTTTCTGACT GGTCAGCGAG AGGTGGAGAT GGTCTGTCGC AAGCTTCGAG ACGCGTACCC GCTTCACGGG AAGCGCGTCA ACGCGGCGGA AAGTTCGGAT GATGAAGATG AAGATAGCGG CGACGCGATG GATGACACAT ACGACGTCGA CGCGATCGAC GCGGGGGGGG AAGATTTGGG CGGCGACGAC GACGAACCCG ATTTCGACGG CGAGGACGAT ATGTCGGACG CCGCGAGCGA CATCAGCGAA GAGGACGAAG TCCTAGTCAT GGGTGGTGAG GGCGTTGGAG AAGAAGAAGC CGCAGAAGCC 
GAAGCAGCTT GGACTCGAGC GAATGCTCCA TCTACGGGTT TGGGCGCCGA TAAGACGGCG GACGGACCGG GAGGTTTAAA TGTGCTTCCG CTCTACGCGC TACTGCCCCC GAATTTGCAG CAGCGCGTGT TTCAAGCTTC ACCCGACGGC TCTAGAATGG TCATCGTCGC TACAAACGTC GCGGAAACAT CTTTGACGAT TCCAGGGATA CGGTACGTCG TCGACGCCGG GAGAGCAAAG GAGCGGGTGT ACGAGCGAGA CGCAAGTTTG TCGCGGTTTC AAGTCGGATG GGTGAGCAAG GCTAGCGCCG ATCAACGCGC CGGGCGGGCC GGGCGTACGA GTCCAGGGCA CTGTTATCGT CTGTTTAGTA GCGCACACTT CGTGGATGAG ATGAAAGCGC ACGCAGATCC ACAAATCCTG GGTGTACCCG TCGAAGGCGT CGTGTTGCAA ATGCGTGCCA TGGGCATCGA CAAAGTGGTC AACTTTCCTT TTATTTCGCC TCCCGAAAGA TCGGCTCTCG CGGCAGCGGA GAAGACTTTA CAGATTCTCG GAGCGGTAGA GAAGAGTAGG CATGGCGAAG AGATCGGGCC TTTGACCGAT TTAGGGCGCG CTATGGCGGT TTTGCCAATC AGTCCACGGC ACTCGAGAAT GTTGTTCGCC GCCGCTCAAA GCGGCGTGGG AGGTTGCCTC TCACCGGCGA TTGCCATCGC TGCCGCGTTG AGCCTCGACA GCCCATTCCT GCGAAATAGC AGCGAAACTG TGGAAGACGA CGAAGAAGAA GGAGAGGCGA AGGCGACACC CAAAGGTCCG CCACCGCACG TTCGATTTCA CCACCCGGCG AGCGACGCTC TCTCGGCGGC GCAGGCGCTC TTGGCGTACG ACGCCTGCAA GAGCTCGGAC GCGGTGACGT TTTGCTCTAC GAATAGGTTG CACGAAAAGA CCATGCGAGA AATGTCCGAT TTGCGGCGAC AATTGAAACG GCTCGTCGTC AATCTCGCGA CGACATCCAA GTTTGGCGAC GACGTCTTTC CCAACGCCGC AGTGCTGAAC GAACTCGACG ATTCAAACCA GGCTGCTTCG TCGATGATTT CGCTTCCTCC CGGGGGCGAT GTCGAGCGCA CGTTGAGACA GGCGCTGTGT GCGGGTTGGG CGGATAGAAT TGCCCGTCGA TCAAAACACA AAGAGATGGA GCAAGCGTCT CGAGCGAACG AGAAATCGAC CAAGGCGACG CGATACGTTC CGGCGCTCCT CGACGCCGCG GTGTTTCTTC ATCCAACGTC TTCGTTGCAT CGAAGCTCTC CGGATTACGT CGTCTACACC GACTTACTAC AAACGGACAA GCGCGCCTAC ATCGTCGGCG CCACCGGGAT TGAGCCCGAG TGGTTGATTC AGCACTGCGA CGCACTCGTG GATCAAGGCG CTATGCTCGC CGACCCGGCG CCGAGATACG TCTCGCGGGA GGACCGCGTC GTCGGTTGGA CGGCGCCGAG ATTCGGACCG CACCGCTGGG ATTTGCCGCT GAATCCAATT GCTGTTAATG ATGTGGATAC CAAATGCGCC GTCTTCGCCA CCGCGCTTTT GTCGGGGGCG GTGTCGCCAC CTATGGCGGA TTTGCGAGAG AAGCTCGCCG CCAAGCCGCT CTTGGCGTCT CGACCTGAGG GACGAGCTCA AAAGCGCGTC GTCGATTTAC TCGGCGCACT GAAACGTGTC GGTGGTGGCA TTTGTACTCG GGCACAATTA CGACAAATCT GGATGACTCG TGGGAACGAC CGGTACTTGT ACCCAGAGCT CAAGGCGTGG ATGCGGGCGG 
GCAAAGGTTA CGCGCTCGAA CAGGCGTGGT TAAAAATAGT TCAGGGCGTC GTGAATTACG ACGAAGGCAA GGAGCGAAAG AAGAAGAAGG GAAAGAAGTG A
|
Protein sequence | MGRSNGDGAS DASAALVFLP KRKGTSGRDG ASAIEARDRR RAEAHAARRV TKLSNSQKRK LKKLAEEERK RSERASVMAM LEANVADERA LGLMRATTSL GARETAKEKM RRALKAERAG VTLDDLDDAR LTKRPKAVGR EDEEQSEDES ESEREAAPKS NEYKGLHHEV FRGCSFVVPV QRTGKINDSR EGLPIVQEEH EIVDAINTNP VTVICGATGC GKTTQVPQFL YEAGYGDPDC DSHPGAVAVT QPRRVAVTST ARRVAEELNV PLGGDVGYQV RYDKNVGDNP RIKFMTDGIL LREVQADFLL RKYSVVIIDE AHERSVNTDI LLGLLSRIVP LRAALAAEGK AVTPLRLVVM SATLRVEEFV ENRKLCPTPP ALLQVATRQF PVTVHFSRKT EHADYVGAAT KKVLAIHRKL PPGGILVFLT GQREVEMVCR KLRDAYPLHG KRVNAAESSD DEDEDSGDAM DDTYDVDAID AGGEDLGGDD DEPDFDGEDD MSDAASDISE EDEVLVMGGE GVGEEEAAEA EAAWTRANAP STGLGADKTA DGPGGLNVLP LYALLPPNLQ QRVFQASPDG SRMVIVATNV AETSLTIPGI RYVVDAGRAK ERVYERDASL SRFQVGWVSK ASADQRAGRA GRTSPGHCYR LFSSAHFVDE MKAHADPQIL GVPVEGVVLQ MRAMGIDKVV NFPFISPPER SALAAAEKTL QILGAVEKSR HGEEIGPLTD LGRAMAVLPI SPRHSRMLFA AAQSGVGGCL SPAIAIAAAL SLDSPFLRNS SETVEDDEEE GEAKATPKGP PPHVRFHHPA SDALSAAQAL LAYDACKSSD AVTFCSTNRL HEKTMREMSD LRRQLKRLVV NLATTSKFGD DVFPNAAVLN ELDDSNQAAS SMISLPPGGD VERTLRQALC AGWADRIAPN EKSTKATRYV PALLDAAVFL HPTSSLHRSS PDYVVYTDLL QTDKRAYIVG ATGIEPEWLI QHCDALVDQG AMLADPAPRY VSREDRVVGW TAPRFGPHRW DLPLNPIAVN DVDTKCAVFA TALLSGAVSP PMADLREKLA AKPLLASRPE GRAQKRVVDL LGALKRVGGG ICTRAQLRQI WMTRGNDRYL YPELKAWMRA GKGYALEQAW LKIVQGVVNY DEGKERKKKK GKK
|
| |
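The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch, assuming the field text has been copied into a string. `clean_sequence` and `gc_percent` are hypothetical helper names, and the example input is only the first two blocks of the Gene sequence field.

```python
def clean_sequence(field: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the result."""
    return "".join(field.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence; 0.0 for an empty string."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the Gene sequence field, used here as a short example.
example = "ATGGGACGCT CGAACGGCGA"
seq = clean_sequence(example)
print(len(seq), gc_percent(seq))
```

Run on the full Gene sequence field, the results can be compared against the Gene Length and GC content values in the record.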