Gene Information |
Locus tag | OSTLU_41641 |
Symbol | |
ID | 5005020 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | - |
Start bp | 129124 |
End bp | 132159 |
Gene Length | 3036 bp |
Protein Length | 936 aa |
Translation table | |
GC content | 65% |
IMG OID | 640420441 |
Product | predicted protein |
Protein accession | XP_001421062 |
Protein GI | 145353527 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1643] HrpA-like helicases |
TIGRFAM ID | [TIGR01970] ATP-dependent helicase HrpB |
|
|
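The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (consistent with the reported gene length), and that the reported protein length excludes the stop codon; the function and variable names are illustrative and not part of the record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
start_bp, end_bp = 129124, 132159
protein_aa = 936

gene_len = span_length(start_bp, end_bp)

# Assumption: the protein length does not count the stop codon,
# so the coding sequence is (residues + 1 stop codon) * 3 nucleotides.
cds_nt = (protein_aa + 1) * 3

# Nucleotides inside the gene span that do not encode the protein.
non_coding_nt = gene_len - cds_nt

print(gene_len, cds_nt, non_coding_nt)
```

Under those assumptions, the gene span is longer than the coding sequence, which is what one would expect for a record flagged "Is gene spliced | Yes".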
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.000692526 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGACGC GCGTCGCGAC GACGGACGCG CGCGCGCCGG TCGCGCGGCG CGCGCGCGGT GCGCGTCGCG ACGTCGCGCG GCGCGCGACG GGAGACGACG CGCGGCGCGA GGCGCTGCGA AAAGCGCGCG CGAAGTTGCC CATCGACGCG GTCGTGGACG CGTGCTTGGA CGCGCTCGAG GTGCGTCGAC GGCGCGCGAA AGCGCGACGC GGGACTGACG AGGCGAACGA TTGGGTTCGA TGCGAACAGC GCGCGACGCG CGCGGTGATG CAGGCGCCGC CGGGGGCGGG GAAGACGACG GTGATGCCGC TGGCGGCGGC GCTGGCGGCG GCGAGCGCCG GCGGCGACGG ACGCGCGGGG AAAGTGATCG TGCTCGAACC GCGACGGTTA GCGGCGAAAG CGGCGGCGAT GCGGATGGCT GAGATGTTGG GGGAACGCGC GGGGGAAACG GTGGGGTATC AGGTGCGATT TGAACGAAGG GCGAGCGCGG CGACGCGGGT GGAGGTGGTG ACGGAGGGCG TGCTGACGCG GCGGTTGAGA AACGATCCAG AGCTGCGGGA CGTGGGATTG GTGGTGTTCG ATGAGTTTCA CGAGAGGAAT TTAGACGCGG ACGTCGCGCT GGCGCTGTGT CGAGAGGTGC AACAGACGAT ACGGCCAGAT TTGAGACTGT TGGTGATGAG CGCGACGCTC GGGGAGATGG GCGCGCGCGT GGCGGCGTTG TTGAGGGATG AGAACGGGCC CGAGGTGCCG GTGATCGTGT CGGAGGGGCG GTCGTATCCG GTGGAGACGA TTTATTTGGG CGCGCCGGGC GCGGGATGGG GTGAGCTCGA GCGCGCGACG ACGAACGCGG TGAAGGACGC CGTGCGCGCG TGCCCAGATG GCGACGTGCT GTGTTTTCTT CCGGGCGCGG CGGAGATCAA TCGCGTGGTG CGAGATCTGC AAAGGGAGCT TCCGAACGGT GTCGTGGCGT TGCCACTGTA CGGCGCGCTA TCGCAAGAAG AACAGGCGGC GGCGCTCGCG CCGTCGAAAC CGGGCACGCG TCGCGTCGTC GTGAGCACTC CAATCGCCGA GTCTTCGTTG ACCATCAACG GCGTGAAAGT GGTCGTGGAC TCGGGATTGT GCAAGACGCC CAAGTTCGAC GCTCGGAAGG GTATGACGCG ACTGGAGACG ACTCGTGTCT CCCGCGCATC GGCGGATCAA AGGCGCGGGC GAGCCGGGCG CATCGCTCCT GGGACGTGCT ATCGTTTATG GAGCGAGGCC TCGAACGCGA AACTTCAGCC AGACACGACA CCCGAGATTT TGCAAGCCGA CTTGACTCCG GTGGCGTTAG ATTTAGCAGC GTGGGGCGTC GGCGATGGGG CAGACATGGC TTGGCTCGAC CCACCGCCTG AAGGCCCGCT CATCGCGGCG AGACGGTTGT TACGCGAGCT CGGCGCGTTG GAGGAAGGCA AACTCGTCCC TAGCGACGTG GGTTCGATCA TGTCCGAGCT TCCAGTGCAC CCAAGACTGG CGCGTATGTT ACTTTTTGGC GCGTCACGCG GCGCTGAGAG CGCTCGACTC GCGTGTCAAC TCGCAGCTGT CATCGGCGAC CGCGACTTGA TCTCTGGACG CGACGCCCCT CTCGACGTTC GGTGTAGACT TCGCGCGCTT TGGGGTCAAG ATCCACTCGC AAGCGCGGGC GATTTGGATG AAGAGAAACC AGAAGTCGAC CCGACGAAAC CGACGCGCGT TCCAATCGGT ACCAAACTCC CGAAGAGCGG GAAGAAAATC AAAGGCGCGC CTCGCGGCAA ACGCGCGAGC 
GTAAACGCGG CGGTGAGCGT CGCGACGGGC GGCGCGAGCG GTGCGAGCTG GAATGTCGAC GAGCGGGCGG TTCGCGAAGC GAAACAAGTT GCCGAGCAGT TACTCGGGAA CTTGCGCCGC TTAGCGTCTT CGCATCCTGA CTTCTGCGCT CCGGGTGTCG GCGAGGGCGA CGCCTCGGCG GTGGTGTGGT CGTTCTTGTG CGGCGCGGGC GAAAGCGAAG CCGGCCTACT GCTCGCTGTC GCCTATCCTG ATCGCGTCGC CGCGCGTAAA AACCGAGGAG GGGCGTTTCA ACTCTCTGGC GGCGGCGCCG CGAGCGTGGG AAGTGAGCAC AAGGATGACG CGCTGCTTCG CTCTGGCGAC AAGGCGAATG AAACTTTAGT CGTCGTCGAA TTAGCTGGCG ATGGCGCGGG AAACGCGGGA TCGAGGAACG ACCGCGTGCG GTTGGCGGCA CCGATCGACC GTGCATGCTT AGAAAGCGGT GGTGCGCTGT ACGAAGCGTT GTCCAAGGAG AGTGACGACG TGTTCTGGGC GAGCGCATCG AAATCAGTCT TCGCGCGCAG GCGCTTGACG GTAGGATCTT TGGTGCTTCG AGAGATTCCG TTTTCGGTGA AAGACAATCC CGAGGCGACT GTGAGCGCGA TGTTAGACGG TATTCGCGAG ATGGGTCTCG CGAGCGCCTT CGGTTTGAAC AAAGCGACGA CATCGTGGCT CAAGCGCGCG GAATATGTCC ATCGCTCAGG CGTGGACGCC ACGTTTCCAA ATCTCTCCGA AGAAAATTTG CTGAGCTCCG CCGGTGAATG GCTCGCGCCG TGGATCGCTG GCGCGCAGTC TAAATCGGAT CTCGCGAAAG TTGACGTCGC ATCCCTAGTC AAGGCGCATT TCTGCACGTA CGACCATCTC AAGCTCGTGG ACGACGCCTG TCCCGTCGCC GTTCGCCTGC CCAGCGGATC CAACGCCAAA GTCGACTACG ACGGCGACGT TCCCGTCGTC GCCGCGCGCA TCCAAGAGTT CTTCGGCACT ACCGAAACTC CACGCGTGGG CGGCGTCAAC TGCGAATTAC ACCTCCTCAG CCCGGCCGGT CGCACGCAAG CCGTCACGCG CGACTTGGCG TCGTTCTGGC GCAACGCCTA TCGCACGGAC GTTCGCAAAG AGCTCGCCGG TCGATATCCC AAGCACTTCT GGCCCGACGA TCCCGAGTCC GCCGCCGCGA CGAGCAAGAC CAAAAAGTAC ATGTAA
|
Protein sequence | MATRVATTDA RAPVARRARG ARRDVARRAT GDDARREALR KARAKLPIDA VVDACLDALE VRRRRAKARR GTDEANDWVR CEQRATRAVM QAPPGAGKTT VMPLAAALAA ASAGGDGRAG KVIVLEPRRL AAKAAAMRMA EMLGERAGET VGYQVRFERR ASAATRVEVV TEGVLTRRLR NDPELRDVGL VVFDEFHERN LDADVALALC REVQQTIRPD LRLLVMSATL GEMGARVAAL LRDENGPEVP VIVSEGRSYP VETIYLGAPG AGWGELERAT TNAVKDAVRA CPDGDVLCFL PGAAEINRVV RDLQRELPNG VVALPLYGAL SQEEQAAALA PSKPGTRRVV VSTPIAESSL TINGVKVVVD SGLCKTPKFD ARKGMTRLET TRVSRASADQ RRGRAGRIAP GTCYRLWSEA SNAKLQPDTT PEILQADLTP VALDLAAWGV GDGADMAWLD PPPEGPLIAA RRLLRELGAL EEGKLVPSDV GSIMSELPVH PRLARMLLFG ASRGAESARL ACQLAAVIGD RDLISGRDAP LDVRCRLRAL WGQDPLASAG DLDEEKPERA VREAKQVAEQ LLGNLRRLAS SHPDFCAPAG LLLAVAYPDR VAARKNRGGA FQLSGGGAAS VGSEHKDDAL LRSGDKANET LVVVELAGDG AGNAGSRNDR VRLAAPIDRA CLESGGALYE ALSKESDDVF WASASKSVFA RRRLTVGSLV LREIPFSVKD NPEATVSAML DGIREMGLAS AFGLNKATTS WLKRAEYVHR SGVDATFPNL SEENLLSSAG EWLAPWIAGA QSKSDLAKVD VASLVKAHFC TYDHLKLVDD ACPVAVRLPS GSNAKVDYDG DVPVVAARIQ EFFGTTETPR VGGVNCELHL LSPAGRTQAV TRDLASFWRN AYRTDVRKEL AGRYPKHFWP DDPESAAATS KTKKYM
|
| |
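The gene sequence above is shown in space-separated blocks of ten bases, so it has to be cleaned before a value such as GC content can be recomputed. The following is a minimal sketch of that calculation; the helper names `clean` and `gc_fraction` are hypothetical, and the snippet uses only the first three blocks of the sequence as sample input, so its result is not expected to equal the whole-gene figure in the table.

```python
def clean(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# First three blocks of the gene sequence, used only as sample input.
sample = "ATGGCGACGC GCGTCGCGAC GACGGACGCG"

print(len(clean(sample)))
print(f"{gc_fraction(sample):.1%}")
```

To check the reported GC content, the full gene sequence from the record would be passed to `gc_fraction` in place of `sample`.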