Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_12759 |
Symbol | CHR3506 |
ID | 5002700 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | - |
Start bp | 705643 |
End bp | 708540 |
Gene Length | 2898 bp |
Protein Length | 956 aa |
Translation table | |
GC content | 53% |
IMG OID | 640418121 |
Product | predicted protein |
Protein accession | XP_001419020 |
Protein GI | 145349184 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0485802 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.764276 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCTCG AACGCGTGCG AGAGGATCAG AACAAGAAGA TTGCGGGAGA TAATAAGGCG GGGAAGTGGA AGTTTTTGTT GGCGCAAACC GAGGTCTTCG CGCACTTTTT GTCGGGGACG AAGGCGGCGA ACGAGGCGGC GAACAAGGGC AAGCGCGGGC GCAACAAGAG CCACGCGGCG GAGGAGTCGG AGGACGCCGA GCTCGTGGAA CACGCTGAGG ATTACCAAGC GGTGCGTTTG ACGTCTCAAC CGACGTGCAT TAAGTTTGGC AAGATGCGAG AGTACCAAAT TGCGGGTTTG AACTGGATGA TTCGATTGTT CGATCACGGC ATCAACGGCA TCTTGGCGGA CGAGATGGGT CTCGGGAAGA CGTTGCAGAC GATTTCTCTT CTCGGTTACC TCGCCGAATA CCGGGGTGTT ACCGGACCGC ACATGGTTGT CGTGCCGAAA TCCACGCTCG GTAACTGGAT GAACGAGTTT AAGCGTTGGT GTCCGATGAT TCGCACGTTC AAGTTTCACG GAAACGCCGA AGAGCGCGAG GCGCAAAAGG CCAAGTTTTT GGTGCCGGGC GGTTTCGACG TGTGCGTGAC GTCTTACGAA ATGGTCATCA AGGAGAAGAC TGCGTTGAAG AAGTTTCACT GGCGTTACAT CATCATCGAC GAAGCGCACC GCTTGAAGAA TGAAAATTCT CGTCTTTCCA TCGTCCTAAG AACGTTTTCG GCGAACAACC GCATGTTGAT CACCGGGACG CCGCTTCAAA ACAACCTTCA CGAGCTCTGG GCTTTGCTCA ACTTTTTGCT CCCCGAAGTG TTCGGTAACG CCGGTCAGTT CGATGAGTGG TTTGCGAACG TTGAAGACGG TGAGGGTGGT TCCGGTGCCG TCGTGTCTCA GCTTCACAAG GTTTTACGCC CCTTCTTGTT GCGTCGATTG AAGACGGAGG TCGAAACCAG CCTCCCGCCG AAGAAGGAGA CAATTCTCAA GATTGGCATG ACTGAGATGC AAAAAACGTT TTACAAGCGC ATCTTGCAAA AGGACATCGA CATTGTTAAC AGTGGCGCGG ATAGATCTCG GCTTTTGAAT ATCGTCATGC AGCTTCGCAA GTGCTGCAAC CATCCGTACT TGTTCCAAGG TGCCGAACCG GGTCCGCCGT ACATCACCGG GGATCACCTC ATCGAGAGCT CGGGTAAGCT CGCGCTTTTG GACAAGCTTT TGCCTCGTCT CATGCAACGC GGCAGTCGTG TGCTGATCTT CTCTCAGATG ACTCGTTTGC TCGACATCTT GGAAGATTAC TTGATGTACA GAAACTACCA ATATTGTCGC ATCGACGGGA GCACGGATGG CGCCGTCCGC GAAGACCACA TTGACGCCTT CAACAAGGAA GGTTCGGAGA AGTTCTGTTT CTTGCTTTCC ACGCGGGCGG GTGGTTTAGG TATTAACCTC GCCACTGCGG ACACCGTCAT CATCTACGAC AGCGACTGGA ACCCGCAAAT GGATCTTCAG GCGATGGATC GAGCGCACCG CATCGGTCAA AAGAAGGAGG TACAAGTGTT CCGCTTTTGC ACCGACGGTA GCGTCGAGGA GAAGGTGATC GAGAAAGCGT ACAAGAAACT CGCGCTCGAT GCTCTCGTCA TTCAACAAGG ACGATTGCAA GAGAACAAGA AGAACCTCGG CAAGGATGAA CTCTTGGCGA TGGTTCGCTT TGGCGCCGAG AAGATTTTCG ACTCGTCGAC GACGTCGATC ACGGACGAAG ATGTCGACGC CATCATGGCT CGCGGTGAAG AAGAAACCAA AGCACTCAAT AGCAAAATGC AAGGTTTCAC GGAGAAGGCC ATTCAGTTCT CTATGGGTGC CGAGAACTCG CTTTACGAAT TCGAGGACGA AGACGACAAA AACGTCGCCG CGTTGCCAGA GGGCATTGAC ATGAAGACCA TCATCAGCAG CAACTGGATC GATCCTCCCA AACGCGAACG TAAGAAGAAC TACAACGAGT CTGATTACTA TCGCAGCGCC ATGGCGCAAG CGGCTCGCCC ATCCAAGCCA ATGGGACCGA AGATTGCCAA GCTGCAGCAA ATGCACGACT TCCAGTTTTA CAACACCGCG CGAATTCAAG AAATTTACGA CAAGGATGTT AGACGCAAGA CGTACGAATG GCAGAAAGAC AAGAAGAAGG AGGAAGCTAA AGCTGCGCAA GATGAGGAGG CGCCCACTGA AGCGGAGGAA GACGATCCCA ACGCGCCTCC GGCGATCACG GAGGAAGAAA AGGCTGAGCA AGAATCGCTC CTCTCTCAAG GTTTCACAGA ATGGTCTCGC CGAGACTTCC AAGCCTTCTG CCGACTCAGC GAGAAGTACG GGCGCGAAGA CGTGGAGTCT ATTGCGAGTG AGATGGAAGG AAAGACTCTC AAAGAGGTCA AGGATTACGC CGCCGTATTC TGGAAACGTT ACGAAGAGAT TGCCGATCAT CCGCGCATCA TCAGCAACAT CGAAAAGGGT GAACAAAAGA TTCAGCGTCA ACACGACATG CTGAAGGCTG TACGAGAAAA GATCGCCAAG TACAAGAATC CGTGGCGCGA GCTCAAGTTG ACGTATGGTC CGAACAAATT CAAATCGTTC ACCGAAGAGG AGGATAGGTT CTTGTTGTGC TCCATCCCCG AAGTTGGCTT TGGGAATTGG GACGAGCTCA AGGCGCAGAT TCGCCAGCAC TGGCAATTCC GATTCGATTG GTTCATCAAA AGCAGAACGC CAAAAGAGCT CGGTCGCCGC GTGGAGACTT TAATTTCTCT CATCGAGAAA GAGGCTCAGG ACAGAGGCGA CAAGAAGCGA GACGCCGAAG CCGAAGCCGA AGCCGACGGG AGCGCGCAGA AAAAGGTGAA GGTTGAAGCT GAAGGCGTGA CCGCGTAG
|
Protein sequence | MELERVREDQ NKKIAGDNKA GKWKFLLAQT EVFAHFLSGT KAANEAANKG KRGRNKSHAA EESEDAELVE HAEDYQAVRL TSQPTCIKFG KMREYQIAGL NWMIRLFDHG INGILADEMG LGKTLQTISL LGYLAEYRGV TGPHMVVVPK STLGNWMNEF KRWCPMIRTF KFHGNAEERE AQKAKFLVPG GFDVCVTSYE MVIKEKTALK KFHWRYIIID EAHRLKNENS RLSIVLRTFS ANNRMLITGT PLQNNLHELW ALLNFLLPEV FGNAGQFDEW FANVEDGEGG SGAVVSQLHK VLRPFLLRRL KTEVETSLPP KKETILKIGM TEMQKTFYKR ILQKDIDIVN SGADRSRLLN IVMQLRKCCN HPYLFQGAEP GPPYITGDHL IESSGKLALL DKLLPRLMQR GSRVLIFSQM TRLLDILEDY LMYRNYQYCR IDGSTDGAVR EDHIDAFNKE GSEKFCFLLS TRAGGLGINL ATADTVIIYD SDWNPQMDLQ AMDRAHRIGQ KKEVQVFRFC TDGSVEEKVI EKAYKKLALD ALVIQQGRLQ ENKKNLGKDE LLAMVRFGAE KIFDSSTTSI TDEDVDAIMA RGEEETKALN SKMQGFTEKA IQFSMGAENS LYEFEDEDDK NVAALPEGID MKTIISSNWI DPPKRERKKN YNESDYYRSA MAQAARPSKP MGPKIAKLQQ MHDFQFYNTA RIQEIYDKDK DKKKEEAKAA QDEEAPTEAE EDDPNAPPAI TEEEKAEQES LLSQGFTEWS RRDFQAFCRL SEKYGREDVE SIASEMEGKT LKEVKDYAAV FWKRYEEIAD HPRIISNIEK GEQKIQRQHD MLKAVREKIA KYKNPWRELK LTYGPNKFKS FTEEEDRFLL CSIPEVGFGN WDELKAQIRQ HWQFRFDWFI KSRTPKELGR RVETLISLIE KEAQDRGDKK RDAEAEAEAD GSAQKKVKVE AEGVTA
|
| |