Gene Information |
Locus tag | OSTLU_31789 |
Symbol | |
ID | 5001775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 534635 |
End bp | 535945 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | |
GC content | 61% |
IMG OID | 640417196 |
Product | predicted protein |
Protein accession | XP_001417802 |
Protein GI | 145346658 |
COG category | [J] Translation, ribosomal structure and biogenesis; [K] Transcription; [L] Replication, recombination and repair |
COG ID | [COG0513] Superfamily II DNA and RNA helicases |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00179288 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0182063 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAACATCG TGGAACCTAC GGAGATACAG ACGAAGGCGA TCGATGTCAT CGGTCGAGGG GCGGGGAACG CGTTCGTCGC GTCGCACACG GGGAGCGGGA AGACGTTGGC GTACTTGTTG CCGGTGATTC AACGCATGAA GGCGGCGGAG ATCGCGGCGG GGGATCGGTT GGCGAAACCG AAGAGACCGA AAGTGGTCGT GGCGTGCCCG ACGCGAGAGC TGGCGGAACA AGTCGCGGAG GTGGCGAAGG CGTTGAGTCA CGTGGCGAAA TTTAGTTCGT ATTTAGTCGT CGGAGGTAGA CGTTTAGGGA CGCAAAAGGA GCGGTTAGAC TCCGCGATCG ATGTCGTGAT CGGGACTCCG GGTCGATTGA TCAAGCACGT CGATCAGGGG AACTTATTCT TGGGGAGCGT GGACGCGATG GTGTTGGACG AGGCGGACAC GCTCTTTGAA GCCGGATTTG GCGACGAGGT AAAGCGGTTG TTACGACCAC TCAAGGCGCG TCCAGAGGGA AAGACGTGCG TCCTCGTCTC GGCCACCATG CCGGATCGAC TAAAGAAGCT CGTGGACGAG GAGCTTCCGG CTTTGCAGTA CATTAAGACG GATTCATTGC ATCGCTCCGC GCCAGGGCTC AAGCACCGCT TCGTCGACTG TCCGGGCGAC GTGGACAAGA TGACGGTGCT CGAGCAAATC GTCGCGCCCG AGCACAAACA GGGGAAAAAG CTGATGATCT TTTGCAACAC GCTTCCCTCG TGCATCGCGG TCGAGCGCAC CATGTTCGAG GCAGATATTC GCACCGTGCA GTACCACGGC GACATGACGA GCGACGCTCG CGCCGACGCC ATGCGCGAAT TCATCGACGC CGACGCCGAC GAAAACCTCA CCATGGTGTG CACCGACCTC GCCGCTAGAG GTTTGGATTT TGGTCGCGTC AAGGTCGATC ACGTGGTGAA CTTTGACTTC CCCATGAACT CGCTCGACTA CATTCACCGC TCCGGTCGCA CCGCTCGCGC GGGCGCCGGC GGTAAAGTCA CCAACCTCGT CGCCAAAAAG GACCGCGTTC TCGCGAGCGA GATCGACAAC GCCGTCAAGC TCGGTCTGCC GATCGACAAC GCCACGAGCT CACGCGCCGT GAGCGAAGCT CGCAAGAAAA AATCCATCGC CGACGCTCGC GACAGGCGCA CCGGAGGCCG TTCTCGCGCC AAGCCGAGCA CCGTGCGCGA TTCCAAACCT TCCAACCGCG GTCGTCGCGG CGCCGCGCGG TTCACCACGG ACGACACTAA GACTAAGCCT TCCAACCGAG GTCGTCGCTG A
|
Protein sequence | MNIVEPTEIQ TKAIDVIGRG AGNAFVASHT GSGKTLAYLL PVIQRMKAAE IAAGDRLAKP KRPKVVVACP TRELAEQVAE VAKALSHVAK FSSYLVVGGR RLGTQKERLD SAIDVVIGTP GRLIKHVDQG NLFLGSVDAM VLDEADTLFE AGFGDEVKRL LRPLKARPEG KTCVLVSATM PDRLKKLVDE ELPALQYIKT DSLHRSAPGL KHRFVDCPGD VDKMTVLEQI VAPEHKQGKK LMIFCNTLPS CIAVERTMFE ADIRTVQYHG DMTSDARADA MREFIDADAD ENLTMVCTDL AARGLDFGRV KVDHVVNFDF PMNSLDYIHR SGRTARAGAG GKVTNLVAKK DRVLASEIDN AVKLGLPIDN ATSSRAVSEA RKKKSIADAR DRRTGGRSRA KPSTVRDSKP SNRGRRGAAR FTTDDTKTKP SNRGRR
|
| |
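The length fields in the Gene Information table can be cross-checked against the coordinates with a few lines of arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon (the listed gene sequence ends in TGA). The helper names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical and do not belong to any database API.

```python
# Sanity checks for the record above. Field values are copied from the table;
# the helper names are illustrative and not part of any database API.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring the spaces between 10-base blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


START_BP, END_BP = 534635, 535945  # Start bp and End bp from the table

length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1311 436
```

Both values agree with the Gene Length (1311 bp) and Protein Length (436 aa) fields. Passing the full gene sequence to `gc_fraction` would give a figure to compare against the GC content field in the same way.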