Gene Information |
Locus tag | OSTLU_35336 |
Symbol | |
ID | 5002963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | + |
Start bp | 224999 |
End bp | 226459 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | |
GC content | 61% |
IMG OID | 640418384 |
Product | predicted protein |
Protein accession | XP_001418644 |
Protein GI | 145348415 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
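The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention, and the function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 224999
END_BP = 226459
GENE_LENGTH_BP = 1461
PROTEIN_LENGTH_AA = 486


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the reported 1461 bp and 486 aa are mutually consistent, which also fits the record's statement that the gene is not spliced.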
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00292419 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCACCG ACGACGCGTA CGGACGATGG AAGTCGTTGG TCCCGTTCGT GTACGACTGG TTCGCGCACA CGCGGACGTC GTGGCCGTCG CTGTGCGCGC GCTGGGGCGA GGTGCTCGAC GCGAACGACC ATCGCTCGCG ACAGCGCGTC TATCTCACCG AACAGACCGA AGGGACGACG GCGAGCGGGA AGCCGACGCC GAACACGATA TTGGTGTGCC AGGCGGAGGT GGTGCGACCG CGCGTGGCGG CGGCGGAGCA CATGATTTTC GATGAACACG CAAAGTCGCC AATTTTAAAG AAGGAAAAGG CGCTGTGGCA CCCGGGAGAG GTGAATCGAA TGCGGTGCGT GCCGGGGAAA GAAAACGTGC TGTTGACGCA CACGGATGCG CCGGAGGTGT TCGTGTTCGA CGCGAACGGG CCGGGAGGGA AGCAGAGCGC GTGTAAGAGA GCAGACGGGA CGCAGTACAC GCCGCCGACG GCGTGCTTGC GAGGACACAC GGAAAACGCG GAATACGCGC TGGCGGTGTC GACGGTGGGA GAGGTGGTGG CGAGTGGAGG TAAGGATGAA AAGGTGATGA TTTGGGAGCT CGGAGATGCG AGCACGGGGG GCGGGGCGAG AGGAAAGGAG GAGAAGGAGG GAAGCGGCGC GCCCGTGGTG GGCGGCGGGT TGAGCTCGAC GGAACTCGCG AGACACACGT CTATTTGGGC GCGCGTCGAG TTTTCGGGGC ACACCGATAC GATCGAGGAT GTGTGCTTTA ACCCACGGAA CGAGCGGGAG CTGTGCTCGG TCGGGGATGA TCGGAATATG TTTTTTTGGG ACACGCGAAC GAAGAAGGCG GCGGGGTTCG CGAAGGGGGC GCACGCGGAC GACGTGCACT GCGTCGCGTG GAGCGCGTTC GAAGAGCACG TCATCGTTAC TGGTGGAAAA GACACCACCG TTAAGGTTTG GGATCGTCGA ACGCTGTCCG ATAGCTCGAA CGAGGCAATG CACACGTTCG ACGACCACAC CGACAGTGTT TTGTGCGTGG ACATGCACCC GCAGGCAAAG GGGGTTTTCA TGACAGCCGA CGAAGTAGGC CGCGTGAACG TGTTTGATTA CTCGAAAGTC GGCGCTGAAC AGAGTGCGGA ACAAGCAAAA GCTGGTCCGG CGCACTTGGT CTTTCAGCAC AGCGGCCATC GTGGGACGGT TTGGGATATT CAGTGGAACC CTTACGACTC CTGGACCGCG TGCTCGACCT CGGTCGGGGA CTTTCAGAAT ACTTTGCAAC TCTGGCGCGT GAACGATTTG ATCTATCGCG ACGAAGAGGA GTGCATTCGT GAGCTCGAAC AACATCGGGA TATCATATGT GGTCGCGCGG CGCTGAAACA GTCAGAGCCG TCGGTGAAGG AGGAAAAAAC GGACGCCGAC ACCGACGGCG GCTCTATTAT CATCGAAGAC GACCGCGTGG ACGAAGACTA G
|
Protein sequence | MITDDAYGRW KSLVPFVYDW FAHTRTSWPS LCARWGEVLD ANDHRSRQRV YLTEQTEGTT ASGKPTPNTI LVCQAEVVRP RVAAAEHMIF DEHAKSPILK KEKALWHPGE VNRMRCVPGK ENVLLTHTDA PEVFVFDANG PGGKQSACKR ADGTQYTPPT ACLRGHTENA EYALAVSTVG EVVASGGKDE KVMIWELGDA STGGGARGKE EKEGSGAPVV GGGLSSTELA RHTSIWARVE FSGHTDTIED VCFNPRNERE LCSVGDDRNM FFWDTRTKKA AGFAKGAHAD DVHCVAWSAF EEHVIVTGGK DTTVKVWDRR TLSDSSNEAM HTFDDHTDSV LCVDMHPQAK GVFMTADEVG RVNVFDYSKV GAEQSAEQAK AGPAHLVFQH SGHRGTVWDI QWNPYDSWTA CSTSVGDFQN TLQLWRVNDL IYRDEEECIR ELEQHRDIIC GRAALKQSEP SVKEEKTDAD TDGGSIIIED DRVDED
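The sequences above are printed in blocks of 10 characters. The sketch below shows one way to remove that grouping, compute a GC fraction, and translate a coding sequence. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field of this record is empty. The helper names are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Using table 1 is an assumption: the record leaves "Translation table" blank.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def ungroup(seq_field: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq_field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = ungroup("ATGATCACCG ACGACGCGTA CGGACGATGG")
print(translate(prefix))  # MITDDAYGRW
print(gc_fraction(prefix))
```

The translated prefix matches the first block of the protein sequence in this record. Applying `gc_fraction` to the full gene sequence would allow a comparison with the reported GC content field.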