Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_17128 |
Symbol | |
ID | 5003961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 438717 |
End bp | 440288 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | |
GC content | 61% |
IMG OID | 640419382 |
Product | predicted protein |
Protein accession | XP_001420176 |
Protein GI | 145351638 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00169444 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.688241 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGAAAG AGATGCGCGA ACGCGTCGTG GCGCTGCGAA GCGAACACAA GTTTTACGCC TTGCAGAGCT TTTGTGAGAC GTTCGAGGCG ATGTGGATGC CGAGGAAGGA TGGGGAGGCG TGGGAACGCG CGCTGTGCGA ACCGCGCGAG CACGCGTGGA TTTTAGATTT TTTCTTGCGC GTGTGGGCGC CGGCGCCGAG CGCGTTGGGA CACGGCGAGA CGCTGCTCGT GCCGAGAAGT TGGGAAATTA GATCGCGAAG GAAAATTCGT GAGATATTGG GCGACGGCGT GGACGATGAG GCGGGATTTT TCGACATCGA CGCGCTGGCG CGATTAGAAG TGTTGTACCG GTTGTGCGAG GATTTGGTCG AGTCTCGCTC GTTTTCTAAC GTCGCGGAAC AGTGCTCGGA TGAGTTTGAC AAGGCGCGAC GGCGAAATAA AGTCGAACAG CTTCAGATGT GTAAGCGCGG CGTGCACGGC GAGCCGATCG GATACGACGC TAAGGGTCGC GCGTACTACG TCTCGGCGTT GGACGCTCGG ATTTGCCGAG CTGAAAAGGT GGCGGCGGAC GCAGGCCCGA GAGCGCGCGC GGATCCACTT TGGTCGACGC CGTTCGTGAC TCTGTTGGAA GTCATCGCGT TGAAGAACAC GTTGCAGAAC ACGACGAACA AGAAAGAGTT GGCACTGCTC GAGTATTTGT CGAAGGATCA CGTCCCGTAC CATCAGGAGC GCATGAAGCA GCAGCTGGCA GAGCTGGAGC GCGTCCAAGC CAGAAGCGAT GCCAGAGCGC GTGCGGAACA AGAACGGATT GCGTGGGAAA GCACGGCGAG GAAGAAGTCG AGTCGTCTGG AGGTGAAAAA GGCGCTTGAG AATTCGCGAC GAGAGCGCAA AGTCGAAACC ATTTCGAGCG ATGAACTCGA AACGGTTCTC GACGACTTGC GACGTTGGGT ACTGTTGCCT GAATCCATGC GTGCCACCGC GCACCCACCA CACGGGGTGC GGGTTTCCAC GCTGTACGGC GAACACGTCG GCGGGCTGAA CACGAAACCC GTCGTCGTCA AGCCAAAAGG GCGTCGTTGG GTGGGGTACT TCGTCAGCGT ACTTTGGGAA GACGACAACC CATCGAGCGA AGCCAGCTGG TCGGATGGTT ACTGCGTTTC GTACGATTCA GATACGTCCA AGCACTTGGT GATATACCCA GCTACGGCGA CGGTCGAGCG CGTCAACCTC GAAACCGTCT CGCTGCGCCT GGGTTCCCAC GGCCGAAAGT ACGATGTCCG CGTCGATAAG AAGGGCCGCG TGCTCGGCGA TCACGCCGCG CTCTACGCCG CGCTGAAGCC CGAGCTCGAC GCCGCGCGCG CATCTGAGAC GTATTTTAGC GGATGCTGGC GCGACGACGT CGTCGGAAAC GGCATCCCCG TGGACCCCTC GCGCGTCGGC GAGTGCGGCG CCGTGCTCCT CGACGCCGCC GAAGCGCCGG CCGAAGAAGC GCGAGACGCC GTCGCCGACG ACCCCCAGCG CGCCGATTCC ACCGACGACA CCACCACCAC GATCGACGAT TACTCTCCTT AG
|
Protein sequence | MGKEMRERVV ALRSEHKFYA LQSFCETFEA MWMPRKDGEA WERALCEPRE HAWILDFFLR VWAPAPSALG HGETLLVPRS WEIRSRRKIR EILGDGVDDE AGFFDIDALA RLEVLYRLCE DLVESRSFSN VAEQCSDEFD KARRRNKVEQ LQMCKRGVHG EPIGYDAKGR AYYVSALDAR ICRAEKVAAD AGPRARADPL WSTPFVTLLE VIALKNTLQN TTNKKELALL EYLSKDHVPY HQERMKQQLA ELERVQARSD ARARAEQERI AWESTARKKS SRLEVKKALE NSRRERKVET ISSDELETVL DDLRRWVLLP ESMRATAHPP HGVRVSTLYG EHVGGLNTKP VVVKPKGRRW VGYFVSVLWE DDNPSSEASW SDGYCVSYDS DTSKHLVIYP ATATVERVNL ETVSLRLGSH GRKYDVRVDK KGRVLGDHAA LYAALKPELD AARASETYFS GCWRDDVVGN GIPVDPSRVG ECGAVLLDAA EAPAEEARDA VADDPQRADS TDDTTTTIDD YSP
|
| |