Gene Information |
Locus tag | OSTLU_33673 |
Symbol | |
ID | 5003522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 612434 |
End bp | 614099 |
Gene Length | 1666 bp |
Protein Length | 547 aa |
Translation table | |
GC content | 62% |
IMG OID | 640418943 |
Product | predicted protein |
Protein accession | XP_001419855 |
Protein GI | 145350950 |
COG category | [R] General function prediction only |
COG ID | [COG0714] MoxR-like ATPases |
TIGRFAM ID | |
|
|
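The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the reported 1666 bp is consistent with that) and that the protein length excludes the stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def cds_length(protein_aa: int) -> int:
    """Coding length in bp for an unspliced CDS: one codon per residue plus a stop codon."""
    return 3 * (protein_aa + 1)


# Values copied from the record above.
START_BP, END_BP, PROTEIN_AA = 612434, 614099, 547

span = gene_length(START_BP, END_BP)   # bp covered by the gene coordinates
coding = cds_length(PROTEIN_AA)        # bp needed to encode the protein plus stop
print(span, coding, span - coding)     # prints "1666 1644 22"
```

Under these assumptions the 547 aa protein accounts for 1644 bp, so 22 bp of the 1666 bp gene span lie outside the reading frame that encodes the listed protein.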
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.229173 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.12598 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCGCGCGCGC ACGCATTCAT CCATGGCGCT CGCGATGGCG CGCGCGTCGC GCGTCGGCGC GATCCCGGCG CGCGCGCGCG ACCGACGCGA GGTCCGACCG CGCGGTCGCG CGATCGCGCG CGCCGCGGAC GCGGACGAGC CGGAGTGGAA GCGGCGACAG CGCGAGCAGG ATGCGCAGAT ATCGGCCGAG GAGCGCGCGG TGCGCGAGCG CGTGCGCGAG CGCATGGCGC CGATCGTGAG CGGAGAGACC GACGTCGACG CGGACGCGCA CGCAAAGGAC GGGATGGCGT CGTCGAGCGC GCGGAGGGAG GAACTGTCGA GGAAGATCGC GGCGGCGACG ACGGCGCTGG AGCGAGGTTT GGTGGAACGC GAGACGGAGA CGCGGTTGTT GCTGCTGGCG GCGTGCTGCG GAGAACACTT GCTGTTGCTC GGACCGCCTG GGACGGCGAA GAGCGAACTT GGACGCCGGC TGAGCGCGCT GTGCGAGGGA GGGCAATTTT TTGAACGATT GCTCACGCGG TTTTCGGTGC CGGAGGAGTT GTTTGGACCG CTGAGCATGA AGGGGCTGGA AAACGATGAG TACGTGCGGA AGATTGATGG ATATCTGCCC AGCGCCAACG TGGCGTTTGT GGATGAAATT TTTAAGGCGA ATTCAGCCAT CTTGAACTCG CTGTTGACGA TATTGAACGA GAGGTTGTTT GATAACGGGA GCGAACGCGT CGACGTGCCG CTGTTGTGCT TGGTCGGCGC TTCGAATGAA TTACCGGAGA GCGAGGAGTT GGACGCGCTG TACGATCGAT TTTTATTGCG ATCGAGCGTC GAACAAGTAT CCGCCGGTGG GCTCGGGAAG TTGCTATCTC TCGGCGGCGA GGCGGCGATC GGGTCCAAGG CGAACGATCA CGGCGGCGTC TCGACGAGCG ACACCCGACT CAAGCCGGAG GACTTTGCGA ACATTCGATA CGAAGCCGCG GCGGAGACCG AGGTACCGAC GAACGTCATC GAGCTGATCA CTGACTTACG AACGTTTTTG CAAGACAAGT GTGAGCCGCC GGTGTACGTT TCCGATCGTC GCTTGCTCAA GGCGGTGCAA ATGTTGCGCG TCGCGGCGTA CACGAACAAC CGCGACGAGG TGAGCGAATT CGACACCTTG TTGCTCGTGA ACGTGTTGTG GCAGCGCCCG AATGAGGCAA TGATGATCAA AGATTGGATT TTAGAGCGTC TCGCGCAGGA TCGTGGGACG AAACAGGTGC AGTATTTATT AGCGGGCTTG TTCGGCCGCG CCTGCCGCGC CGACGGCGAC GCCGAAGAGT GCGCGCGATT GCTCAGCGAA GCGAAAAATT TGCGCGGTGT ACTCACAGCA CAACTCAACT CCTTGCGAGG CGCTCAAGGG GGTTCTCTCC CCGCTTTGCG CGAACATCTT TGGCTTTCTC CCGCGGACGC TTCTCGCGCC GCGCAGACTC TCGGTCCGAT GTTTTCCAAG GTGAGCAAAT CTCTCGAGAA GCTCCTCGAA GACGTCTTGA CGCTCGAGGT CGCGTTAGAG CGCGACACCG AGCCGCACAT TCTTGCCCTT CTCATGCCCG ATTACTGGGC CGCGTTCATT CGCGAAGGCC CGATAGCTGA AGTCCAGCCT TTAGGCGTCT CCAACGCGAC CTCGGCGGCG CCGTGA
|
Protein sequence | MALAMARASR VGAIPARARD RREVRPRGRA IARAADADEP EWKRRQREQD AQISAEERAV RERVRERMAP IVSGETDVDA DAHAKDGMAS SSARREELSR KIAAATTALE RGLVERETET RLLLLAACCG EHLLLLGPPG TAKSELGRRL SALCEGGQFF ERLLTRFSVP EELFGPLSMK GLENDEYVRK IDGYLPSANV AFVDEIFKAN SAILNSLLTI LNERLFDNGS ERVDVPLLCL VGASNELPES EELDALYDRF LLRSSVEQVS AGGLGKLLSL GGEAAIGSKA NDHGGVSTSD TRLKPEDFAN IRYEAAAETE VPTNVIELIT DLRTFLQDKC EPPVYVSDRR LLKAVQMLRV AAYTNNRDEV SEFDTLLLVN VLWQRPNEAM MIKDWILERL AQDRGTKQVQ YLLAGLFGRA CRADGDAEEC ARLLSEAKNL RGVLTAQLNS LRGAQGGSLP ALREHLWLSP ADASRAAQTL GPMFSKVSKS LEKLLEDVLT LEVALERDTE PHILALLMPD YWAAFIREGP IAEVQPLGVS NATSAAP
|
| |
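The relationship between the gene sequence and the protein sequence can be illustrated by translating the start of the gene. This is a minimal sketch, not the annotation pipeline's method: the Translation table field is empty in this record, so the standard genetic code (NCBI table 1) is assumed, and the helper name `translate_from_first_atg` is hypothetical. Only the first 60 nt of the gene sequence are used, and ambiguity codes such as N are not handled.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: the record leaves "Translation table" blank, so table 1 is used here.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of the input."""
    dna = dna.replace(" ", "").upper()
    start = dna.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First 60 nt of the gene sequence listed above (spaces are stripped by the helper).
GENE_PREFIX = "CCGCGCGCGC ACGCATTCAT CCATGGCGCT CGCGATGGCG CGCGCGTCGC GCGTCGGCGC"
print(translate_from_first_atg(GENE_PREFIX))  # prints "MALAMARASRVG"
```

The output matches the first twelve residues of the protein sequence in the record; the first ATG in the listed gene sequence begins at nucleotide 23.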