Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25712 |
Symbol | |
ID | 5006247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009371 |
Strand | - |
Start bp | 53202 |
End bp | 56122 |
Gene Length | 2921 bp |
Protein Length | 967 aa |
Translation table | |
GC content | 60% |
IMG OID | 640421668 |
Product | predicted protein |
Protein accession | XP_001422190 |
Protein GI | 145355912 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.260732 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.855505 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGAGG AGGAGGAGGG AGAGGCGACG ACGTCGTCGA GCGGGACGGG ACGCGGAGGC GGCGCGGCGC GATGGTTGAT GAACGCGGAG ACGAGCGGGG CGTACGGGGA AGCGGCGTCG CGAGGAAGGA CGGGGGGGAG AGGAAGGGGT GGGGGCGCGC GGGGGAGGGG CGCGGCGCGA TCGAAGTCGG CGGACTCGAA GAAGCGGACG CGAGGGGACA AACCTACGAA CGTGCGACAG CGCGAGCGGA GCGACATGTA CGTGAAGAAT AATCAAATCA TCGCCAAGTT TGACGACGTC GAGGACGTGT TAGATTTCGC GAGCGAGAAT TTGGAAATCA TGAACGTGGT GAACTTGGCG ACGGCGGCGC ATCGGGTGGG GAAATTGAAC TCGACGCGGA CGAGGAACGA AGCCGGAGCG CCGGCGACGG CGACGAGGCA TCCCGCGGTG GTGGAAGACG CGCGATTTCG CGCGTTGTTT GAGAAATTGC GCGAGTATCT CGTCGCTTCG CAAAACGGAT CGCCGCTCGG GAAGGGGTTG GGGCGGTTCA ACGCTCGCGA ACTGTCCGCG ATTTTGTGGG GCTCCGCGCA CTGCGGCATC ACGACGTCGG ACGACGACCC AACCGTCGCT CTCGTGGTGA GGCGAATCGC AAACTTGGAC GACGACGACC CACCGGCGCA GAACGTGAGC AACGTATTGT GGGCGTACGC GACGATGCAC ACGTCGAAGA AGATAGACGT GGAGCTGGTG CAAAAGTGCG AATATTGGTG CGACCTCATC ATGGACGACT TCGCCCCGCA AGGGATTAGC AACTCTTTGT GGGCGTTCGC CACGCTGGGG TACACGCTTA AACCGGAGAC CATCGCCAAG TTTTCGCAGG CGATCAGGCG ACAGCTGAAA GATTTTAAAT CCATGGAGTT CTCAAACGTC GTTTGGGCGC TGGCGACGAT GAAGACGCAT TTAGACCCAC TAGAAGTCTT CGACGAACTC TTGGACGAGA TGCACGCGAG CATCAAGGCG GTTCCAAACA TGTGGAGCTC ACAGAGCGTG AGTAATACTT TATGGGCGAT CGCCACGCTC GACGGAGAAC CGCACAAATT AAGAGCTCGC CACGGCGATT ACTTGAACAC GCTGTGCATG TACGTAGAGC GTAAGGCAAA CGCGTTTGTT TGTCAAGGTT TGGCGAACAC GCTGTGGGCG CTGGCGACGC TCGAGTACAC GCCTTCGATG AAGATGCTCG AAGCCGCCAC GGCGCGTTGG TCCGCGTTAG CGACGGACGT GTACATCAGT GAGTGTAGCA ATTTGCTTTG GTCGTACGCC AGCCTGCGGT TCAACCCAGG AAATGAAGTG CTCACGCAAG TCGCAGAGTT GTACCTTCGC GTCGGGCAGC ACGACGAAGT GGCGTTGACG CAAGTCTCGA ACACCTTGTG GGCGTGGGCA AATTTCGGTT GGCTTCCCGA GGATCCAAGC ATCGTGGAGT GCGTCCTTCA AGTGGCGATC AAACACTTCA AGAGCGATCC AGATTTGCAA ACGCAGAGCT TGGCGAACAT CTTGTGGTCG TTGGCGACGC TGAGGTTCGT TCCCGGGGAT GAATTTCTCC AAGCCTTTAG AGAGCGCGCG CTCATAGAAT TGCGCGAGGA CGAAAGATTC TCCGATCAAG GGTTGTGCAA CACGGTTTGG GCGTACGGTC AGCTCGGAGT GAATCCAGGG ACGGAGTTAA TGAGTGAAAT CGCGAGTCAG CTGGGCGCTC GCGTGACGAA TTTCCCCACC CAAGGCGTGA CGAATTCGAT TTTAGCCTTT GCCACGCTCG GGTTTTGGCC GGATGAATGG GTCGTAGACA ACTACAGGGC GAAGATCGTG GAAATGTACT ACTCCACCAC GATTTCGGAC ATCGACTTGA CGCAGTTTTT CCAAGCGAAT TACTTGTTTG AAAAGTGTTC GCCCTACGGA CCGCTCGTCA CCGACCCGCA GATGATTGAG GACATGTTAT CGGCGTGGAA GCGCGGATCG AGCAAGGTTG TCATCAGTCA GTTTCATCGC GAGGTGAGCG ATACGCTGAC GAACATGGGC GTGCCACACG AAATTGAATA CATCACCGAA GACGGTTTGT TCTCCCTCGA CATCGCACTC AAGGGTAAAA AGCTCGCCAT CGAGGTGGAC GGTCCGTCGC ACTTTGCGAG AAACATCCAA AACCGCCGCA TGTCGGGGAA GCGACCCGAC GGCACGGGGA CGTATAACAT TCGTTATCAC TACCTCGACA CCAACGGTTG GACCACGGTA TTCATACCGT GGTACGATTG GAAACAGGTG TGCGACGAGG AGTCCGCGAC GAGAACCACC GGCAGACGCG CCGCGTTTTT AGCCAAGACG CTCTACGACG ACGCCGGTCT CACGCTCATG GACGTCGCCT CGGACGAAGA CATGTCCGAC TCCGGCATGT CTGGTTTTCA CATCCGCGCA CTCGCCGACG ACGACGTCTC CGTCGCCCAA GACGGCTCGC GTCTCGTCAT TCAAGGCGCC GAAAGCCAGC GACACCCCGA CGGCACGTTC AAGCCCGAGA TGAAATCCGT CGGCGCCTCC GTCCCCAAAC CTCCCGCCCC CGTTCGACGC ACCGGATACG GCATGCTCGA ACCATCACCG TCAACACCGT CAACACCGTC AACACCGTCA ACACCGTCCC CGCCCCCCCC CGTCGTCGCG CGCGTCGCCG GCGCTCGCGT CGCCGCGCGT CCGCCGTCTC GCGCGTCTCC GCGCGCGCGC GCGCCCGCCG ACGCCGCCGA CGCCCCCCCC GACGACGACG ACCCATCTCC GGCGCGTCGA TCGCGTAAAT CTCTCGCCAC TCAACGCGGC GCGGGCATTC GTCGTCGTCG TCCGCGCGCA CCGCCGAGCG TCGAGTCCGA TTAAACCGCG CGCGCGGCCG C
|
Protein sequence | MDEEEEGEAT TSSSGTGRGG GAARWLMNAE TSGAYGEAAS RGRTGGRGRG GGARGRGAAR SKSADSKKRT RGDKPTNVRQ RERSDMYVKN NQIIAKFDDV EDVLDFASEN LEIMNVVNLA TAAHRVGKLN STRTRNEAGA PATATRHPAV VEDARFRALF EKLREYLVAS QNGSPLGKGL GRFNARELSA ILWGSAHCGI TTSDDDPTVA LVVRRIANLD DDDPPAQNVS NVLWAYATMH TSKKIDVELV QKCEYWCDLI MDDFAPQGIS NSLWAFATLG YTLKPETIAK FSQAIRRQLK DFKSMEFSNV VWALATMKTH LDPLEVFDEL LDEMHASIKA VPNMWSSQSV SNTLWAIATL DGEPHKLRAR HGDYLNTLCM YVERKANAFV CQGLANTLWA LATLEYTPSM KMLEAATARW SALATDVYIS ECSNLLWSYA SLRFNPGNEV LTQVAELYLR VGQHDEVALT QVSNTLWAWA NFGWLPEDPS IVECVLQVAI KHFKSDPDLQ TQSLANILWS LATLRFVPGD EFLQAFRERA LIELREDERF SDQGLCNTVW AYGQLGVNPG TELMSEIASQ LGARVTNFPT QGVTNSILAF ATLGFWPDEW VVDNYRAKIV EMYYSTTISD IDLTQFFQAN YLFEKCSPYG PLVTDPQMIE DMLSAWKRGS SKVVISQFHR EVSDTLTNMG VPHEIEYITE DGLFSLDIAL KGKKLAIEVD GPSHFARNIQ NRRMSGKRPD GTGTYNIRYH YLDTNGWTTV FIPWYDWKQV CDEESATRTT GRRAAFLAKT LYDDAGLTLM DVASDEDMSD SGMSGFHIRA LADDDVSVAQ DGSRLVIQGA ESQRHPDGTF KPEMKSVGAS VPKPPAPVRR TGYGMLEPSP STPSTPSTPS TPSPPPPVVA RVAGARVAAR PPSRASPRAR APADAADAPP DDDDPSPARR SRKSLATQRG AGIRRRRPRA PPSVESD
|
| |