Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_119558 |
Symbol | Cup67 |
ID | 5000285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 548446 |
End bp | 550239 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | |
GC content | 52% |
IMG OID | 640415706 |
Product | hypothetical protein |
Protein accession | XP_001416176 |
Protein GI | 145342263 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCGCA CGTGCGTGTG CACGAACCCG AACTGCGACA AAGATGTCGA GCGATTCGCG TTCCAACTCG GCAAGCACTG GAAAGACGAC GTGCTCAAAT CGATTCTATC GAGTCTCAGA CCCGGAGCGG ATGCTGATGA CGTAGAAAAA TACTTTTTGC GCGCGATGAC GAAGCCCGAT GGTGTTCGAG TGTCCTCGCA GCACTTCTTC TGGTACCATT TAGCATGGTC TCGAGGAGAC CGCCCGAAGC TGGTCTTCAA AGATGGCGCG CACGTCCGGC CGACGTACAT GCTCCATTCG AATCGGCCGA CGTACGCGGG CGAACGAATC GTCGATCCGG CTCGTGTAGA ATGTTTATCA CCGCGCGAGA CGCGTCTGCG CGTGAAGGCG CCGGCACGGG CGCCCCTTTC AGAGATTTCA AATTTGAATG AGGACGACGT GCATGCGACA CAAGCTGGTG TCAGAGTTGG GATTAAACGG CGACGATCTG CTCCTGCGGC AATTTGCTCG CACAATCGCG CGCAAAACAT GTCTTTGAAC TCGTTTGAAC GGGGAAGGCC GGGGGAAGAG ACGTGCCGCG TGCAAGAAGG GACTATCCAC AAGTTACGCT GTGATGAGTT GAAGTTGAGC GAGGCGCTGC GGGAAGAGCG TCAGCACGCA GACGACGCGC GCGAGGACGT TCGTTTGCTC ACGGAAGAAC TTGAACGTAT TCAAGAAGAA ATGCACGACC AGGATCGAAA ATCCTCGTTC GTAACTATGC GATACTTGCG GGCGACGAGA GCTAGTGATA AGCAAATCAA AGCGCTTACT GGGTGGCCAA CTTTGAAAAC ATTCGATGCT TTGTGGCGTT TCGTCAACGC GGGCGACCGC AATGCGACGC AAAACTTGTC GATTGTCAAA CGAAAATGGA GGGGCGATTC GACGAGCGCG ACGGGTTCGC ATGGAACGGT CGTTCCGCGG AAAAAGAGCT TCAAATGGGA AGATGTATTT TTGGCGTGGT GGATGAGAGC GAAGATGGAC TTGACCGCGG ATCAACTTGC ATACTTCGTC GGAATACACA AATCGACGCT TACACGTGCG CTACATCAAA TGACATCATT TTTAACGGTT TTTATACGTG CTTACGGTGG CGACCCGATC GATCCCGAAT GGATCCAGAT GACAACAGGG GAGGATTGGA AAGAAGCATA CGGGCAGAAA CCTTATGAGA TTATCGACGT GACTGAAGTG CCCATGGAGA CTCCAGATGA CCCATATATG CAGCGTTTGT TTTACAGTCA TTATAAGAAG CGCTACGTGT GCAAATTTCT TGGCGCCTGT CACTCAAATT GTCTTGCCGC TTTCTGTTCA GTTGGGTTCC CTGGATCCAT TTTTGACAAT GAAATTTGCT TTGCTGGAAA GGATCATCCG ATGCTTAATC ATGCGGTACA CCGAGCCAGC ATCCGCCGGA GGACGTTCAC AGACGAGAAC GACGAAAATG TAGTCGTTCC CGAAACAATT GCAGCTGATC GTGGCTTTTG CGAAGGGTTG AATAGACAGT TTGAACGGGA TGCGTTCAGG CTTGTGCACC CTGCGTTTTC AAGAGGCAAG AATGTGGCCT TCTCCAAGGA GGACATTCTC GACACCAGGG CGCAAGCATG TCGCCGAGTG CTAATTGAAA ATGCTTTCGG TCGCGTCAAG ACGATTTGGA AGCGTATGCG CGGTCCAATA GCTATCTCCA AGCTGAAATC GGTGCATCAG GAGTGGCCAA TAATATTTTT CCTAGCGCTA GATTTACAAG CTTCGCTAAG GTAG
|
Protein sequence | MPRTCVCTNP NCDKDVERFA FQLGKHWKDD VLKSILSSLR PGADADDVEK YFLRAMTKPD GVRVSSQHFF WYHLAWSRGD RPKLVFKDGA HVRPTYMLHS NRPTYAGERI VDPARVECLS PRETRLRVKA PARAPLSEIS NLNEDDVHAT QAGVRVGIKR RRSAPAAICS HNRAQNMSLN SFERGRPGEE TCRVQEGTIH KLRCDELKLS EALREERQHA DDAREDVRLL TEELERIQEE MHDQDRKSSF VTMRYLRATR ASDKQIKALT GWPTLKTFDA LWRFVNAGDR NATQNLSIVK RKWRGDSTSA TGSHGTVVPR KKSFKWEDVF LAWWMRAKMD LTADQLAYFV GIHKSTLTRA LHQMTSFLTV FIRAYGGDPI DPEWIQMTTG EDWKEAYGQK PYEIIDVTEV PMETPDDPYM QRLFYSHYKK RYVCKFLGAC HSNCLAAFCS VGFPGSIFDN EICFAGKDHP MLNHAVHRAS IRRRTFTDEN DENVVVPETI AADRGFCEGL NRQFERDAFR LVHPAFSRGK NVAFSKEDIL DTRAQACRRV LIENAFGRVK TIWKRMRGPI AISKLKSVHQ EWPIIFFLAL DLQASLR
|
| |