Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31751 |
Symbol | |
ID | 5001873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 474258 |
End bp | 476414 |
Gene Length | 2157 bp |
Protein Length | 709 aa |
Translation table | |
GC content | 61% |
IMG OID | 640417294 |
Product | predicted protein |
Protein accession | XP_001417783 |
Protein GI | 145346618 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0545641 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCAAC CGCACGAGTA CGACGACTGG TTCGCGCACG CGGACGCCGA TGGCGACGGC CGCGTCTCCG GCGCCGAGGC CGTGCACTTC TTCATGCGCG CTGGGCTCCC GAAGACCGAT CTCGCCAAGC TCTGGGACGC GGCGGACCAC GAACGGGAGG GATCGTTGGA TCGACGGGCG TTTTCGTTGG CGTGCGCGCT GATAGGAGCG TTGCAGCAGT ACGGAACGAT CACGAGAGAC GTGTTCGATC GCGCGCTTGC GGGAGATACG CGAGGATTTC CAAAGCCGAA GATGCAAGGG TTGGAGTTAC CGGCGGCGCC GACGGCGACG ACGAGTCAAC CGCCGGTTGC CGCGGCGACT GGCGGGACGT TTGGGAACGT TGCGCCGTCG CCCACGATGG AATTTACGTC GCCGCCGAAG GACGATTTAT TCGCTATATC GAGTGGGGTC GATGATTTCG CGCCGGTGGC GCCGGCGCCG ATGGAGCGAG CGCCGGCGCC CGTCGCGCCG ACGAGTCTCG CGTTTGACGC GCCTCCGCGC GCGACGTCGG TAGAGCACAC GGTGCAGGCG TATCAAGCGC CTGCAGTTGC CGTACCCGTG GCGCCGGAGG CGAACGTTGA TTGGCCGGTG ATCGGTCCGA ATGATTGGCA GCGATATCAA CAGATTTTCC TTTCGAACAC GAACGGGAAC CCGGAGGGAC GGTTGAGCGG GCAGCAAGTG GCGCCAATTT TACTCGGTAT GAACGCGCCT AAGCAAGTAT TGAAAGACGT GTGGGAACTT AGTGACTCCG ACAAAGATGG GTCTTTGGTT TGGACAGAAT TCGTCGTGGC GGCTTACCTC ACGGAACAGG CGAGGAACGG CTTGATGCCT CCCAAATCGC TTCCGCCCGG GCAATTTCCA CCATTTAGCA TGACGGCGGG TGAGCAACCC GCGCCAGTAG CTCCCGTCGT TCCAGAAGCC GCGCCAACAT CAGTGCTACA GGTGAACGCG GTCATGACGG ACGGGTTAAT GACACCTAGC ATCGCGCGAG AGCAGTTACA AAACATCACC GCACCCGCGC AAGCGGCACC GCAAGTGAAT GAGGCTTACA CATACAGAGG GCCAATGGCA AACATCGACG CCATCCCTGA ACAAGATCGT GATTTGGCGG GAAAGGTCAA GGAAAACGCG GAGAAGAGCG ATCGGCAATT GTGGGAACAA GAAATGAACG AGCGACAGAA CGTTCTGAGC GCGCACGCGG CTCAAGAAGT GTTGGCAAAT CTCGCATTGT TTGTTCGTAA ATGCGAAGCC GGAATGACAG AAGCGTCCTA CAGGGCACAA GTTGCCGAAT CGCAAGTCAT CGAACTTAGG CAGAAATGTG AGGTGATGGA AGGACGTGTG ACGCAGCTCG TCGAACAACT GGCCGGACCC ATTGAGCGCA TCGAGGCGAG CAAGAAGGAA CACGAGGAAT TGAGTGCGCG ATACCAGCAA CTCGAAGAGC GTCACGCCGA GTTGTCGCAG AACGCGTCGC AGCAAAATCA TTCGCAGATG ATGCAAGATA ACGTGAGTCT GCGTGCGAAA GTCGAGGCGA AATCCACGCA AATAGTGATG GAGGAGACGC GCGCGAGTCA AGCCGCGACA ACGTCGTTGA GCGCTCAGAT GCGAGAAACG CAGCTCACAG CGACGCAACC GCCTGCGACA GCGGCGTTGA TGGATTTCGG CGGCGTTTCG GCGGCGTCCA CAAATATATC ACCGGCCTCA GCCAATGCGA AGAGCACTTT CGAAGAGTGG GACAATTGGG GCACGACGGC GCGCGAAGAG GCGTCGACTC AAACGCGAGC GTCGTCGATG CCCGCGCAGC GGCACCGAAA AGTCCCATCC GAAATCCCCG CCGTCACCGC CGCGCTCATC GAAGGCGACG GCGGCTTTTT CGATAGCATC GACTCCGACC CGTTCGGCCA ATCGAACGAG TCGACGTCGC CCGCTCCCGC CATTCCTCCG TCGTCCTTCG GCGACGACCC CTTCGGCGCG CCGCCGCCGA CGCCGCCCGG CGACTACGAC GACCCTTTCG GCGCGCCACC ATCCCCCGGC CCGACGCCGC CGCAGAGCGC GCGCGCATCT TTCATAGACA TCGATCCATT CGCGATGTGA GTGCTCGCCC TAGCTCGGCG CGAGCGA
|
Protein sequence | MQQPHEYDDW FAHADADGDG RVSGAEAVHF FMRAGLPKTD LAKLWDAADH EREGSLDRRA FSLACALIGA LQQYGTITRD VFDRALAGDT RGFPKPKMQG LELPAAPTAT TSQPPVAAAT GGTFGNVAPS PTMEFTSPPK DDLFAISSGV DDFAPVAPAP MERAPAPVAP TSLAFDAPPR ATSVEHTVQA YQAPAVAVPV APEANVDWPV IGPNDWQRYQ QIFLSNTNGN PEGRLSGQQV APILLGMNAP KQVLKDVWEL SDSDKDGSLV WTEFVVAAYL TEQARNGLMP PKSLPPGQFP PFSMTAGEQP APVAPVVPEA APTSVLQVNA VMTDGLMTPS IAREQLQNIT APAQAAPQVN EAYTYRGPMA NIDAIPEQDR DLAGKVKENA EKSDRQLWEQ EMNERQNVLS AHAAQEVLAN LALFVRKCEA GMTEASYRAQ VAESQVIELR QKCEVMEGRV TQLVEQLAGP IERIEASKKE HEELSARYQQ LEERHAELSQ NASQQNHSQM MQDNVSLRAK VEAKSTQIVM EETRASQAAT TSLSAQMRET QLTATQPPAT AALMDFGGVS AASTNISPAS ANAKSTFEEW DNWGTTAREE ASTQTRASSM PAQRHRKVPS EIPAVTAALI EGDGGFFDSI DSDPFGQSNE STSPAPAIPP SSFGDDPFGA PPPTPPGDYD DPFGAPPSPG PTPPQSARAS FIDIDPFAM
|
| |