Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_15647 |
Symbol | |
ID | 5002243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | + |
Start bp | 77605 |
End bp | 79395 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | |
GC content | 62% |
IMG OID | 640417664 |
Product | predicted protein |
Protein accession | XP_001418147 |
Protein GI | 145347382 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0712404 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGG CGACGGCGAC GCGCGCGAGC GACGACGCGT TCGAGTACCG CGCGCGGCGA ACGAGCGCGG GGGGGTATTA TCGCCCGACG GCGTCGAGCG ATGGACGCGC GGGCGCGACG TACGAGGTCG CGGAACGCGG CGGCGAGCAC GCGGAGGAAG AGACGAGGAC GCGGAGGTTC GACACGGCGT CGGCGCGCGG CGTGACGACG TTCGACGCGC GGACGTGCGA GGCGATCGAG TTCGTGCCGA TCGACGCGTG GTCGCGGGAT AAGGCGAATT TTGAAAAGTT GCGGTCGATG CGAACGTTTC GACTGCACAG ACTGTGGAAG GCGTTCGCGA CGATGCGCGC GCACGCGCGA AGGAGAAAGT TTCGGCGCGC GCGCGCGCGG TTTAAGGAAT CGTCCGCGAT TTACGGTGAT GCGTTCGCGG GATACGCCGT TCCGACGATG ATGCGCGTGT ACGACGCGTG TCACTCGATC GCGCGAGACG CGCGCGTGTT TCGGCGCGAG GAAGCGCCGG CGACGAAGGT CGAAGAATCG TACGAGTTCG ACGACGAAGC GAATCTCGCG CCGGCGACGT ACGACGCGCG AGCTCTGTTC GAGACTCTGC TACGACTCGC AAACGAAGGC TCGACGCACA TTCGTGACGT CGCGTCGGCG ATTTCCAGCG ATGTGGAAAT CGCCAAAGAC GCGATCGAGC GACGATTTCT CGCCGATTTG GATGAGCGTC TACAGCCCAT GATCGCGGCG ACGAAGCGGT ACCGTCGACG ATCGAGCGGA ACGAACGGTG GAAAGCCAGC GCTCGCGCCC ACGATCGGCG ACGATCGACT TGTGACGGCG TCAGAGTCGA GAGACGCGTA TCCTTACACC GAGCGCGCGC TCGTGGCGGC GCTTCGAATG AAGCTGGAAT CTTTCGAGCG CTCGATTTGG ATGTGTTTTC GAGTCGCGAT TTCGCGCGCG AGAGACGCAT CGCTCGAAGA ACTCACCGCG TACGTGAGGG ATGCGCGTTC GTCGGAGTCA GCGATATTTC AAACCACGTT TGATTTCGAA ACGTCCGCGC TCGTACCGAG CTCGGATGCG TTCGAGCGAG CGATTCGAGA CGGCGCCGCC GCCTGGACGT CGTCCTCGCT CGGCGAGCGC GTCGGCGACG CCGTCGATTT GCGCTCGCTC GAGGATGAAG AGTTTGAAAA TCGCATCGAT AAATTTTGTG ACCTCGTCCG CGACACGTTC GCGAGCGCGC AAGAGGCGTT GCTGGTCGTC GAGACGCGGT TGCGCGAAAA ATCGCCAGAC GCCGTCGCGG ACGCAGACGG AGGCGATGAC GAATCGTCGA TGATTGAACG TTTGAGCGAG ATCGTGGCGC GTTCGAATGC GTTCAAGAAG GAAGTCGACG CGTTGCCAGG ATCGATTCGT CCGAACGACG GCGCGATCGC CGTGGACATC GTATCGCTGA AGCGCGCTTT GAAACCGGTG GCGTCGGCGA CGATTGACTC CGCGTGCCGA ACCGCCGTGG ATTGGGCGGC CGAACGCGCG CAGACGATCG CGCAGAAGTT CACGCGCGTG AAAACCCTCG ATGATAACGA CGATGAGACC AAGTTGAGTA AGATCCGTGA TCTCCGCGAC GAGGCGCGAG AGCTCGAGAA TGTGCATTTG GCGATGAAAC GACTCGGCGC AAACATTCCG GATTTCGACC GCGCGGCGTT CAAAGGCGTC GTGGAGGAGA TCGAAAACGC GCTCGCGGGT GACGACGGTG GCTACGAAAA AGATGAAACA AAGCACGGCG GCGATCGTTG A
|
Protein sequence | MATATATRAS DDAFEYRARR TSAGGYYRPT ASSDGRAGAT YEVAERGGEH AEEETRTRRF DTASARGVTT FDARTCEAIE FVPIDAWSRD KANFEKLRSM RTFRLHRLWK AFATMRAHAR RRKFRRARAR FKESSAIYGD AFAGYAVPTM MRVYDACHSI ARDARVFRRE EAPATKVEES YEFDDEANLA PATYDARALF ETLLRLANEG STHIRDVASA ISSDVEIAKD AIERRFLADL DERLQPMIAA TKRYRRRSSG TNGGKPALAP TIGDDRLVTA SESRDAYPYT ERALVAALRM KLESFERSIW MCFRVAISRA RDASLEELTA YVRDARSSES AIFQTTFDFE TSALVPSSDA FERAIRDGAA AWTSSSLGER VGDAVDLRSL EDEEFENRID KFCDLVRDTF ASAQEALLVV ETRLREKSPD AVADADGGDD ESSMIERLSE IVARSNAFKK EVDALPGSIR PNDGAIAVDI VSLKRALKPV ASATIDSACR TAVDWAAERA QTIAQKFTRV KTLDDNDDET KLSKIRDLRD EARELENVHL AMKRLGANIP DFDRAAFKGV VEEIENALAG DDGGYEKDET KHGGDR
|
| |