Gene Information |
Locus tag | OSTLU_23942 |
Symbol | |
ID | 5000000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 839641 |
End bp | 840771 |
Gene Length | 1131 bp |
Protein Length | 346 aa |
Translation table | |
GC content | 69% |
IMG OID | 640415421 |
Product | predicted protein |
Protein accession | XP_001415617 |
Protein GI | 145341026 |
COG category | [R] General function prediction only |
COG ID | [COG0456] Acetyltransferases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0967488 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATCGACCGC GCGACGCGCG CGATGGCGTC GCGCGCGGTG TCGATCGACG CGCGCGCGCG CGGCGACGCG ATGGCGTCGC GCGCGACGCG CGGCGCGACG CGGGACGGCG CGCGACGCGC GACGCGCGGA ACGCAGACGA CGACGACGCG ACGACGCGCG ATGACGACGA CGCGATGCGA TGGGCTCGCG CGGACGGGAC GCGGCGGCGC GACGAGGGAC GCGTCGACGT CGACGATGAT GGGCGCGGGC GCGGGCGCGC GGGGAGGGGT GGCGCGACGC GCGCTGGCGG ATCGACCGAC GCGAGAGGCG GCGGAGGACG AGGACGCGCG CGCGACGGGC GCGAGCCGGG TGAAGTCGAG GGGGGGGGTG GACGTCGTCG TCGCGACGAA TGACTTCGCG TTTGAGATCG CGGCGAATCT CAGGGCGACG GCGTTTTACG ATGATCTCGC GGAGAGGCAC GAGATGCCGT TTCCGCCGCG GTTCACGCCG ACGTTTCATC GGGAGTTCGC GCAGCGCGAA CGCAAGGCGC TGCGGGAGCG GACGACGAGA CGCGTGGGGC CGGCGCTGGA GTCGCGATGT TTCATGGCGG ATTGCGAAGG ATTGGGGTTA GTGGGGTGTT TGGACGTCAG CGTGCGCGAG GGGCCGTGCG CGAGTCAGAT CAACGGCGTG TGCGTGCCGG AGGGGGCGTC GTACGCGTAC GTGGACAACG TGGCGGTGGA CGCCGCGGCT CGTCGACGAG GGTCGGCGAA GCTCATGATG GAGTGCGCGA GCGACTGGGT CGAAGAGCGT GGAATCACGG AAATCTGGAC GCACGTGCAC TGCGATAACG TGGGCGCGCG AAGATTGTAC CACGCGTACG GTTTCCGGGC GCCCAGCGGC TCGCATCCGG AACAAGGCTT GCCGAATTAC TTCAACGGCG AGCGATTGAA GGGCCTAATC TTAATGCGAG CCCCTGTGCC GCTGGTGTAC GAGGCGCGCG TTGACGCGGT TTGCGGATGC GGCGCGTGCT TCGCGCGCGT GGACGAGTGC ATCTGCATCA AACCCGCGGT CGCCGCGCGT TAACGCCGCC TCGAGTCGAG ACTACTTAAA CTTTGTTGTA CTTTTCCTGC CCGCCGTACT ATATGCGCGC T
Protein sequence | MASRAVSIDA RARGDAMASR ATRGATRDGA RRATRGTQTT TTRRRAMTTT RCDGLARTGR GGATRDASTS TMMGAGAGAR GGVARRALAD RPTREAAEDE DARATGASRV KSRGGVDVVV ATNDFAFEIA ANLRATAFYD DLAERHEMPF PPRFTPTFHR EFAQRERKAL RERTTRRVGP ALESRCFMAD CEGLGLVGCL DVSVREGPCA SQINGVCVPE GASYAYVDNV AVDAAARRRG SAKLMMECAS DWVEERGITE IWTHVHCDNV GARRLYHAYG FRAPSGSHPE QGLPNYFNGE RLKGLILMRA PVPLVYEARV DAVCGCGACF ARVDECICIK PAVAAR
|
| |
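The derived fields in this record (Gene Length, GC content) can be rechecked from the raw fields. The sketch below is an illustration only, not the portal's own method. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that GC content is the fraction of G and C bases over the whole Gene sequence field after the grouping spaces are removed. The function names `gene_length` and `gc_content` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring the spaces that group the sequence."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Start bp and End bp taken from the record above.
    print(gene_length(839641, 840771))  # 1131, matching the Gene Length field

    # Paste the full Gene sequence field here to compare with the GC content field;
    # only the first two 10-base groups are shown for brevity.
    fragment = "GATCGACCGC GCGACGCGCG"
    print(f"{100 * gc_content(fragment):.0f}% GC in the fragment")
```

Under the inclusive-coordinate assumption the computed length agrees with the Gene Length field. Note that a 346 aa protein needs only 346 × 3 + 3 = 1041 coding bases including the stop codon, so the 1131 bp sequence evidently includes some flanking non-coding bases.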