Gene Information |
Locus tag | OSTLU_88051 |
Symbol | |
ID | 5003230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 476021 |
End bp | 477190 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | |
GC content | 59% |
IMG OID | 640418651 |
Product | predicted protein |
Protein accession | XP_001419392 |
Protein GI | 145349957 |
COG category | [R] General function prediction only |
COG ID | [COG0724] RNA-binding proteins (RRM domain) |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.00743444 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCGCCGC GTGCGCCGAC GCCCGCGGCG GAAGTCGTTG GGAATTCGTT TGTGAACCAG TTTTACACGA TTTTGCACAC GTCGCCGGCG GTGCTGTATC GGTTTTACAC GAATGATTCG ACGCTCATCG TGAGTGGAGA GCACGGTGCG GCGAGTGATG CGCCAACGAC GTATCGTACG CAACGGGACA TTCACAACAA GGTTGTGAGC ATGCGGTACG ACGAGACCCA GGCGGATGTG AAATCGATCG ACGCCTCGCA CACGCTGGGC GGCGGCGTGC TGGTGCAAGT GACTGGCGCG TTGCGACGAA AAGGTGATGA TTTTGCGCGC AATTTTGTGC AGTCTTTCTT GCTTGCGCCT CAGGAAAACG GTTTCTTCGT GTTGAACGAC ATTGTTCGAT ACTTGGACAA GGTCGACACT TCGGGGGAGA AGGCGCCCAA GGAGGCGAAG ACGAGCGCCA AGCAGCAAGA CGTCAAGGGG GAGTCGAAGA CTAAGGCGGC GGAGGTCAAG TCGACGAAGA AGGAGAGTGG TGATAACAAG GCGAAGGGTG ACTCCAAGTC AACCGAGGAC GAGGACGCGG GTGAAGTGGA TCCGAGCAAG CCGAGAACGT ACGCGATGAT GGCGGCTTCG GCGGCGGCGG CGGCGGCGGC GGCCAAGCCC ACGGCAGCGA CAGCCAAGCC CACGGCGGCG ACGGCGGCGA CGATCTCACC GATGACCTCG CCGAGCGCTA CCTCGCCGAG TTCCAAGTCA CCGGATAAGG CGGAGCAAAC GGTCGCGGCG ACCAAGCCCG GATGCGGTAT CTTCATCAAG AACATCTTCA TCGAATCGAC GGTGGAGGAT CTTGAGCGCG AGTTCAGCAA GTTTGGTGTC GTCCTCGGTG GTGCCAAGGG CATCAATCTG AAGGCACCGA AATTGTCGCA TGAAACCAAG TTTGCTTTCA TCGACTTCGA TGAGCCAGCA TCGGCGCAGG CGGCTTTAGA GGCGACAATC GAGCTTCACG GCAAGATTTT AGTGGTTGAG ATGAAAAAGG CTTCAGTCGT CAATGCCAAG GGCGTTGGCG CGAACGGGAA GAAGGAGTCG AAGGGACGAG AGGGGGGGGT CGAGCGCAAG GGTAGCGCTA AGAATGGACG GCAACCGAAG AAGACTGAAG GTGCCGGTTC CAACAAATGA
Protein sequence | MAPRAPTPAA EVVGNSFVNQ FYTILHTSPA VLYRFYTNDS TLIVSGEHGA ASDAPTTYRT QRDIHNKVVS MRYDETQADV KSIDASHTLG GGVLVQVTGA LRRKGDDFAR NFVQSFLLAP QENGFFVLND IVRYLDKVDT SGEKAPKEAK TSAKQQDVKG ESKTKAAEVK STKKESGDNK AKGDSKSTED EDAGEVDPSK PRTYAMMAAS AAAAAAAAKP TAATAKPTAA TAATISPMTS PSATSPSSKS PDKAEQTVAA TKPGCGIFIK NIFIESTVED LEREFSKFGV VLGGAKGINL KAPKLSHETK FAFIDFDEPA SAQAALEATI ELHGKILVVE MKKASVVNAK GVGANGKKES KGREGGVERK GSAKNGRQPK KTEGAGSNK
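
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: the helper names `span_length` and `expected_protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon (the gene sequence above ends in TGA and the gene is listed as unspliced).

```python
# Values copied from the record above.
START_BP = 476021
END_BP = 477190
GENE_LENGTH_BP = 1170
PROTEIN_LENGTH_AA = 389


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    # Both checks pass silently when the record is internally consistent.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the reported gene length (1170 bp) and protein length (389 aa) agree with the coordinates.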