Gene Information |
Locus tag | OSTLU_28809 |
Symbol | |
ID | 4999764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 344867 |
End bp | 346033 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | |
GC content | 64% |
IMG OID | 640415185 |
Product | predicted protein |
Protein accession | XP_001415461 |
Protein GI | 145340706 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1957] Inosine-uridine nucleoside N-ribohydrolase |
TIGRFAM ID | |
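The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption that matches the listed 1167 bp), that the gene is unspliced (as the record states), and that the coding sequence includes its stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that includes its stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(344867, 346033)
print(length, protein_length_aa(length))  # prints "1167 388"
```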
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00340732 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTGGGTGG ATTGCGACCC GGGACACGAC GACGCGTTCG CGCTGTACCT CGCGCTGCAC GGGACGGCGG AGCGCGCGCG CGTCTTGGTC GGCGCGTCCG CGACGCACGG GAACGCGAGC GCGGGAAAGA CGACGGTGAA CGCGCTGCGC GCGCTGGCGT GGATCGGCGC GCGGGCGTCG CGAGACGGGG GGGAAGAAGG GGGGCGCGGG ACGCGGGACG CGCGACGCGT GGCGTTCGCG GCGGGCGCCG AGAGGCCGTT GCTTCGCGCG TCGAGCGCGT GCGAGGAGAT TCACGGCGAG AGCGGGTTGG ATTCGGCGAG CGCGACGGAT GCGCTGGAGG AGTACGCGTG GGGAGAGGAC GAGAGAGATT GGATGGGGGC GGGCGGGATC GAGGCGCTGT TTCATCCGCG CGCGAAGACG GCGGCGGAGG CGATGTACGA CGCGTTCGTG CGGCACGTCG AAACTTCGGG AAGAGGACGG CCGTTCGTCG TCGTCGCCAC GGGGCCGTTG ACGAACGTGG CGACGATGAT TCTCGCCAAG CGTTCGCAGA TTGATTTATT TCCCGACGAA TGTCGACCCG TAATTTTTTG CATGGGCGGC GCCGTCGGCG ACGGCAACAC CGGCGCTCGA GCGGAGTTCA ACATTCAGTG CGATCCCGAA GCGGCGAAAA TAGTCTTCGA GAGCGGTTTG CGAGTCTACA TGATACCGCT CGAGGTGACG CACACGGCGA TCGTCACCCC GAGCGTTCTC GACAGCTTGA CCACCGGTGG TCATTTCGAC TCCGGCAAAG CGGGCGCGAG CGCGCACGCC AAGCAAATCA GATCGCTCTT GACATTCTTC AAGGACACGT ACGAGAACGT CTTCGATTTC AAGACGGGAC CGCCTTTACA CGACCCCTGC GCCGTGTGGG CTGCGATTAA TTTTATCGAG GGCGCCACGT ACGATGAATC CGAAACGGAC GATAGATTTC GAGACTTGTT CGAATTCACG CACGAACGCG TCGACGTCGA GTGCGAGTCG CGACTGACGT ACGGCCAAAC CGTCATCGAC CGCTGGGGGA CGAGCGCCGA GCCGAAGAAC GTGTACGTCG CGAGAAGCAT GAACGTCGAT CGCTTTTGGG AAGCCATGCG CGACGCGATT CAGCACCGTT TGAGCTGGAT GTCTTAG
Protein sequence | MWVDCDPGHD DAFALYLALH GTAERARVLV GASATHGNAS AGKTTVNALR ALAWIGARAS RDGGEEGGRG TRDARRVAFA AGAERPLLRA SSACEEIHGE SGLDSASATD ALEEYAWGED ERDWMGAGGI EALFHPRAKT AAEAMYDAFV RHVETSGRGR PFVVVATGPL TNVATMILAK RSQIDLFPDE CRPVIFCMGG AVGDGNTGAR AEFNIQCDPE AAKIVFESGL RVYMIPLEVT HTAIVTPSVL DSLTTGGHFD SGKAGASAHA KQIRSLLTFF KDTYENVFDF KTGPPLHDPC AVWAAINFIE GATYDESETD DRFRDLFEFT HERVDVECES RLTYGQTVID RWGTSAEPKN VYVARSMNVD RFWEAMRDAI QHRLSWMS
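The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for stripping that layout, computing GC content, and translating the gene sequence follows. The record leaves the translation table field blank, so the standard genetic code (NCBI table 1) is an assumption here; the helper names are hypothetical. Only the first three blocks of the gene sequence are used in the example.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), assumed because the
# record's "Translation table" field is empty. Codons are ordered with
# bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


prefix = "ATGTGGGTGG ATTGCGACCC GGGACACGAC"
print(translate(prefix))  # prints "MWVDCDPGHD"
```

Applied to the full gene sequence, `gc_fraction` can be compared against the GC content field, and `translate` against the protein sequence listed in the record.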