Gene Information |
Locus tag | OSTLU_39681 |
Symbol | |
ID | 4999967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 385843 |
End bp | 386853 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | |
GC content | 66% |
IMG OID | 640415388 |
Product | predicted protein |
Protein accession | XP_001415478 |
Protein GI | 145340742 |
COG category | [S] Function unknown |
COG ID | [COG1720] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00104] probable methyltransferase, YaeB/AF_0241 family |
|
|
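The coordinate and length fields above are consistent with each other. A minimal sketch of that check follows, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length (the variable names are illustrative only):

```python
# Values copied from the Gene Information fields of this record.
start_bp = 385843
end_bp = 386853

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values can be compared directly with the Gene Length and Protein Length fields.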
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0134522 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCACG TCGCGCTCGC GCTCGTCGCG CTCGTCGCGC TCGCGCGCGC GACGACGCGG GACGAGGGCG AGGCGCTGCG ACGCGCGGAG GACGCGCGCG CGGCGGAGCG GCGCGGGCGA ACGACGGCGG AGCGCGCGCT GCGCGAGGCG CGCCTGGAGC TCGAGCGCTG CCGACGAGCG ATGGACGCGC GCGGAGGCGC GACGCGAGCG AGGGCGTCGA CGTCGCACGA GATGATGGCG ATGGGGACGT TCGCGAGCGC GTTCGATCGA AGGCTGGGGA CGCCGCGGCA ACCGTCGCTG GTGCCGCTGG CGCGAGGACG CGTCGAACTC GACGCCCGGG TGCCGGCGAG CGCGCTGGAG GGATTGGACG AGTTCACGCA CTGTTGGTTG GTGTACGTGT TTCATGAGAA CACGGATCTG GGCGCGGGGG CGGAGAAAAA TGGGAAAGTG GGCGCGGCGG ATGGGCCGAA ATCGACGCTT CGCGGGAAGA TTCGGGTGCC GAGATTGAAC GGGGAGAAGC GAGGGTGCTT GGCGACACGA ACGCCGCATC GACCGTGTCC GATAGGACTT AGCTTAGTGC GAATCGTTCG CGTGAATGGG AAATCGTTGG ACGTGGCGGG GGCGGACTTA GTGCACGGGA CGCCGGTTTT GGACGTAAAA CCATACGTGC CGTACAGCGA TTGCGTCGTC GACGCTCGCG CGCCGGAGTG GGTCGGAAAC GATCTCGCCG ATGGTGACGG ACCGTTGACC GTAGATGAAG TGAAACTTAC GGACGCCGGC GAGGCTGCTT TGCGCGCGGC GTGGGAGCGT CGGCGCAAAG ATTCACTGTA CGAAAGCGCC GACGAATTCG TCGCGTTTGT GACGCAGGCG CTCGGGCGAG ACATTCGGTC GTATCACCAG CGCTTGGGCG ATGTTGCGCC AGACACGGAT TGGCGCGTAA GTTTAGATGG TGTTATCGTC GTGTACCGTC AGTCAGCGAA GCGCGTGGTC GTCGTCGGGG CTGAAAACTA G
|
Protein sequence | MAHVALALVA LVALARATTR DEGEALRRAE DARAAERRGR TTAERALREA RLELERCRRA MDARGGATRA RASTSHEMMA MGTFASAFDR RLGTPRQPSL VPLARGRVEL DARVPASALE GLDEFTHCWL VYVFHENTDL GAGAEKNGKV GAADGPKSTL RGKIRVPRLN GEKRGCLATR TPHRPCPIGL SLVRIVRVNG KSLDVAGADL VHGTPVLDVK PYVPYSDCVV DARAPEWVGN DLADGDGPLT VDEVKLTDAG EAALRAAWER RRKDSLYESA DEFVAFVTQA LGRDIRSYHQ RLGDVAPDTD WRVSLDGVIV VYRQSAKRVV VVGAEN
|
| |
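The sequences above are printed in space-separated groups of ten letters. A minimal sketch of how such a field might be cleaned, checked for GC content, and translated is given below. It assumes the standard genetic code (NCBI translation table 1), since the Translation table field of this record is empty; the helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1); the record
# itself leaves the "Translation table" field empty.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Drop the whitespace that splits the sequence into 10-letter groups."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (expects a non-empty sequence)."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT letters
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# The first two groups of the gene sequence field, used as a small example.
prefix = clean_sequence("ATGGCGCACG TCGCGCTCGC")
print(gc_fraction(prefix))
print(translate(prefix))
```

Pasting the full Gene sequence field into `clean_sequence` and passing the result to `gc_fraction` and `translate` would allow a comparison with the GC content and Protein sequence fields of the record.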