Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_26466 |
Symbol | |
ID | 5004589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | + |
Start bp | 276942 |
End bp | 278549 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | |
GC content | 63% |
IMG OID | 640420010 |
Product | predicted protein |
Protein accession | XP_001420293 |
Protein GI | 145351888 |
COG category | [R] General function prediction only |
COG ID | [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.377719 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.17293 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGACG TCGACGCGGA CGTCCTGCCG CTGTTGCTCC GGCTGCGAAG CGCGGGCGCC AGCGCGGAGG GGCTGCGACG AGGCGCGGCG GGCGTGCGGG CGTGGAGGGA CGCCCTCGCG CGAGGGCTGC TGCCGGACGC GTCGCTCGAG TGGCCGGAGG ACGAGACGTT CAGGACGGCG CTGATCGAGG CGCTGGGGGA TTTAGACATG GCCAGGTTCA CGCGACGGTT TCCGCCGGTG CTGGACACGT TGATGAAGAA TGTGTTGGAT ATTTTGTACG TGTACGAACG CGATCGAGAG GACGAAGACG CGACGCCGGA GTTGCCGCCG ACGGAGCCGC GGGATTCGGA GACGGCGAAC GATGGAGAGG GAGAGGGAGA CGCGCGAGGA AGCGGCGCGG GCGAGAGCGA CGAAGAGGAG GGGGAGGGCG AGGCGCGGAG TCAGGCGGGA GGGCGAGGCG GCGCGGGGGA AGAAACCGAC GGGAGCGATA ACGTCGACGA ATTCGACGTG GGGATGGATG GCGACGACGG CGCGAACGAG GCGATGGAGC GGGCGAAGGA GAAGAATAAG GAGATCGTCT CGCGGCTCAT GGAGGAGTTC AAAGAGCAGT GGGAACCGGC GATGGATAAG CTCGACAAGG CGGCCAAGGC ATTCGAAGGA TTAGATTTAG ACGACCTCGC CGACGGCCCG GAAGGCTTCG ACCTCACGCG AGGTCTGTGG CAGCAGACTG GGTGGAAGGA GCTCGATTCG CTTCGCAAAA AGCTGCAAGA CTTGAAAGAG TTGCGCGACA TGGTGCGCAG TTTGGGCCGA GGCAGTGGTC GCGGACCTTT GCGTCGCGCG CCGCGACAAA GAGAGCGCCA AGGATTCCCC ATAGGTCTCG TGCGAAGTCC GATGGAGCCC GAACAAACAT CCGGTTTGTG CCGCTCGGAC GATTTGTCCC GCATGATGCC GAGCGAGATG GTGCTACTCG CGTCGAGTCT CCCGCAAGCG CGTCTTTTGC ACTTTGCGCG TCGCGCCGAG CGCACTTTGC TGTCGTATGA GCGCGTGGGG TGGTCGGAAG AACCCGCGGT GACTGTAGAG GGCTTCGAGA CGCGCCCCGC GGCGGAGTGC GGGCCAATCA TCGTGTGCCT GGACACCTCG GGGTCGATGA TGGGCGCTCG CGAGACCGTC GCCAAAGCCA TGGTTCTCGA GTGCATGCGG CAAAGTCGCT CGCAGCAGCG CGCGTGTTAT TTATATTCTT TTAGTGGCCC AGGAGATTGC CAAGAGCTCG AGCTCAAGCT CAACGCCGCC GGTCTCTACG GTTTGTTGGA ATTTCTCAGC GGTAGCTTCC ACGGCGGCAC CGACGTCGAC GAGCCATTCA ATCGCGCGCT CGCTCGGTTA AACGAGGCCG AATGGAGCAA CGCTGATATA TTGCTCGTCA CCGACGGGGA AATCAAACCT CCCGACGAAA CCTTGATCGC CAATCTCAAC GAGGCAAAGG AAGAGATGGG ATTAAAGGTG CACGGTTTGC TCGTCGGCGA CGCCGGCAAC GCCGAAGTCG TGGAATCGAT TTGCACTCAC GTACACGCGT TCAAGTCCTG GACCGCGGTC GGCGGCAAGC CATCGTAA
|
Protein sequence | MRDVDADVLP LLLRLRSAGA SAEGLRRGAA GVRAWRDALA RGLLPDASLE WPEDETFRTA LIEALGDLDM ARFTRRFPPV LDTLMKNVLD ILYVYERDRE DEDATPELPP TEPRDSETAN DGEGEGDARG SGAGESDEEE GEGEARSQAG GRGGAGEETD GSDNVDEFDV GMDGDDGANE AMERAKEKNK EIVSRLMEEF KEQWEPAMDK LDKAAKAFEG LDLDDLADGP EGFDLTRGLW QQTGWKELDS LRKKLQDLKE LRDMVRSLGR GSGRGPLRRA PRQRERQGFP IGLVRSPMEP EQTSGLCRSD DLSRMMPSEM VLLASSLPQA RLLHFARRAE RTLLSYERVG WSEEPAVTVE GFETRPAAEC GPIIVCLDTS GSMMGARETV AKAMVLECMR QSRSQQRACY LYSFSGPGDC QELELKLNAA GLYGLLEFLS GSFHGGTDVD EPFNRALARL NEAEWSNADI LLVTDGEIKP PDETLIANLN EAKEEMGLKV HGLLVGDAGN AEVVESICTH VHAFKSWTAV GGKPS
|
| |