Gene Information |
Locus tag | OSTLU_32661 |
Symbol | |
ID | 5002691 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | + |
Start bp | 439246 |
End bp | 441305 |
Gene Length | 2060 bp |
Protein Length | 595 aa |
Translation table | |
GC content | 59% |
IMG OID | 640418112 |
Product | predicted protein |
Protein accession | XP_001418705 |
Protein GI | 145348541 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.373889 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGCGACGCGC GAACGAGCGA CGTCGGCGCG CGCGCGCGAG GACCGCGCGC GACGAATCGA CGCGCCCGAG CGACGGTAGA CGTCGAACGA GACGCGCGCG GAACGACGAC GCGCGCATTC CTTTGACGCG GGAAGAGGAC GTTTTTGAAA TTTAAACGCG GCGCGACGCG GCGACGGCGG CGCGATGAGC GAGAACGACA TGGCGGTCGC GAACGCGCCG GTGGACGCGT CGTGCGCGAA GAGTGCGATC GAGGGCGCGA GCGGCGACGC GCGAGCGGCG GCGGCGGTGC CGGCGTGGAT GGCGGATGAC GACGACGACG GGTTGACGGA GGAACAGCGA AGGGTGATCG CGGAGCGGGT GGCGGCGTTG CAGAATCGAT TTAAGAAGTT TGAGGAGAAA ACGACGCGGG AGAAGGTGGA GGAATTGATG CTCGGGAACG CGGATAAGGG GTTGACGGAA CGAGAGGCGG ACATGGTGCT GAGAGTGTGT AACGGGAACG AGTTCGAGGC GAGCGACCGC CTCGGCGAGG CGGAGGATGG GGAATCGTTT TTGATGTCTA TTCGGTTCAT GATTCAGGAA GAGGACGCGC TGAAACGACG TTCGCAGAGC TCAAAAGCGG CACAGACGGC GAACGAGCGG TTTAACAAGC GCATCGAGCG CTTGCGTAAG CGTCGGATGT CGTTCGGCGG CGACGAAGAG GACGAGGAAG ACGCGGGCGA GCCGGAGGAG TTCTTCGAGA CGGAGTACAT TTTAGAAGAG AGTCTCGGCG TACAATTCGT GCGTCACACG AAGAAGCACG TGAATGTGAA ACGTTTGCGT CTAGACGACG CCATCGCGCG CGCGCAAGTG GCGGAAGAGC TCGCGGAGCC GGAGGTCAAC CCGTACGAAG GATGGTCCGA GGCGCGTATC AAGGCGTGGA AGAATCGCGA AAAGAATGAA AATCAATACT ATTATCGATT CAACGAACCG GGAGAGATGC AAGCGAACGG GAAATGGACG GACGAAGAAC ACCAAAAGTT CATGGATATC ATCGCCTCTC TTCCTGGCGG CAAGGCGAAC TACGAATGGG GCACGTTTTC CAAAGGTATC CCAGGCCGCG TCGGTTACCA GTGCTCGAAT TATTACCGAA CCTTGGTCAA AAACAATATC ATCCAAGACG AAAACTATAT GGTGGACCCG GTGTCGGGTG ACCTGCGATT CAACTTCAAG AACAAGGGCT TCACGCGCGC CGAACCCGTC GAGGGTGAAC CGCGCATGGT CATGGTTAAG AAGGTCATCA AGCAGAAGAA ACCGAAGGCC CCGAAGCAGC CGAAGAAGGC GAAGAAGAAG GTGAAGAAAT CGGACGACAA GGCGTTCCAC TGCACCATCA GAGCGGGTCC CGTGCGTTCG AGCGGTCGAG CGACTAAGAA GACGTACACC GACGGCGGCG ACGACGACGA GGATTTCGAC GATGAATTCG AGGAACTTCC AGTACTCCCG GGATTCATCG ATCCCATCAC TCGCATGGCC ATCGAAGAAC CGGCGATTTC GCCGTACGGC CACGTCGCAG GTTACGAGAC GTGGTGCAAG ATTCTTCGCC AACCCGAGGC ACCGGACACG TGTCCGTTCA CGAAGCAACC GTTAAAACGT CGCGAGCTGA TCAAGCTCAC GCACGAAAAC ATCGACCAAT ACCGCTCGAA GATCGTCGAA AACCAACAGA CGTAGATTCC TCACCTTGTG ATGAATACCT TTTGTACCTC ACTTCCGTCC GCTCGCCATG GCATCATCCC GCGCCGCCGC GCCCCCGAGT GGAGAAAACG 
CGCAAGTGCG CCATCTCGGG CGCGGCAACC GCGTGCGCAC GCCTACGCAT CCGACGAAGG GCGACTATTT GAGCATCAAG ACCGAAGAAG TCATCTTCGG CATCTTGTTC GCGTTCATGG CGTGGAAACT GTGGCAGAGC GTGCGCGGTT TATTTAGATA CTGGAAGTAC GTGCGCGCGC CGAACGCGCC GGAGACAGAG CTCGAGGCAG GGGCGATCGC GTTCGAGGAA GATGACGATG AGAACGAAAA GATTGATTGA
|
Protein sequence | MSENDMAVAN APVDASCAKS AIEGASGDAR AAAAVPAWMA DDDDDGLTEE QRRVIAERVA ALQNRFKKFE EKTTREKVEE LMLGNADKGL TEREADMVLR VCNGNEFEAS DRLGEAEDGE SFLMSIRFMI QEEDALKRRS QSSKAAQTAN ERFNKRIERL RKRRMSFGGD EEDEEDAGEP EEFFETEYIL EESLGVQFVR HTKKHVNVKR LRLDDAIARA QVAEELAEPE VNPYEGWSEA RIKAWKNREK NENQYYYRFN EPGEMQANGK WTDEEHQKFM DIIASLPGGK ANYEWGTFSK GIPGRVGYQC SNYYRTLVKN NIIQDENYMV DPVSGDLRFN FKNKGFTRAE PVEGEPRMVM VKKVIKQKKP KAPKQPKKAK KKVKKSDDKA FHCTIRAGPV RSSGRATKKT YTDGGDDDED FDDEFEELPV LPGFIDPITR MAIEEPAISP YGHVAGYETW CKILRQPEAP DTCPFTKQPL KRRELIKLTH ENIDQYRSKI VENQQTGENA QVRHLGRGNR VRTPTHPTKG DYLSIKTEEV IFGILFAFMA WKLWQSVRGL FRYWKYVRAP NAPETELEAG AIAFEEDDDE NEKID
|