Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_16081 |
Symbol | |
ID | 5002820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | - |
Start bp | 256834 |
End bp | 258477 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | |
GC content | 60% |
IMG OID | 640418241 |
Product | predicted protein |
Protein accession | XP_001418886 |
Protein GI | 145348911 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.355466 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGAC GGGCGGAGGA GGAGACGTTT AGGCCGCGAA CGCACGGCGA GGGGAGACGA GACGCGGGAA AGCGGGCGTC GTTGGAGAAG TTGGCGGCGC CGAGGACGGC GTTGTGGGAA CGATCGGCGA GCGTGAAGAA GGAGAAAGAT GAGGCTGTGT TTGCGGAGAA TTGCACGTTC GCACCGAAGG TTGGTCGAGG GCCGAAGACG CCGTCGACGA AACCGGCGGC GGAGCGATTG TACGAGTACG CGGAGAAGCG ATTAGAGACT AGGGAGCGCG TACAGGCGCG CGTGGTGGAG GAAGAGATGG AACTTTTGAC GTTCAAGCCG ACAGTTAACG TCAGGACGAG CGCTGGGATT CGAGACAAAG TCGCCAAGAC GCCGCCGTTG CACCGGCGCG TCGCCGACGT GTTGCGCGCC AAAGAAAACG TGAGAACAGA GGCTCGTTTA AAGGTGGAAG ACGAACTCGC CAAGGCGCAC ACGTTCAAGC CCACGATCAA TCCGACGAGT GTGATTCTCG CCATGCAGCG TGCCGAGATT CAGAAGGCTA TGGAAGATGA TGACGACGCC GGCGACGAAG CGCCAACGAT TCACCGCAGA CGATCGTCGG CGTTGGACGG CGAGGATGAA AATCTCACTT TTGCGCCTAA AATCACGCGC GAAAGCGAGC GCGTGGTCGA CGAACTTGAG CGTCAAGGCA AACTAGGCGC CGGCTTCCTT GAACGACAGC GTGACTTCAG CGAAAAAGTC GCGCGGCGCG CCCAGGAAAA GCGCGCTATG GTCGACGACG AATGCACATT TATGCCAGAT ATCGGCAACG CCGCCAGCGT GTTGCGCCGA GGGCGGCACG TGTACAAGTT GCTCGAAACG CCCGAAGAGC GCTCAGATCG ATTGGCGGTG AAAGACGCCG AGCGTAAGCG TGCCGCGCAA CGCGTCCGCG AGCGAGAGCA CTACGCGCAG TTTACGTACC AACCCGAGTT GAATGAAAAG TCGTACGAGC TCGCGCCTCA CGGTAGTACG ATCGACGATT TGGCGCGCGA CGAGCGACGG GACTTGGCGC GACGACGCGC GCAAGCCGAG CTCGAACGTG AGTTCCGAGA ACAGCACACG TTCGAACCAA ACCTCGATCG GTCGAATGAG GCCCGAAAGG CTCGCGATAC GAGTCAGTTC GCGATGGATT ACGGCGTCGG CGGCGATGCG GTGAGCGCTC GCATCGAAGC ATACCGGCAC GAAAAAGAAA CTGCGCTCGA AAATCTCCGT AGGCGCGCCG AGTATCGCGA GTTGGAGCAG TGTACGTTTC GTCCCGAGTC CATCGCGCGA GAGCCTCGCG CCATGGGCTC CCCGTCCGCG TCGAAAGTCA AAGGTATGGA TTCGTTTTTG CGCAAGCAAG CCAAGGCGCG CGAGCTCGAA GAAGAAAAGC GCGAGCGATA CGCCAAGGCT TTCCTCGAGA ACTTGGACGA TTTCGACCGA TGGGGCCGAC GGACGATTCC CGAGCCGTTC ACCGGCGCCT TCGCCGAAAA CGTCGTCGAA AAGGCTGAGG CTCGACGCAA AGCGTTGGCG GAGGAGCACT TGCGGCGCGA GTTGGAGGAG TGCACGTGGG AACCGGCGAC GAATCATTCT CGCAAGTCTA CTAGTATTAA ATAG
|
Protein sequence | MARRAEEETF RPRTHGEGRR DAGKRASLEK LAAPRTALWE RSASVKKEKD EAVFAENCTF APKVGRGPKT PSTKPAAERL YEYAEKRLET RERVQARVVE EEMELLTFKP TVNVRTSAGI RDKVAKTPPL HRRVADVLRA KENVRTEARL KVEDELAKAH TFKPTINPTS VILAMQRAEI QKAMEDDDDA GDEAPTIHRR RSSALDGEDE NLTFAPKITR ESERVVDELE RQGKLGAGFL ERQRDFSEKV ARRAQEKRAM VDDECTFMPD IGNAASVLRR GRHVYKLLET PEERSDRLAV KDAERKRAAQ RVREREHYAQ FTYQPELNEK SYELAPHGST IDDLARDERR DLARRRAQAE LEREFREQHT FEPNLDRSNE ARKARDTSQF AMDYGVGGDA VSARIEAYRH EKETALENLR RRAEYRELEQ CTFRPESIAR EPRAMGSPSA SKVKGMDSFL RKQAKARELE EEKRERYAKA FLENLDDFDR WGRRTIPEPF TGAFAENVVE KAEARRKALA EEHLRRELEE CTWEPATNHS RKSTSIK
|
| |