Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_14418 |
Symbol | |
ID | 5000860 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 37124 |
End bp | 40465 |
Gene Length | 3342 bp |
Protein Length | 1091 aa |
Translation table | |
GC content | 57% |
IMG OID | 640416281 |
Product | predicted protein |
Protein accession | XP_001416533 |
Protein GI | 145344012 |
COG category | [A] RNA processing and modification |
COG ID | [COG5181] U2 snRNP spliceosome subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGAGG CGGCGGTGGA GCGAGAGCGC GACAACACGC TGCGAAACAT TGAAAAGAAG CAACGAGAAG AGGCGGAACG CGCGGCGGAC GAGGCGCGAC TCGCCGCGGC GCGAGCGAGC GAACCCGCGG CATCGTCGGC GAGGCCAGAG AAGCGCAAGC GACGATGGGA CGCGAAACCG GACGACAGCG CGGCGCGAAG CGGGCAGGCG CCGGCGCCGG CGCGCGTGAG CGAGTGGGAA AGCGACGACA GCGCGGCGAC TGGGACGAAG TGGGACGACG CCACGCCGCA GCGCCCATCA GAGTGGGAAA CGATGGAAAG CGGCACGCGA GCGGCGGACG TCAAACCGAC GCCGCGACGC TCGAGAAGCC GTTGGGACGA GACGCCGATG ATTCGCGCGG GTGGCGATCC GAGCGCCACG CCGGCGTGGA CGGGTGGCGA GACGCCCGTC ATCGCCGCCG GTGGGGAGAC GCCGAAAATC ACGGCTGGTA TGGCGACGCC ATCTGCAGCG CAAATCGCCG CGCACGCCGC CATGCAATCG AACGTTCCTT TGACGCCCGA GCAGTATCAG CAGATGCGTT TTCAACGGGA AATCGAAGAG CGCAACCGAC CGCAGACGGA TGAGGAACTC GACGAGTTGT TGCCGTCCGA AGGGTACAAG ATTCTCGAGC CACCGGCGAG CTACGTGCCC ATTCGCACGC CTGCGCGTAA GCTCATGCAA ACGCCGATGC CGTACGGCTC AAACGCCGGC TTTTTCAGCA TTCCCGAGGA GGATCGCGGG CAAAAATTTG ACGTCGCCCT CGTCCCCGAA GGTTTGCCGG AGATGAAACC CGAGGACGTG CAGTACTTTG CACCTTTGTT GAAAGAAACG GACGAGGAAG CCTTGACGAT CGAAGAGCAG AAGGAGCGCA AGATTATGCG TTTGTTACTT CGAGTCAAGA ACGGGACGCC GCAGCAGCGC AAGACGTCGC TTCGTCAAAT CACCGATCGC GCCAAAGAGT TCGGCGCCGG ACCGTTGTTC AACCAGATTT TACCTTTGCT CATGTCGCCG ACGTTGGAAG ATCAAGAGCG TCACTTACTC GTCAAGGTGA TTGATCGCAT CTTGTACAAG CTCGACGACT TGGTGCGCCC CTACGTGCAC AAGATTCTCG TCGTCATCGA ACCTTTGTTG ATTGATGAGG ACTACTACGC CCGCGTCGAG GGGCGGGAAA TCATTAGTAA CGTCGCAAAG GCTGCAGGAT TAGCGACGAT GATCGCCGCT ATGCGCCCTG ACATTGATAA CGTGGATGAG TACGTGCGGA ACACGACGGC GAGAGCGTTC GCCGTCGTCG CGCAAGCGCT CGGCGTGCAG TCTTTGCTTC CTTTCTTGAA GGCGGTGTGC CAAAGTAAGA AGAGTTGGCA AGCGCGTCAC ACGGGCATCA AAATCGTTCA GCAAATCGCA ATCCTTCACG GGTGCGCCGT GCTGCCGCAC CTCAAATCTT TGGTGGACAT CATTGAAAAC GGTTTGGGCG ATGAAAACCA AAAGGTGCGC ACCATCACCG CGCTTTCCAT CGCCGCGCTC GCTGAGGCTG CGACGCCGTA CGGTATCGAG TCGTTCGATA ACGTTCTCAA ACCGTTGTGG AAAGGCATTC GCGCACACCG AGGGAAAGTG TTAGCCGCGT TCCTCAAGGC CATCGGTTTC ATCATTCCGC TCATGGACGC GATGTACGCC AACTATTACA CCCGCGAGGT CATGGTGATT CTGATTCGTG AATTCGCTAC TGCGGACGAA GAGATGAAGA AAATTACGTT GAAAGTTGTC AAGCAGTGCG TAGCCACCGA CGGGGTCGAG CCAGAGTACA TTCGAGCCGA AGTCATGCCT GAATTCTTCA AACATTTCTG GGTTCGACGC ATGGCTCTCG ATCGACGCAA CTATCAACAG CTCGTGGAGA CGACGTTGGA GGTTTCCTTA AAAGTCGGTG CCGCGGAAAT CATCGGGCGA ATCGTGGAAG ATTTGAAGGA TGAGTCCGAG CCTTATCGCC GTATGGTGAT GGAAACAATC ACCAAGGTGA TTGAAGAGCT TGGTACGGCG GACGTGGACA CTCGCATGGA GGAACTCCTG ATTGACGGCA TGCTGTACGC CTTTCAAGAG CAAACGTCGG ATGAGAATGA TATCATGCTC AAGGGCGTCG GTACCATCGT CAACGCCCTT GGACTTCGTG CGAAGCCGTA CTTGCCCCAA ATTTGCGGTA CGATCAAATG GCGCATGAAC AACAAAAGCG CCGACATTCG CGAACAAGCG GCTGATTTGA TCAGCGCCAT CGCTCCCGTG ATGCGCAAGT GCGAAGAAGA GCAACTTCTC GGGCACTTGG GGGTCGTTTT GTACGAGTAC CTCGGTGAAG AATATCCCGA AGTTTTGGGT TCGATTCTTG GCGCGTTGAA GGCGATCGTG AGCGTGCAAG GAATGACGCG AATGACGCCG CCGATTAAGG ATCTTCTCCC GCGATTGACG CCAATTTTGA AGAATAGGCA CGAGAAGGTG CAAGAGAACA CGATCGATTT GATCGGTCGA ATTGCCGATC GAGGTGCGGA GTACGTCGCC GCGCGCGAGT GGATGCGAAT TTGCTTTGAA CTTTTAGAAT TGTTGAAGGC GCCGAAGAAG GCGATTCGCC GCGCGACGGT GAACACGTTT GGTTACATCG CCAAGGCGAT TGGTCCGCAA GACGTGCTCG CGACGCTTTT GAACAACCTC AAGGTACAAG AGCGTCAAAT GCGCGTGTGT ACGACTGTGG CGATCGCAAT CGTTGCGGAA ACGTGCGCGC CGTTTACCGT GCTACCGGCG CTCATGAATG AGTACCGCGT GCCCGAGCTC AACGTGCAAA ACGGCGTGCT GAAATCGTTG GCGTTTTTGT TCGAGTACAT TGGAGAGATG GGAAAGGATT ACATCTACGC CGTCACGCCG CTGCTCGAAG ACGCGCTCAT GGATCGAGAT CTGGTACACC GCCAAACTGC GGCGGTGACG GTAAAGCATC TCGCCTTGGG ATGCGCTGGT CTTGGATGCG AAGACGCCGT GACGCATCTG ATCAATTACA CTTGGCCCAA CGTATTTGAG CCGTCGCCGC ACGTCATCAA CGCTGTGACG GAGGCGATCG AAGCCGCACG AGTGGCGCTC GGACCACATT TCGTACTCGC ATACACGTTG CAAGGGCTAT TCCATCCCGC ACGCAAGGTT CGCGACATTT ACTGGAAGAT TTACAACACG CTGTACATTT CATCCGAGGA TGCTCTCGTG CCGGCATACC CGGCGCTCGA CGACGACGGA CCGAACACGT ACCGTCGCGT CGAGCTTGAC TGTTTCGTGT AG
|
Protein sequence | MKEAAVERER DNTLRNIEKK QREEAERAAD EARLAAARAS EPAASSARPE KRKRRWDAKP DDSAARSGQA PAPARRPSEW ETMESGTRAA DVKPTPRRSR SRWDETPMIR AGGDPSATPA WTGGETPVIA AGGETPKITA GMATPSAAQI AAHAAMQSNV PLTPEQYQQM RFQREIEERN RPQTDEELDE LLPSEGYKIL EPPASYVPIR TPARKLMQTP MPYGSNAGFF SIPEEDRGQK FDVALVPEGL PEMKPEDVQY FAPLLKETDE EALTIEEQKE RKIMRLLLRV KNGTPQQRKT SLRQITDRAK EFGAGPLFNQ ILPLLMSPTL EDQERHLLVK VIDRILYKLD DLVRPYVHKI LVVIEPLLID EDYYARVEGR EIISNVAKAA GLATMIAAMR PDIDNVDEYV RNTTARAFAV VAQALGVQSL LPFLKAVCQS KKSWQARHTG IKIVQQIAIL HGCAVLPHLK SLVDIIENGL GDENQKVRTI TALSIAALAE AATPYGIESF DNVLKPLWKG IRAHRGKVLA AFLKAIGFII PLMDAMYANY YTREVMVILI REFATADEEM KKITLKVVKQ CVATDGVEPE YIRAEVMPEF FKHFWVRRMA LDRRNYQQLV ETTLEVSLKV GAAEIIGRIV EDLKDESEPY RRMVMETITK VIEELGTADV DTRMEELLID GMLYAFQEQT SDENDIMLKG VGTIVNALGL RAKPYLPQIC GTIKWRMNNK SADIREQAAD LISAIAPVMR KCEEEQLLGH LGVVLYEYLG EEYPEVLGSI LGALKAIVSV QGMTRMTPPI KDLLPRLTPI LKNRHEKVQE NTIDLIGRIA DRGAEYVAAR EWMRICFELL ELLKAPKKAI RRATVNTFGY IAKAIGPQDV LATLLNNLKV QERQMRVCTT VAIAIVAETC APFTVLPALM NEYRVPELNV QNGVLKSLAF LFEYIGEMGK DYIYAVTPLL EDALMDRDLV HRQTAAVTVK HLALGCAGLG CEDAVTHLIN YTWPNVFEPS PHVINAVTEA IEAARVALGP HFVLAYTLQG LFHPARKVRD IYWKIYNTLY ISSEDALVPA YPALDDDGPN TYRRVELDCF V
|
| |