Gene Information |
Locus tag | OSTLU_119367 |
Symbol | Sf3b2 |
ID | 5000092 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 150039 |
End bp | 151936 |
Gene Length | 1898 bp |
Protein Length | 565 aa |
Translation table | |
GC content | 64% |
IMG OID | 640415513 |
Product | splicing factor 3B subunit2, probable |
Protein accession | XP_001416084 |
Protein GI | 145341996 |
COG category | [A] RNA processing and modification |
COG ID | [COG5182] Splicing factor 3b, subunit 2 |
TIGRFAM ID | |
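
The Gene Length field above agrees with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That reading is an assumption; the record does not state its coordinate convention. The short sketch below checks the arithmetic and compares it with the minimum coding length implied by the 565 aa protein. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein: 3 per residue, plus an optional stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


length = gene_length(150039, 151936)   # Start bp / End bp from the record
coding = min_coding_length(565)        # Protein Length from the record

print(length)           # 1898
print(coding)           # 1698
print(length - coding)  # 200
```

The 200 bp difference is consistent with the record marking the gene as spliced, although the record itself does not list intron positions.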
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGTCG CGGCGAAGCG ACGCGCCAAG GCGCGCGCGG CCAAGGCGCG CGACAGGGCG ACGGTGCGTT CGCGCGCGCG ACGCGCGCGA ACGATCGAGC GCGCGCGCGA CGCGAAGGCC GAAGGCGCGA TCGGCGCCGG CGGCGGGATC GAACGAGACG ACGGCGCGCG ATGGCGCGCG CGACGCGAGC GACGCGCGAC GACGCGCGAA GACGACGCGG ACTGACCTCG AGTCGCGATC GGTGGCGCGA CGACCGCGCG CAGGCGGAGG CGAAGACGGG AGACGACGCG ACGGCGACGA CGACGCGCGG CGAGACGAAG GCGCGAGGCG CGAGACGCGA AAAGAGCGGC GCGAACGCGG GCGATGAGGA CGTCGAGATC GAGTACGTGG CGGCGCCGAT CGAGTTAGAG CTAGAGGGGA CGAGCGATGC GGACGGGGAG GACGGATTGG GGGAGTTTAA GGCGATTTTT GAGGCGTTTC GAGCGAGAGG GGCGAGAGGG GGCGAGGTCG GCACGAGCGC GGACGACGCG GGGGCGAAGA AGGGTGATGG GGGCGACGGC GCGGATGAAG ACGACGGGGA CGACGGGGAC GATGACGGCG CGGGCGAGGA GCTTTCGAAT AAAAAGAAGA AGGAACTGCG TCGGATGAAG GTGGCGGAGC TCAAGCAACA CTGCGCGAAG CCAGAAGTCG TGGAGGTTTG GGACGCGAGC GCGAACGATC CGAGATTACT CGTCTTTCTC AAGGCGCACA GGAACACGGT CCCGGTGCCG AGACATTGGA GCCAAAAGCG GGCGTTTTTG CAAGGTAAGC GAGGGATCGA GAAGCCGCCG TGGGAGTTGC CGGATTTTAT TCGCGCCACG GGAATTCAGA AGATTCGCGA CCACTACGCG GAGAAGGAAG ACGCCAAGTC GTTGAAACAG AAGGCAAAAG ATACGAAAAC GGCCAAGCTC GGGCGCATGG ATATCGATTA TCAAATTCTG CACGATGCGT TCTTCGTCTA CCAATCCAAA CCCAAGATGT CCAAACCCGG GGATTTGTAC TTTGAGGGCA AGGAGTTTGA GGTTTCCATC GGACGCAAGC CCGGCAAGCT GAGCGAAGAA CTCAAGGCTG CGCTCGGCAT GACGGACGGC GGACCACCTC CGTGGTTGAT CAACATGCAA CGATACGGTC CGCCGCCGAG CTATCCGCAC CTGCGCGTGC CGGGGCTTTC CGCGCCGATT CCCGCGGGAG CGCAGTTCGG CTACCATCCG GGCGGTTGGG GCAAACCGCC CGTGGACGAG CTCGGCGTGC CCATCTACGG CGACGTGTTC GGAAGCACGA AGACAAAGGC GAGTGATAGC ACGCCGTACG ACGTAGCTGT CGACAAGACG AAACGGTTCG GTGAGATCGA CGAAGAGTTG GAGGAAGAGG AGAGCGAGGA AGAGGTGGAG GAGGAAGAAG TCGCCGCGGA GGAGGCCGAG GAGGAGGAGG CCGAGGAAGA AGAAGAGTCC GCGGAACTCC ACGCGGGCGC CGAAACGCCA GACGTTTTGG ATCTGCGCAA AAAGTCCGAA GGCCCGAAGT CATTGTACAC GGTGTTACCG TCGCAAGAAG CCTCCGTCGG CGCCGACCAA ATCGTCGGCT CGGCGCACAC GTACGCCATC CCTACCGACG GCGACGACAA GCCCAAGCGT CGCACCCGTG GCGCGGGCGT CGAAGTCACG CTCGACGCCG AGGCCCTCGG AGACGATGGC TTGCAAGATG AAGACGCGAT CCGCGCCGCG TACGAGGAAA CCGTCGCGGC GAAGAAGGCC GCCGCCGCGC 
CAGAGGACTT TTCCGACATG GTCGCCGACC ACGCCCGCGC TCAAAAACGA AAGGCCAAAG ACAAAAAAGA TAGCGACGCA AAAAAGTTCA AATTTTAA
|
Protein sequence | MGVAAKRRAK ARAAKARDRA TAEAKTGDDA TATTTRGETK ARGARREKSG ANAGDEDVEI EYVAAPIELE LEGTSDADGE DGLGEFKAIF EAFRARGARG GEVGTSADDA GAKKGDGGDG ADEDDGDDGD DDGAGEELSN KKKKELRRMK VAELKQHCAK PEVVEVWDAS ANDPRLLVFL KAHRNTVPVP RHWSQKRAFL QGKRGIEKPP WELPDFIRAT GIQKIRDHYA EKEDAKSLKQ KAKDTKTAKL GRMDIDYQIL HDAFFVYQSK PKMSKPGDLY FEGKEFEVSI GRKPGKLSEE LKAALGMTDG GPPPWLINMQ RYGPPPSYPH LRVPGLSAPI PAGAQFGYHP GGWGKPPVDE LGVPIYGDVF GSTKTKASDS TPYDVAVDKT KRFGEIDEEL EEEESEEEVE EEEVAAEEAE EEEAEEEEES AELHAGAETP DVLDLRKKSE GPKSLYTVLP SQEASVGADQ IVGSAHTYAI PTDGDDKPKR RTRGAGVEVT LDAEALGDDG LQDEDAIRAA YEETVAAKKA AAAPEDFSDM VADHARAQKR KAKDKKDSDA KKFKF
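
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a string could be cleaned and its GC content computed is shown below, using only the first two blocks of the gene sequence as sample input. The helper names are hypothetical, and the method the database uses for its GC content field is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between blocks and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence in this record.
sample = "ATGGGCGTCG CGGCGAAGCG"
seq = clean_sequence(sample)

print(len(seq))                       # 20
print(round(gc_fraction(seq) * 100))  # 75
```

Running the same functions on the full gene sequence would allow a comparison with the reported GC content of 64%.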