Gene OSTLU_119367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_119367 
SymbolSf3b2 
ID5000092 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp150039 
End bp151936 
Gene Length1898 bp 
Protein Length565 aa 
Translation table 
GC content64% 
IMG OID640415513 
Productsplicing factor 3B subunit2, probable 
Protein accessionXP_001416084 
Protein GI145341996 
COG category[A] RNA processing and modification 
COG ID[COG5182] Splicing factor 3b, subunit 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGTCG CGGCGAAGCG ACGCGCCAAG GCGCGCGCGG CCAAGGCGCG CGACAGGGCG 
ACGGTGCGTT CGCGCGCGCG ACGCGCGCGA ACGATCGAGC GCGCGCGCGA CGCGAAGGCC
GAAGGCGCGA TCGGCGCCGG CGGCGGGATC GAACGAGACG ACGGCGCGCG ATGGCGCGCG
CGACGCGAGC GACGCGCGAC GACGCGCGAA GACGACGCGG ACTGACCTCG AGTCGCGATC
GGTGGCGCGA CGACCGCGCG CAGGCGGAGG CGAAGACGGG AGACGACGCG ACGGCGACGA
CGACGCGCGG CGAGACGAAG GCGCGAGGCG CGAGACGCGA AAAGAGCGGC GCGAACGCGG
GCGATGAGGA CGTCGAGATC GAGTACGTGG CGGCGCCGAT CGAGTTAGAG CTAGAGGGGA
CGAGCGATGC GGACGGGGAG GACGGATTGG GGGAGTTTAA GGCGATTTTT GAGGCGTTTC
GAGCGAGAGG GGCGAGAGGG GGCGAGGTCG GCACGAGCGC GGACGACGCG GGGGCGAAGA
AGGGTGATGG GGGCGACGGC GCGGATGAAG ACGACGGGGA CGACGGGGAC GATGACGGCG
CGGGCGAGGA GCTTTCGAAT AAAAAGAAGA AGGAACTGCG TCGGATGAAG GTGGCGGAGC
TCAAGCAACA CTGCGCGAAG CCAGAAGTCG TGGAGGTTTG GGACGCGAGC GCGAACGATC
CGAGATTACT CGTCTTTCTC AAGGCGCACA GGAACACGGT CCCGGTGCCG AGACATTGGA
GCCAAAAGCG GGCGTTTTTG CAAGGTAAGC GAGGGATCGA GAAGCCGCCG TGGGAGTTGC
CGGATTTTAT TCGCGCCACG GGAATTCAGA AGATTCGCGA CCACTACGCG GAGAAGGAAG
ACGCCAAGTC GTTGAAACAG AAGGCAAAAG ATACGAAAAC GGCCAAGCTC GGGCGCATGG
ATATCGATTA TCAAATTCTG CACGATGCGT TCTTCGTCTA CCAATCCAAA CCCAAGATGT
CCAAACCCGG GGATTTGTAC TTTGAGGGCA AGGAGTTTGA GGTTTCCATC GGACGCAAGC
CCGGCAAGCT GAGCGAAGAA CTCAAGGCTG CGCTCGGCAT GACGGACGGC GGACCACCTC
CGTGGTTGAT CAACATGCAA CGATACGGTC CGCCGCCGAG CTATCCGCAC CTGCGCGTGC
CGGGGCTTTC CGCGCCGATT CCCGCGGGAG CGCAGTTCGG CTACCATCCG GGCGGTTGGG
GCAAACCGCC CGTGGACGAG CTCGGCGTGC CCATCTACGG CGACGTGTTC GGAAGCACGA
AGACAAAGGC GAGTGATAGC ACGCCGTACG ACGTAGCTGT CGACAAGACG AAACGGTTCG
GTGAGATCGA CGAAGAGTTG GAGGAAGAGG AGAGCGAGGA AGAGGTGGAG GAGGAAGAAG
TCGCCGCGGA GGAGGCCGAG GAGGAGGAGG CCGAGGAAGA AGAAGAGTCC GCGGAACTCC
ACGCGGGCGC CGAAACGCCA GACGTTTTGG ATCTGCGCAA AAAGTCCGAA GGCCCGAAGT
CATTGTACAC GGTGTTACCG TCGCAAGAAG CCTCCGTCGG CGCCGACCAA ATCGTCGGCT
CGGCGCACAC GTACGCCATC CCTACCGACG GCGACGACAA GCCCAAGCGT CGCACCCGTG
GCGCGGGCGT CGAAGTCACG CTCGACGCCG AGGCCCTCGG AGACGATGGC TTGCAAGATG
AAGACGCGAT CCGCGCCGCG TACGAGGAAA CCGTCGCGGC GAAGAAGGCC GCCGCCGCGC
CAGAGGACTT TTCCGACATG GTCGCCGACC ACGCCCGCGC TCAAAAACGA AAGGCCAAAG
ACAAAAAAGA TAGCGACGCA AAAAAGTTCA AATTTTAA
 
Protein sequence
MGVAAKRRAK ARAAKARDRA TAEAKTGDDA TATTTRGETK ARGARREKSG ANAGDEDVEI 
EYVAAPIELE LEGTSDADGE DGLGEFKAIF EAFRARGARG GEVGTSADDA GAKKGDGGDG
ADEDDGDDGD DDGAGEELSN KKKKELRRMK VAELKQHCAK PEVVEVWDAS ANDPRLLVFL
KAHRNTVPVP RHWSQKRAFL QGKRGIEKPP WELPDFIRAT GIQKIRDHYA EKEDAKSLKQ
KAKDTKTAKL GRMDIDYQIL HDAFFVYQSK PKMSKPGDLY FEGKEFEVSI GRKPGKLSEE
LKAALGMTDG GPPPWLINMQ RYGPPPSYPH LRVPGLSAPI PAGAQFGYHP GGWGKPPVDE
LGVPIYGDVF GSTKTKASDS TPYDVAVDKT KRFGEIDEEL EEEESEEEVE EEEVAAEEAE
EEEAEEEEES AELHAGAETP DVLDLRKKSE GPKSLYTVLP SQEASVGADQ IVGSAHTYAI
PTDGDDKPKR RTRGAGVEVT LDAEALGDDG LQDEDAIRAA YEETVAAKKA AAAPEDFSDM
VADHARAQKR KAKDKKDSDA KKFKF