Gene OSTLU_14418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_14418 
Symbol 
ID5000860 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp37124 
End bp40465 
Gene Length3342 bp 
Protein Length1091 aa 
Translation table 
GC content57% 
IMG OID640416281 
Productpredicted protein 
Protein accessionXP_001416533 
Protein GI145344012 
COG category[A] RNA processing and modification 
COG ID[COG5181] U2 snRNP spliceosome subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGAGG CGGCGGTGGA GCGAGAGCGC GACAACACGC TGCGAAACAT TGAAAAGAAG 
CAACGAGAAG AGGCGGAACG CGCGGCGGAC GAGGCGCGAC TCGCCGCGGC GCGAGCGAGC
GAACCCGCGG CATCGTCGGC GAGGCCAGAG AAGCGCAAGC GACGATGGGA CGCGAAACCG
GACGACAGCG CGGCGCGAAG CGGGCAGGCG CCGGCGCCGG CGCGCGTGAG CGAGTGGGAA
AGCGACGACA GCGCGGCGAC TGGGACGAAG TGGGACGACG CCACGCCGCA GCGCCCATCA
GAGTGGGAAA CGATGGAAAG CGGCACGCGA GCGGCGGACG TCAAACCGAC GCCGCGACGC
TCGAGAAGCC GTTGGGACGA GACGCCGATG ATTCGCGCGG GTGGCGATCC GAGCGCCACG
CCGGCGTGGA CGGGTGGCGA GACGCCCGTC ATCGCCGCCG GTGGGGAGAC GCCGAAAATC
ACGGCTGGTA TGGCGACGCC ATCTGCAGCG CAAATCGCCG CGCACGCCGC CATGCAATCG
AACGTTCCTT TGACGCCCGA GCAGTATCAG CAGATGCGTT TTCAACGGGA AATCGAAGAG
CGCAACCGAC CGCAGACGGA TGAGGAACTC GACGAGTTGT TGCCGTCCGA AGGGTACAAG
ATTCTCGAGC CACCGGCGAG CTACGTGCCC ATTCGCACGC CTGCGCGTAA GCTCATGCAA
ACGCCGATGC CGTACGGCTC AAACGCCGGC TTTTTCAGCA TTCCCGAGGA GGATCGCGGG
CAAAAATTTG ACGTCGCCCT CGTCCCCGAA GGTTTGCCGG AGATGAAACC CGAGGACGTG
CAGTACTTTG CACCTTTGTT GAAAGAAACG GACGAGGAAG CCTTGACGAT CGAAGAGCAG
AAGGAGCGCA AGATTATGCG TTTGTTACTT CGAGTCAAGA ACGGGACGCC GCAGCAGCGC
AAGACGTCGC TTCGTCAAAT CACCGATCGC GCCAAAGAGT TCGGCGCCGG ACCGTTGTTC
AACCAGATTT TACCTTTGCT CATGTCGCCG ACGTTGGAAG ATCAAGAGCG TCACTTACTC
GTCAAGGTGA TTGATCGCAT CTTGTACAAG CTCGACGACT TGGTGCGCCC CTACGTGCAC
AAGATTCTCG TCGTCATCGA ACCTTTGTTG ATTGATGAGG ACTACTACGC CCGCGTCGAG
GGGCGGGAAA TCATTAGTAA CGTCGCAAAG GCTGCAGGAT TAGCGACGAT GATCGCCGCT
ATGCGCCCTG ACATTGATAA CGTGGATGAG TACGTGCGGA ACACGACGGC GAGAGCGTTC
GCCGTCGTCG CGCAAGCGCT CGGCGTGCAG TCTTTGCTTC CTTTCTTGAA GGCGGTGTGC
CAAAGTAAGA AGAGTTGGCA AGCGCGTCAC ACGGGCATCA AAATCGTTCA GCAAATCGCA
ATCCTTCACG GGTGCGCCGT GCTGCCGCAC CTCAAATCTT TGGTGGACAT CATTGAAAAC
GGTTTGGGCG ATGAAAACCA AAAGGTGCGC ACCATCACCG CGCTTTCCAT CGCCGCGCTC
GCTGAGGCTG CGACGCCGTA CGGTATCGAG TCGTTCGATA ACGTTCTCAA ACCGTTGTGG
AAAGGCATTC GCGCACACCG AGGGAAAGTG TTAGCCGCGT TCCTCAAGGC CATCGGTTTC
ATCATTCCGC TCATGGACGC GATGTACGCC AACTATTACA CCCGCGAGGT CATGGTGATT
CTGATTCGTG AATTCGCTAC TGCGGACGAA GAGATGAAGA AAATTACGTT GAAAGTTGTC
AAGCAGTGCG TAGCCACCGA CGGGGTCGAG CCAGAGTACA TTCGAGCCGA AGTCATGCCT
GAATTCTTCA AACATTTCTG GGTTCGACGC ATGGCTCTCG ATCGACGCAA CTATCAACAG
CTCGTGGAGA CGACGTTGGA GGTTTCCTTA AAAGTCGGTG CCGCGGAAAT CATCGGGCGA
ATCGTGGAAG ATTTGAAGGA TGAGTCCGAG CCTTATCGCC GTATGGTGAT GGAAACAATC
ACCAAGGTGA TTGAAGAGCT TGGTACGGCG GACGTGGACA CTCGCATGGA GGAACTCCTG
ATTGACGGCA TGCTGTACGC CTTTCAAGAG CAAACGTCGG ATGAGAATGA TATCATGCTC
AAGGGCGTCG GTACCATCGT CAACGCCCTT GGACTTCGTG CGAAGCCGTA CTTGCCCCAA
ATTTGCGGTA CGATCAAATG GCGCATGAAC AACAAAAGCG CCGACATTCG CGAACAAGCG
GCTGATTTGA TCAGCGCCAT CGCTCCCGTG ATGCGCAAGT GCGAAGAAGA GCAACTTCTC
GGGCACTTGG GGGTCGTTTT GTACGAGTAC CTCGGTGAAG AATATCCCGA AGTTTTGGGT
TCGATTCTTG GCGCGTTGAA GGCGATCGTG AGCGTGCAAG GAATGACGCG AATGACGCCG
CCGATTAAGG ATCTTCTCCC GCGATTGACG CCAATTTTGA AGAATAGGCA CGAGAAGGTG
CAAGAGAACA CGATCGATTT GATCGGTCGA ATTGCCGATC GAGGTGCGGA GTACGTCGCC
GCGCGCGAGT GGATGCGAAT TTGCTTTGAA CTTTTAGAAT TGTTGAAGGC GCCGAAGAAG
GCGATTCGCC GCGCGACGGT GAACACGTTT GGTTACATCG CCAAGGCGAT TGGTCCGCAA
GACGTGCTCG CGACGCTTTT GAACAACCTC AAGGTACAAG AGCGTCAAAT GCGCGTGTGT
ACGACTGTGG CGATCGCAAT CGTTGCGGAA ACGTGCGCGC CGTTTACCGT GCTACCGGCG
CTCATGAATG AGTACCGCGT GCCCGAGCTC AACGTGCAAA ACGGCGTGCT GAAATCGTTG
GCGTTTTTGT TCGAGTACAT TGGAGAGATG GGAAAGGATT ACATCTACGC CGTCACGCCG
CTGCTCGAAG ACGCGCTCAT GGATCGAGAT CTGGTACACC GCCAAACTGC GGCGGTGACG
GTAAAGCATC TCGCCTTGGG ATGCGCTGGT CTTGGATGCG AAGACGCCGT GACGCATCTG
ATCAATTACA CTTGGCCCAA CGTATTTGAG CCGTCGCCGC ACGTCATCAA CGCTGTGACG
GAGGCGATCG AAGCCGCACG AGTGGCGCTC GGACCACATT TCGTACTCGC ATACACGTTG
CAAGGGCTAT TCCATCCCGC ACGCAAGGTT CGCGACATTT ACTGGAAGAT TTACAACACG
CTGTACATTT CATCCGAGGA TGCTCTCGTG CCGGCATACC CGGCGCTCGA CGACGACGGA
CCGAACACGT ACCGTCGCGT CGAGCTTGAC TGTTTCGTGT AG
 
Protein sequence
MKEAAVERER DNTLRNIEKK QREEAERAAD EARLAAARAS EPAASSARPE KRKRRWDAKP 
DDSAARSGQA PAPARRPSEW ETMESGTRAA DVKPTPRRSR SRWDETPMIR AGGDPSATPA
WTGGETPVIA AGGETPKITA GMATPSAAQI AAHAAMQSNV PLTPEQYQQM RFQREIEERN
RPQTDEELDE LLPSEGYKIL EPPASYVPIR TPARKLMQTP MPYGSNAGFF SIPEEDRGQK
FDVALVPEGL PEMKPEDVQY FAPLLKETDE EALTIEEQKE RKIMRLLLRV KNGTPQQRKT
SLRQITDRAK EFGAGPLFNQ ILPLLMSPTL EDQERHLLVK VIDRILYKLD DLVRPYVHKI
LVVIEPLLID EDYYARVEGR EIISNVAKAA GLATMIAAMR PDIDNVDEYV RNTTARAFAV
VAQALGVQSL LPFLKAVCQS KKSWQARHTG IKIVQQIAIL HGCAVLPHLK SLVDIIENGL
GDENQKVRTI TALSIAALAE AATPYGIESF DNVLKPLWKG IRAHRGKVLA AFLKAIGFII
PLMDAMYANY YTREVMVILI REFATADEEM KKITLKVVKQ CVATDGVEPE YIRAEVMPEF
FKHFWVRRMA LDRRNYQQLV ETTLEVSLKV GAAEIIGRIV EDLKDESEPY RRMVMETITK
VIEELGTADV DTRMEELLID GMLYAFQEQT SDENDIMLKG VGTIVNALGL RAKPYLPQIC
GTIKWRMNNK SADIREQAAD LISAIAPVMR KCEEEQLLGH LGVVLYEYLG EEYPEVLGSI
LGALKAIVSV QGMTRMTPPI KDLLPRLTPI LKNRHEKVQE NTIDLIGRIA DRGAEYVAAR
EWMRICFELL ELLKAPKKAI RRATVNTFGY IAKAIGPQDV LATLLNNLKV QERQMRVCTT
VAIAIVAETC APFTVLPALM NEYRVPELNV QNGVLKSLAF LFEYIGEMGK DYIYAVTPLL
EDALMDRDLV HRQTAAVTVK HLALGCAGLG CEDAVTHLIN YTWPNVFEPS PHVINAVTEA
IEAARVALGP HFVLAYTLQG LFHPARKVRD IYWKIYNTLY ISSEDALVPA YPALDDDGPN
TYRRVELDCF V