Gene OSTLU_88595 details

Gene Information

Locus tag: OSTLU_88595
Symbol: SDG3527
ID: 5004588
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009365
Strand:
Start bp: 270999
End bp: 275084
Gene Length: 4086 bp
Protein Length: 1361 aa
Translation table:
GC content: 57%
IMG OID: 640420009
Product: predicted protein
Protein accession: XP_001420292
Protein GI: 145351886
COG category: [R] General function prediction only
COG ID: [COG2940] Proteins containing SET domain
TIGRFAM ID:
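
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are made up for this example, and it assumes that the start and end coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon (the displayed gene sequence does end in TAA).

```python
# Values copied from the Gene Information fields of this record.
start_bp = 270999
end_bp = 275084
gene_length_bp = 4086
protein_length_aa = 1361

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and includes one terminal stop codon,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
predicted_aa = codons - 1

print(span_bp, span_bp % 3, predicted_aa)  # prints: 4086 0 1361
```

Under these assumptions the span matches the reported gene length, the length is a multiple of three, and the predicted residue count matches the reported protein length.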


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.116924
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0256987
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCGC CGGGCGCGGA GGGGACGGAG CGCGCGTGCG GCGCGTCCGG ACGCGCGCTG 
CGCGACCGCG CGACGCTGAA ACACACGAGC AGACAGCAGG AAATCATCTT GCAATCGATT
CGAGCCACCG AACAGACGAT CGGCGCGATC AAACGCGCGT CGCTGAGCGA AGAGATCGAA
AGAAGGCGCG AGACGGCGCT TGGAATCGTG CGTGATATCG ATGCGGCGTG GAAGGCGATG
CCGGCGGTGC CCGAACGCGG CGCGGGGGCG GCGAGGGAGT GCTTTTTGCA GTGCGACGCC
GCGGGCTGCG GCAAGTGGCG GCGAGCGCCG CGCGCGATGG GCGACGCGCT CGCGGGCGAC
GCGAAGTTTT ACTGCAAAGA CGGCCGAGAC GGGAGATTTA ATCGATGCGA GATGGCGCAG
GAGGTCGGCG ACGATGAGGT GGATAGCATT TTGGATGAGA GCGTCGCGGC GGACGCGGAG
GCGCGGGAGA GGTGCAAGCG GGACATCTCG AACGAGCGAG GGAGGATGTA TCAGAAACGG
AAGAGGTTGT TGGAGTTGGA GCGAAGGCAC GATATTGGAG AGATCGAACT TCAGGAGGAC
GGGACGTACG TGACGCTGGC GAAGCGGGGG AAGAATGGAA AGAATGGGAA GCGTCACGGG
AATAACGGCG TCGGCGGACT CGGTTCAAAG GCGCCGCCGC ATCCTGGTCC GTTCGATGAG
TTCGCGTGGG TGCAGTGCGA AGCGGAGGGG TGTGGGAAGT GGCGTAAGGT GAAGCATGCA
GCGGTAGAGG GGATGTCGGT GCGAGATCGT TGGGTGTGCT GCATGAACGA AGACAAGGAG
CATTCGACGT GCAAAACGCC TCAGGAGATG AGCGACGAAG ACGTTGACAA GTTTAAAGTC
GATCAAGAGA GGCGAATGAT TTTGCACCAG AAGCAATTAC AAGAGCACTA TGCGCACTTC
GCACCGCGAG GGGCGAAGTT GCCCGAAGGT TTGATTCGCG TCACGGTGCC GGTGCAGGAC
ACAACGCCGC CCGAGGCGGA GCCGCCCGCC GCGGTACCAG TCGCGGCGCC GGCGCCAGGA
ACCGCCGAGA CCACTAGAAT ATCGATCGAC GATGGACCGG AAGACTCGCC CATGTGGGTA
CCCGGGATAC CGCGTCCGAT GGCGCCGCTT GTGACGCCGA ACACCATTCC TGTGCAAGTG
GCTGTTTCGT GCAACGGGCA CGCGGGAGTC TACCTTCCGC GTCGGAACCT GTTTAGATGC
CACTGCGAGC AACCAGAACA CATGTGTGCG TCGGTGAGCG GCGATGGTGA CGGTCGTTTA
TTCGGGGGAT CAGTTTGGGA GGCGCACTGC GGTAAGGCAT CGACGAGAAA GTATAAAGCG
TCCGTCAAGG TTCATATGAA TGGGCCGAAT CCGCAAATGT TCATTGGCCG CTGGTTCGAT
CGTGTCGGCT TGAAAGTCGC CGACGCGAAA GGCGGCGGTG GTGGAGGCAA ACCGAAGAAG
CCAAAGGTGA AGAGTTTCAT TATGGAGCTC GAAGAGGCGA TCCTTCCTGG ACCGATTAAC
CAACTGTCGC TTCGTAATTT GATGACCATT TTCGGTTTCA TTTTGGACGG CAGCGCCGTC
GGCGAAGTTA AAGTCAAGAC TGAAGGTGAG GAAAAGGAAA AGCCGAAGGC TTTGACCGCG
AATGAGCGAC GTGAAGAAGG TGCAAAAGAA CTCGGGCGAT TCGCATCGGT ATGTAAGGCG
TGGAAGCAGG CGTCGTTGAC AGTGTTGAAG ACGGCGGGCG TAGCCAAGGC GGAGAAGAAG
AATTTTGACG GGACCGCGCT GCGAACGCAG TTTACCGTGA TTACAGAGTA CGTCGACCGC
AAAGATGAGA AGGAGTGCCA AATAGAACGC TCAGTGACGC TCGCGGCTAA TCACCCTTTC
GACAAAAAGT CAAAGATGAA GACGGCGCAC GTCGACGACG GAACCCCTCC CGGGTGGTGG
CCGTCGCTCG GTGTCAAGGC GGAACGTAAA AAGCTCGTCA CAGAAGTCAT CGAGCAAGAA
ACCTACGGTG TTGATTTCGT TACCGGTCGC GACGCAACAG AAACACTCAA GCGCGTGCTG
CCCGATTACT CTGAAGACGA TGTATGGGGA CTTTACAAGC AACTTCTCGC GCAAGTGAAC
GAAAGCTACG GAGCGATGAC TCCCGATACT TTGGCGACGC AAAACTTGGC GGTTGCCGCT
GAAGATCTCG CGGTAAAGCT TGAACGTAAG GCGGACGCTA AGAGTTTAGC GTTTTCAAAG
GCTTTATGGA AACTCGCTGC GGCGGCGATC GAGACGCCAG AATATTATTA CGTGCACAGA
AAAGGATTTG GCGTCGTGTG CAACCAACCG ATTAAAAAGG GTGAGTTTCT CATCGACTTT
CTTGGTGAGA TCTATCCACC CTGGGCGTGG GCCGAGAAGC AAGATGCCAT CCGTCAGGTG
CAAAAGGCAC GCGGCCTTCG CGATCGCGGC CCTCCGGAGT TTTACAACAT GCAAATCGAG
CGCCCGGGAG GCGACGCAGA GGGTTACTCG GTGCTCTTTT GCGACGCGAT GCACGAGAAC
AACTATGCCG GGCGGCTTTC GCACACCTGT GATCCAAACG TCGAAGTCAA CTTGAAGGCC
ATTAATGGCA AGTATGAGAT TCATTTCATC ACAACTCGAG ACATTGCACC TGGCGAAGAG
TTAGCGTACA ACTATCACAG CTGTACGGAT AACATGAAGG AGGTCGAGAT GGCGTTTTGC
TTGTGTGGCG CTCGCATGTG TCGAGGTTCG TATTTGAACT TTGTTGGCGA AGACCACCAC
TCGCAAGTTT TGGAGAGCAA GCATAAGCTT ATCGATCGAC AAGTGATGTC GTTCAAGGCG
ATCGATAAGG CGGCGGATCC GCTAACTTCG AAACAAGAGC GATCTCTGGC GGCGGTCGGA
TTCTATCCTG GGAAAGGTTT ACTCAGAAAC TGCCCGGGAT GGTTGCTTCA GTTTGTGGGT
GATGTAGCGG TGTACATGGA TACCGAGCTC AACGAACTTC CCAAGCACAT CCTCGCGGCG
GCGAAGAAAG AGCATGCAAA GCTGTTGGAG AAGAACCCGC AAGCTGAGTT TTCTTACACC
GAAAAGTTTG CCAAGATTGA TGCGCTTGCG ATGCGGGAAA ACCGAACGCA GTGCGTCGCC
ATCATGCTGA GTAAATTGCG TCGACTTTTG ACGCGGGCGC GCGATGATGG TCCGCAGAAA
AGCGTGTACG AGTGCATGGA TGTGTTTGAA AAGTCCGCAC CACCGTACGT TACGCTCACC
GAAGCCGAGA TCGCCGCGCA TTTTTGGGGG TCTGGTCCGG AGAATTTCGA AAAGTCCATC
GTCTGTGGTC TCATTCGCGC CATGGGCCCG CACGAGCGCA AAAACGACGC GGATAAGTTC
ATCAAGTGGA CTAGTATGGT GGAAAGTGTC GCCGTGGAAG TCCGCAAAGG CAAGATGACG
CGTAAAGAGT CATTACTTTG GCTTCGTGAT GAACTAAAAA AGCTCAAGCA AACCGATGGC
GCGCGCCACG ACCTCGCTGC GGGACTTATT CACTTATACG CCGAGACCAA TCGGTTCTGG
CAGCCCTCTT CGGCACCAGA GCATCAAGTT TATAAGAGTG ACAAGGTCGC GGTGCGCGAA
GACGAAGTCA ACGCGTGGGG CGTTGGTGCG GGTGGCGGTG GCGATAAAAT CGTCGCCAGA
GTGGAGAAAA CGTATCGCCC GGGATTTTCT GCGGCGACGA TGCTGCAGTG GCACAAGCAA
GAGATGGCCG ACCCGACGCA GTACATCACC GCGAATCGTA GGGGTAACTT GAGCATGCCA
GATATCGCGT GCTGTTATTC CAGTCGTCCC GGACAGCCTC TCGCGAGGTC GAGTGATCGC
GAGCATGAAA CCTGGCTGGC GCACTTGCAG AGCTGGCCCG AGGAGCCATG GCCGCAATCA
AGCGGGCCGT GGGGGGTCGC AAACTCGCAA AAACTGATCG GTTCGCCGGT TTTAGATGCT
TGGATGAAGG GCCAGCGATC GATTCCGGCT AAATGCTTGG CGTGGTTGAA AACGAACACG
GGCTAA
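
The gene sequence above is displayed in blocks of 10 bases with line breaks, so it has to be cleaned before its length or GC content can be checked against the fields in this record. The helper names below (`clean_sequence`, `gc_fraction`) are hypothetical and written for this example; this is a minimal sketch, not the method the database used to compute its GC content value.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Only the first displayed line of the gene sequence, used here as a short excerpt.
# Paste the full sequence block in its place to check the whole gene.
excerpt = "ATGCCCGCGC CGGGCGCGGA GGGGACGGAG CGCGCGTGCG GCGCGTCCGG ACGCGCGCTG"

seq = clean_sequence(excerpt)
print(len(seq), seq[:3])  # prints: 60 ATG
print(round(gc_fraction(seq) * 100))  # GC percentage of the excerpt only
```

Applied to the full sequence, `len(seq)` can be compared with the Gene Length field and the rounded percentage with the GC content field.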
 
Protein sequence
MPAPGAEGTE RACGASGRAL RDRATLKHTS RQQEIILQSI RATEQTIGAI KRASLSEEIE 
RRRETALGIV RDIDAAWKAM PAVPERGAGA ARECFLQCDA AGCGKWRRAP RAMGDALAGD
AKFYCKDGRD GRFNRCEMAQ EVGDDEVDSI LDESVAADAE ARERCKRDIS NERGRMYQKR
KRLLELERRH DIGEIELQED GTYVTLAKRG KNGKNGKRHG NNGVGGLGSK APPHPGPFDE
FAWVQCEAEG CGKWRKVKHA AVEGMSVRDR WVCCMNEDKE HSTCKTPQEM SDEDVDKFKV
DQERRMILHQ KQLQEHYAHF APRGAKLPEG LIRVTVPVQD TTPPEAEPPA AVPVAAPAPG
TAETTRISID DGPEDSPMWV PGIPRPMAPL VTPNTIPVQV AVSCNGHAGV YLPRRNLFRC
HCEQPEHMCA SVSGDGDGRL FGGSVWEAHC GKASTRKYKA SVKVHMNGPN PQMFIGRWFD
RVGLKVADAK GGGGGGKPKK PKVKSFIMEL EEAILPGPIN QLSLRNLMTI FGFILDGSAV
GEVKVKTEGE EKEKPKALTA NERREEGAKE LGRFASVCKA WKQASLTVLK TAGVAKAEKK
NFDGTALRTQ FTVITEYVDR KDEKECQIER SVTLAANHPF DKKSKMKTAH VDDGTPPGWW
PSLGVKAERK KLVTEVIEQE TYGVDFVTGR DATETLKRVL PDYSEDDVWG LYKQLLAQVN
ESYGAMTPDT LATQNLAVAA EDLAVKLERK ADAKSLAFSK ALWKLAAAAI ETPEYYYVHR
KGFGVVCNQP IKKGEFLIDF LGEIYPPWAW AEKQDAIRQV QKARGLRDRG PPEFYNMQIE
RPGGDAEGYS VLFCDAMHEN NYAGRLSHTC DPNVEVNLKA INGKYEIHFI TTRDIAPGEE
LAYNYHSCTD NMKEVEMAFC LCGARMCRGS YLNFVGEDHH SQVLESKHKL IDRQVMSFKA
IDKAADPLTS KQERSLAAVG FYPGKGLLRN CPGWLLQFVG DVAVYMDTEL NELPKHILAA
AKKEHAKLLE KNPQAEFSYT EKFAKIDALA MRENRTQCVA IMLSKLRRLL TRARDDGPQK
SVYECMDVFE KSAPPYVTLT EAEIAAHFWG SGPENFEKSI VCGLIRAMGP HERKNDADKF
IKWTSMVESV AVEVRKGKMT RKESLLWLRD ELKKLKQTDG ARHDLAAGLI HLYAETNRFW
QPSSAPEHQV YKSDKVAVRE DEVNAWGVGA GGGGDKIVAR VEKTYRPGFS AATMLQWHKQ
EMADPTQYIT ANRRGNLSMP DIACCYSSRP GQPLARSSDR EHETWLAHLQ SWPEEPWPQS
SGPWGVANSQ KLIGSPVLDA WMKGQRSIPA KCLAWLKTNT G