Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_88595 |
Symbol | SDG3527 |
ID | 5004588 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | + |
Start bp | 270999 |
End bp | 275084 |
Gene Length | 4086 bp |
Protein Length | 1361 aa |
Translation table | |
GC content | 57% |
IMG OID | 640420009 |
Product | predicted protein |
Protein accession | XP_001420292 |
Protein GI | 145351886 |
COG category | [R] General function prediction only |
COG ID | [COG2940] Proteins containing SET domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.116924 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0256987 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCGC CGGGCGCGGA GGGGACGGAG CGCGCGTGCG GCGCGTCCGG ACGCGCGCTG CGCGACCGCG CGACGCTGAA ACACACGAGC AGACAGCAGG AAATCATCTT GCAATCGATT CGAGCCACCG AACAGACGAT CGGCGCGATC AAACGCGCGT CGCTGAGCGA AGAGATCGAA AGAAGGCGCG AGACGGCGCT TGGAATCGTG CGTGATATCG ATGCGGCGTG GAAGGCGATG CCGGCGGTGC CCGAACGCGG CGCGGGGGCG GCGAGGGAGT GCTTTTTGCA GTGCGACGCC GCGGGCTGCG GCAAGTGGCG GCGAGCGCCG CGCGCGATGG GCGACGCGCT CGCGGGCGAC GCGAAGTTTT ACTGCAAAGA CGGCCGAGAC GGGAGATTTA ATCGATGCGA GATGGCGCAG GAGGTCGGCG ACGATGAGGT GGATAGCATT TTGGATGAGA GCGTCGCGGC GGACGCGGAG GCGCGGGAGA GGTGCAAGCG GGACATCTCG AACGAGCGAG GGAGGATGTA TCAGAAACGG AAGAGGTTGT TGGAGTTGGA GCGAAGGCAC GATATTGGAG AGATCGAACT TCAGGAGGAC GGGACGTACG TGACGCTGGC GAAGCGGGGG AAGAATGGAA AGAATGGGAA GCGTCACGGG AATAACGGCG TCGGCGGACT CGGTTCAAAG GCGCCGCCGC ATCCTGGTCC GTTCGATGAG TTCGCGTGGG TGCAGTGCGA AGCGGAGGGG TGTGGGAAGT GGCGTAAGGT GAAGCATGCA GCGGTAGAGG GGATGTCGGT GCGAGATCGT TGGGTGTGCT GCATGAACGA AGACAAGGAG CATTCGACGT GCAAAACGCC TCAGGAGATG AGCGACGAAG ACGTTGACAA GTTTAAAGTC GATCAAGAGA GGCGAATGAT TTTGCACCAG AAGCAATTAC AAGAGCACTA TGCGCACTTC GCACCGCGAG GGGCGAAGTT GCCCGAAGGT TTGATTCGCG TCACGGTGCC GGTGCAGGAC ACAACGCCGC CCGAGGCGGA GCCGCCCGCC GCGGTACCAG TCGCGGCGCC GGCGCCAGGA ACCGCCGAGA CCACTAGAAT ATCGATCGAC GATGGACCGG AAGACTCGCC CATGTGGGTA CCCGGGATAC CGCGTCCGAT GGCGCCGCTT GTGACGCCGA ACACCATTCC TGTGCAAGTG GCTGTTTCGT GCAACGGGCA CGCGGGAGTC TACCTTCCGC GTCGGAACCT GTTTAGATGC CACTGCGAGC AACCAGAACA CATGTGTGCG TCGGTGAGCG GCGATGGTGA CGGTCGTTTA TTCGGGGGAT CAGTTTGGGA GGCGCACTGC GGTAAGGCAT CGACGAGAAA GTATAAAGCG TCCGTCAAGG TTCATATGAA TGGGCCGAAT CCGCAAATGT TCATTGGCCG CTGGTTCGAT CGTGTCGGCT TGAAAGTCGC CGACGCGAAA GGCGGCGGTG GTGGAGGCAA ACCGAAGAAG CCAAAGGTGA AGAGTTTCAT TATGGAGCTC GAAGAGGCGA TCCTTCCTGG ACCGATTAAC CAACTGTCGC TTCGTAATTT GATGACCATT TTCGGTTTCA TTTTGGACGG CAGCGCCGTC GGCGAAGTTA AAGTCAAGAC TGAAGGTGAG GAAAAGGAAA AGCCGAAGGC TTTGACCGCG AATGAGCGAC GTGAAGAAGG TGCAAAAGAA CTCGGGCGAT TCGCATCGGT ATGTAAGGCG TGGAAGCAGG CGTCGTTGAC AGTGTTGAAG ACGGCGGGCG TAGCCAAGGC GGAGAAGAAG AATTTTGACG GGACCGCGCT GCGAACGCAG TTTACCGTGA TTACAGAGTA CGTCGACCGC AAAGATGAGA AGGAGTGCCA AATAGAACGC TCAGTGACGC TCGCGGCTAA TCACCCTTTC GACAAAAAGT CAAAGATGAA GACGGCGCAC GTCGACGACG GAACCCCTCC CGGGTGGTGG CCGTCGCTCG GTGTCAAGGC GGAACGTAAA AAGCTCGTCA CAGAAGTCAT CGAGCAAGAA ACCTACGGTG TTGATTTCGT TACCGGTCGC GACGCAACAG AAACACTCAA GCGCGTGCTG CCCGATTACT CTGAAGACGA TGTATGGGGA CTTTACAAGC AACTTCTCGC GCAAGTGAAC GAAAGCTACG GAGCGATGAC TCCCGATACT TTGGCGACGC AAAACTTGGC GGTTGCCGCT GAAGATCTCG CGGTAAAGCT TGAACGTAAG GCGGACGCTA AGAGTTTAGC GTTTTCAAAG GCTTTATGGA AACTCGCTGC GGCGGCGATC GAGACGCCAG AATATTATTA CGTGCACAGA AAAGGATTTG GCGTCGTGTG CAACCAACCG ATTAAAAAGG GTGAGTTTCT CATCGACTTT CTTGGTGAGA TCTATCCACC CTGGGCGTGG GCCGAGAAGC AAGATGCCAT CCGTCAGGTG CAAAAGGCAC GCGGCCTTCG CGATCGCGGC CCTCCGGAGT TTTACAACAT GCAAATCGAG CGCCCGGGAG GCGACGCAGA GGGTTACTCG GTGCTCTTTT GCGACGCGAT GCACGAGAAC AACTATGCCG GGCGGCTTTC GCACACCTGT GATCCAAACG TCGAAGTCAA CTTGAAGGCC ATTAATGGCA AGTATGAGAT TCATTTCATC ACAACTCGAG ACATTGCACC TGGCGAAGAG TTAGCGTACA ACTATCACAG CTGTACGGAT AACATGAAGG AGGTCGAGAT GGCGTTTTGC TTGTGTGGCG CTCGCATGTG TCGAGGTTCG TATTTGAACT TTGTTGGCGA AGACCACCAC TCGCAAGTTT TGGAGAGCAA GCATAAGCTT ATCGATCGAC AAGTGATGTC GTTCAAGGCG ATCGATAAGG CGGCGGATCC GCTAACTTCG AAACAAGAGC GATCTCTGGC GGCGGTCGGA TTCTATCCTG GGAAAGGTTT ACTCAGAAAC TGCCCGGGAT GGTTGCTTCA GTTTGTGGGT GATGTAGCGG TGTACATGGA TACCGAGCTC AACGAACTTC CCAAGCACAT CCTCGCGGCG GCGAAGAAAG AGCATGCAAA GCTGTTGGAG AAGAACCCGC AAGCTGAGTT TTCTTACACC GAAAAGTTTG CCAAGATTGA TGCGCTTGCG ATGCGGGAAA ACCGAACGCA GTGCGTCGCC ATCATGCTGA GTAAATTGCG TCGACTTTTG ACGCGGGCGC GCGATGATGG TCCGCAGAAA AGCGTGTACG AGTGCATGGA TGTGTTTGAA AAGTCCGCAC CACCGTACGT TACGCTCACC GAAGCCGAGA TCGCCGCGCA TTTTTGGGGG TCTGGTCCGG AGAATTTCGA AAAGTCCATC GTCTGTGGTC TCATTCGCGC CATGGGCCCG CACGAGCGCA AAAACGACGC GGATAAGTTC ATCAAGTGGA CTAGTATGGT GGAAAGTGTC GCCGTGGAAG TCCGCAAAGG CAAGATGACG CGTAAAGAGT CATTACTTTG GCTTCGTGAT GAACTAAAAA AGCTCAAGCA AACCGATGGC GCGCGCCACG ACCTCGCTGC GGGACTTATT CACTTATACG CCGAGACCAA TCGGTTCTGG CAGCCCTCTT CGGCACCAGA GCATCAAGTT TATAAGAGTG ACAAGGTCGC GGTGCGCGAA GACGAAGTCA ACGCGTGGGG CGTTGGTGCG GGTGGCGGTG GCGATAAAAT CGTCGCCAGA GTGGAGAAAA CGTATCGCCC GGGATTTTCT GCGGCGACGA TGCTGCAGTG GCACAAGCAA GAGATGGCCG ACCCGACGCA GTACATCACC GCGAATCGTA GGGGTAACTT GAGCATGCCA GATATCGCGT GCTGTTATTC CAGTCGTCCC GGACAGCCTC TCGCGAGGTC GAGTGATCGC GAGCATGAAA CCTGGCTGGC GCACTTGCAG AGCTGGCCCG AGGAGCCATG GCCGCAATCA AGCGGGCCGT GGGGGGTCGC AAACTCGCAA AAACTGATCG GTTCGCCGGT TTTAGATGCT TGGATGAAGG GCCAGCGATC GATTCCGGCT AAATGCTTGG CGTGGTTGAA AACGAACACG GGCTAA
|
Protein sequence | MPAPGAEGTE RACGASGRAL RDRATLKHTS RQQEIILQSI RATEQTIGAI KRASLSEEIE RRRETALGIV RDIDAAWKAM PAVPERGAGA ARECFLQCDA AGCGKWRRAP RAMGDALAGD AKFYCKDGRD GRFNRCEMAQ EVGDDEVDSI LDESVAADAE ARERCKRDIS NERGRMYQKR KRLLELERRH DIGEIELQED GTYVTLAKRG KNGKNGKRHG NNGVGGLGSK APPHPGPFDE FAWVQCEAEG CGKWRKVKHA AVEGMSVRDR WVCCMNEDKE HSTCKTPQEM SDEDVDKFKV DQERRMILHQ KQLQEHYAHF APRGAKLPEG LIRVTVPVQD TTPPEAEPPA AVPVAAPAPG TAETTRISID DGPEDSPMWV PGIPRPMAPL VTPNTIPVQV AVSCNGHAGV YLPRRNLFRC HCEQPEHMCA SVSGDGDGRL FGGSVWEAHC GKASTRKYKA SVKVHMNGPN PQMFIGRWFD RVGLKVADAK GGGGGGKPKK PKVKSFIMEL EEAILPGPIN QLSLRNLMTI FGFILDGSAV GEVKVKTEGE EKEKPKALTA NERREEGAKE LGRFASVCKA WKQASLTVLK TAGVAKAEKK NFDGTALRTQ FTVITEYVDR KDEKECQIER SVTLAANHPF DKKSKMKTAH VDDGTPPGWW PSLGVKAERK KLVTEVIEQE TYGVDFVTGR DATETLKRVL PDYSEDDVWG LYKQLLAQVN ESYGAMTPDT LATQNLAVAA EDLAVKLERK ADAKSLAFSK ALWKLAAAAI ETPEYYYVHR KGFGVVCNQP IKKGEFLIDF LGEIYPPWAW AEKQDAIRQV QKARGLRDRG PPEFYNMQIE RPGGDAEGYS VLFCDAMHEN NYAGRLSHTC DPNVEVNLKA INGKYEIHFI TTRDIAPGEE LAYNYHSCTD NMKEVEMAFC LCGARMCRGS YLNFVGEDHH SQVLESKHKL IDRQVMSFKA IDKAADPLTS KQERSLAAVG FYPGKGLLRN CPGWLLQFVG DVAVYMDTEL NELPKHILAA AKKEHAKLLE KNPQAEFSYT EKFAKIDALA MRENRTQCVA IMLSKLRRLL TRARDDGPQK SVYECMDVFE KSAPPYVTLT EAEIAAHFWG SGPENFEKSI VCGLIRAMGP HERKNDADKF IKWTSMVESV AVEVRKGKMT RKESLLWLRD ELKKLKQTDG ARHDLAAGLI HLYAETNRFW QPSSAPEHQV YKSDKVAVRE DEVNAWGVGA GGGGDKIVAR VEKTYRPGFS AATMLQWHKQ EMADPTQYIT ANRRGNLSMP DIACCYSSRP GQPLARSSDR EHETWLAHLQ SWPEEPWPQS SGPWGVANSQ KLIGSPVLDA WMKGQRSIPA KCLAWLKTNT G
|
| |