Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_19017 |
Symbol | SDG3503 |
ID | 5006750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | + |
Start bp | 247408 |
End bp | 250350 |
Gene Length | 2943 bp |
Protein Length | 980 aa |
Translation table | |
GC content | 58% |
IMG OID | 640422171 |
Product | predicted protein |
Protein accession | XP_001422533 |
Protein GI | 145356635 |
COG category | [R] General function prediction only |
COG ID | [COG2940] Proteins containing SET domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.238798 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGC GCGCCGCGCG CGCGCCGCGC GCGGCGTCGC CGCGCGGCGC GGACGCGACG CGACGGCGCG ACGGCGCGAT CGACGCCGAC GACGCCGACG CGACGGCGAC CGCGCGCGCG CGCGCGAGCG CGAACGACGC GCGCGCGCGA TCGGCGGCGG TGCTGGACGT CGTCGATCTC ACGCTCGACG TCGAGGCGCC GCGCGCGGCG ACGGCGACGG CGACGGCGAC GGCGGACGCG CGAGGCGACG CGACGCGCGG CGACGACGCC GAGGCGGTGA AGATCCTCGA CGACGGTTTG CCGACGTCGA AGCCGAAGGC GAGGGATGGA AAGCCGACGG CGCGATCGGG TGAGAATGGA AAGTCGCGCG CGTCGGGGTC GGGGCGGATC GAACGCGTGG CGGAGGTGCT GCAGTCGCCC GAGAAGGCGT GGCCGCGAGA GGAGGCGGCG GCGGAGGACG ACGACGACGC GAAGGTTTCG GTGAGTTTTG AACTCGAGTT CTGCGCGGCG ATGCGAGAGG GGTTGGAGGG AAGACGGACG GAGGCGATGG AGGCGATGGA GACGCTCGAA GCGACGGTGG GATCGAGGGT GTTTCAGAAC GCGGACGCGG CGGCGGCGGA GGCGCGGACG AGGACGAGGA GCGCGGCGAA GCGCGCGCGT AGGATTTTTC GGGAGGAAAC TAAGCTTTGG GCGGCGGCGG TGGTGAAAGC GCAGCAACTG ACGATAGACG ATGACGTGCC GTCGACGTCG GAACCCGTCG CGATGGAGTT GCCGTTTTGG CAGCGCATCT CGAGCGAGCA CCTGGGCGCG TCGAGCGCGA GGAATACGAA GATTTTCACG CAGCGCACCG AACGGCCGTA TTCGAGTGAT CCGACCGATG TAAGCGGCTG GAAGCTGCAA ACAATAGTAG GGCGACAATC TGTACCTCGA ATAGACATCG GTAGCGCCAA GGCGGTGCCG CCCTACGCTT ACTTTGCGTA CTCGACGCAC TGCAACTCGT ACGAGGCGGA AGGAAACGTC TCGCGCCTAC TCTTCAGGGA CGACGATGGA GAGTTCTTAG AATCAGATCC CGTCGATCGG CGCGAAGACG AGAGTAATGA GCTCACTCGC GAGCAGGAAA TCATCATGTG CGCGATATGC GCGGAGTTTA GCGAGTTCAT TTTGACAGAA AAAGAAGTCG TGCGAGGGGT GAATCGCGAA GACGGCGTGA AAGCCGTCGT GGTACAAACA GCCGAATATT TGAACTTGGA CGAAAACCAA GTCAAGGATT GGTTTGATGA GACGCGCACG AAGCACAGCA CGAGCCGTGC ATGGTGTATG TTTCTCGAAG TTGCCTCGCA CGTACGCAAG TTGATGGGCT TCTCCTCGGC GCATTGGCGC GCGAAGATGG CAAACACATT CAGCGTTCTC GAAACTCTCG GGATCAGCGA ATTGTTCTGG CGGAAGTTTT CGCGAATCAT CATCAATTGT CCGACGCTCG CTCCGTTGAA AAAGCCTGTC ATCGTGTTTG ACAATCTCAA CGAAGCCATG GATCAGCTCG CCGGCATGTT TTGTCCGCGA TGCTTCATCT TTGATTGCAG AACGCACGGG TCGTTGCAGC CCAAGTCGGA AGGCAGGAAG CTTGATGCGG AAAGAAAGCT TGCATGGCGC GAGCGCATGG CAAAGAGCGG CATGTCTGCG GAAAAGCCGT TAGCTGAGCG ACGATGTTCG ACAGATTGCT GGTATCAAAC AGAAGAGTAC AAGTACTACT CTGCGCAGAC AACCTGCGCA CCATGCGATC CCACAGAAAC TCTCAATCGT CCGTCGACGA AAGATCCGTT CATCGAGACG ACGAGGAAAT GGCGCAACGC GATGGATATT GAAGTCTTGA AGAAGGCTGT CAAAATAATC GGTGAGAAAA CCACGGCGTG CGAGGCAGCG TTGTTCTTTG GCCGTCGTCG CACGTGCGCC GAAGTCGGGA AGCAAATGCA CTGTTTGGAT CTCATCAACC TTGGAACTGT GGTGAAGGAA GAAGAGCGCG ACGCGATGGA TGAAGATACC GACGAATTGA GTAATCCGAA GAAGCGCAAA CGCGCGCCGA CGGGGGTCAA AAATCCAACA ATTGCGCGAC GATTGAAGAT GCAAAAGGAT GCCGATTTTC TGGAAACGCA ATACTCCCCG TGCGAATGCG TCGGCGCTTG TGACGCTAAC ACGTGCTCCT GCATTAAGAA TGGTACCTTT TGTGAGAGAT TTTGCAACTG TGGACCGAAG TGTCACAACG AGTTCGAGGG TTGCAAGTGC GACAGTACGA AGCGCGCAAC GTGCGGCACA AGAACGTGTC CGTGCTACGC CGCCGGTCGC GAATGCACGC CAGATAAATG TAAACGGTGT TGCAAGACCG CTGATGCGTA CTCTTTGCCC GCTCGTAAAA GGTATGGCCT CGTCGATCCG AACATGCAAC TGCCCATGCC GGCGTTTCCG TGTGAGAACA TGAAGCTACA ACTTCGACAG AAGGAGCACA TTTGTTTGGG TCGAAGCGGT GTTGCCGGTT GGGGTGCGTT CGTGTTGAAA GGCGCTCGGA AAGGAGAGTT CATCGGCGAA TACGTCGGCG AACTCGTGAC TCAGGACGAA GCCGAACGTC GAGGAACGGT GTACGATGTC AACAACTGCT CGTACTTGTT CAATCTCAAC AGCGAATGGT GCGTCGACGC TCAATACAGA GGGAACAAAC TGCGCTTTGC CAATCACTCG AAGAACCCGA ATTGCGTGCC TCGCGTTCTC GCGGTGAATG GTGATCATCG ACTGGCGCTG ATATCAGACA AAGACATCAA ACCAGGCGAT GAATTACTGT TCGACTACAA TTACAAGGAC GAAGTCGCAC CCGACTGGCA CGAGAAAAAC GCATCGACGT TGCCCAAGTC GAAGCACCTT CCAACGAAAA GCGCGAAGAA ATCATCGAAC TGA
|
Protein sequence | MTTRAARAPR AASPRGADAT RRRDGAIDAD DADATATARA RASANDARAR SAAVLDVVDL TLDVEAPRAA TATATATADA RGDATRGDDA EAVKILDDGL PTSKPKARDG KPTARSGENG KSRASGSGRI ERVAEVLQSP EKAWPREEAA AEDDDDAKVS VSFELEFCAA MREGLEGRRT EAMEAMETLE ATVGSRVFQN ADAAAAEART RTRSAAKRAR RIFREETKLW AAAVVKAQQL TIDDDVPSTS EPVAMELPFW QRISSEHLGA SSARNTKIFT QRTERPYSSD PTDVSGWKLQ TIVGRQSVPR IDIGSAKAVP PYAYFAYSTH CNSYEAEGNV SRLLFRDDDG EFLESDPVDR REDESNELTR EQEIIMCAIC AEFSEFILTE KEVVRGVNRE DGVKAVVVQT AEYLNLDENQ VKDWFDETRT KHSTSRAWCM FLEVASHVRK LMGFSSAHWR AKMANTFSVL ETLGISELFW RKFSRIIINC PTLAPLKKPV IVFDNLNEAM DQLAGMFCPR CFIFDCRTHG SLQPKSEGRK LDAERKLAWR ERMAKSGMSA EKPLAERRCS TDCWYQTEEY KYYSAQTTCA PCDPTETLNR PSTKDPFIET TRKWRNAMDI EVLKKAVKII GEKTTACEAA LFFGRRRTCA EVGKQMHCLD LINLGTVVKE EERDAMDEDT DELSNPKKRK RAPTGVKNPT IARRLKMQKD ADFLETQYSP CECVGACDAN TCSCIKNGTF CERFCNCGPK CHNEFEGCKC DSTKRATCGT RTCPCYAAGR ECTPDKCKRC CKTADAYSLP ARKRYGLVDP NMQLPMPAFP CENMKLQLRQ KEHICLGRSG VAGWGAFVLK GARKGEFIGE YVGELVTQDE AERRGTVYDV NNCSYLFNLN SEWCVDAQYR GNKLRFANHS KNPNCVPRVL AVNGDHRLAL ISDKDIKPGD ELLFDYNYKD EVAPDWHEKN ASTLPKSKHL PTKSAKKSSN
|
| |