Gene Information |
Locus tag | OSTLU_32976 |
Symbol | JMJ3501 |
ID | 5003377 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 178438 |
End bp | 182181 |
Gene Length | 3744 bp |
Protein Length | 1194 aa |
Translation table | |
GC content | 61% |
IMG OID | 640418798 |
Product | predicted protein |
Protein accession | XP_001419094 |
Protein GI | 145349340 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01557] myb-like DNA-binding domain, SHAQKYF class |
|
|
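The Gene Length field is consistent with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. The sketch below checks the arithmetic. The function name `gene_length` is hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
span = gene_length(178438, 182181)
print(span)  # 3744

# 1194 codons plus one stop codon would need 3585 bases.
# The 3744 bp span is longer, which fits the record marking the gene as spliced.
coding_bases = 3 * (1194 + 1)
print(coding_bases)       # 3585
print(span - coding_bases)  # 159
```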
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.878376 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.932901 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCGCCGCGAC CTCCATGAGC GCGCGCCGCG ACGTCGACGC GCGTCACGCG TCGACGTCCG AAAAGGTGCG CCACGCGATG CCGCGCGAGT CCGCGCCGCG CCGCGCCGCG CCGCGCGCGC GAACCGTCGA CTGACGCGCG ATACCGACGA CGGCAGGATT CCGAAAGAAA ACCGTCGCGA ACGACGACGG CGGACATCGC GAGCGCGCCG ACGTTCCGTC CGACGCTCGA GGAGTTCGCG GACCCGATCG CGTACCTGTC GTCGATCGAG GCGCGCGCGC GCGAAGCGGG GATATGCAAG GTGATACCGC CGCGAGGCGC GGCGCCGAGG TGGAACGGCG AGGCGTGGAG GCGAGACGAC GCGCGATTCG AGACGAAATT GCAGAACGTA CACTCGCTGA GCGAGGGAAG GACGTTTCAG TTCGGGAAGG AGTACGCGAA AGGGGAGTAC GAGGCGATGG CGAAGGCGTA TGAGGAACGG TGGGCGAAGG AACGTCCGGA CGTCGACGCG AACGACGCGA ACGCGCTGGA GCGAGCGTTT TGGGATATGG TGGAGACGCG GAGCGAGCAG GCGCGAGTCG AGTACGGGAA TGATTTAGAT ACCAAGATTT TCGGTACCGG GTTCGGGGTG GACGAAAACG GGGAGAAGCA TCCGTGGGAT TTCGAGCATT TGTACTCGCA TCCGCTTAAT TTATTGCGCG TCGTCGAGCA CGACATTCCG GGACTCACCA AGCCTTGGTT GTATCTTGGC ATGCTTTTCG CCACGTTTTG CTGGCACGTT GAGGATCATT TCTTGTGTTC GCTTAACTAT TTGCATCGCG GGGCGGCGAA GACGTGGTAC GGTGTGCCAG GAAGCGACGC GGAGGCGTTC GAGAATTGCG CTCGGGCGAC GGTGCCGCGC CTATTCGAGC AAGCGCCAGA TATTTTACAT CAAATCGTCA CGATCGTCCC ACCTGGAGTA TTGGTAGATC ATGGCGTCAA GGTCGTGCAC ACGGTGCAAC AGCCTGGGGA GTTTGTCGTG ACGTTCCCTC GTGCTTACCA CGCCGGGTTT TCCCACGGTT TCAACGTCGC CGAAGCGGTG AACTTCGGTC ATGTCAACTG GCTCGATTTC GGCCGTCGAG CCATCGACGT GTACAGCACC GGATCGTTCA AACGCAACGC CGTGTTTGCA CATCATCGCC TCGTTTCGCG CGCCGCCGAA ACCTTCGTCG AAGTTCTGGG TAAGAACGCT CGACTGGTGA AGAGTAAAGC CATGGGCGCC ATCGTATCGA CGCTTCGCAA GGAGCTCGAA ACGATTTTGA GCGATGAAGA AATTTATCGT GCCTCCCTCG TGCGTCGTGG ATTGAACATA GAAATCGTTC AAGCACCTAA CGAGGACGAC GATGCGTGCT GTATTCGCTG CAAAGCGATG CCGTTTCTCT CCGTCGTGCG ATGCAAGTGT CTACCGACGG CGGTGCGATG CCTTCGACAC GCCATGGACG CTTGTGATTG CGCGGCGGGG GAGAGAACCT TAGAGATTCG CGTGGTTGAT TCACGACTTC GCGAGCTCAT TAAAGCACTG TTCTTCGGTG ACGGCATACA AACCAAGAAC GACGCCGCGA AAGCGCGCGT AGATTTCTCG GCCAATGTGA ACAGAGTCGC CGTCAATCGA GCGCCGCCGC CCAAGCCGAA AGTCGTCCTT CCAAAGCCGA AAACCGTAAA ACCGCCGCCG ACGCGAGCGG TACTCGCGTC TCCACCCCCC ACGCGCATCG TCGCATCGAA AGCCGACGAT GCTTTCACCG CGCGCGGTCT TCCGCGCAAG 
CGAGCCAAGT GCGAAACCCG GCGGCGCTGG ACCGCCGAGA TGGTCGCCGA CTTCGAAGTC GCCGTCGAGC GCCTGGGCGG CGTCGACGCG GCGACGGGCA AAAAGCTCGC CGAAGCGTTA TCCGCGCACG ACGTCACGCG AGACCAATGC GCGAGTCGCT TGCAAAAGCA CCGCGAGAAA ATCAAATCAA ACGCGGACGC GCGCGCAACC TTGTAATGTT ATTCCCTCGC CCCGCGCAGC GCGCGTTACA GCATGGCGTC CAAACGACCC CGCGGCGACG CCACCGACGC GTCCACGCCT CGAGCGCGCG TCTCCGAGGA AGACGCGCGC GCCCCAGTCT CCCTCAAGTC CCTGCTCGAG CGATGGAACC TCGGCGACGT CGTAAACACC GCCGGCGTGT CGCGTCGAAT AAAGTGTCGA CTCGTCCCGG TGTGTCGAAC GAAGGACGAC GAGCGCGCGC GCATCGCTCG AGGCGAGCCG GTGATATGCG CGAGCGCAGA GTACGCGTCC GAAGTGTTCG ACGCGATCGG GACGGCGCGC GACGGCCTCG CGTGGGCGAC GCGGTCGAGT TTCTCAAACT TTGACGCCGA GCGCGGCAAG GTGGCCATGC GAAGCGGTTG GGGCGCGCCG GGGACGCACG TGCTGAGCAA TCGAGACATG ACGATCGTGA CGTCGTTCGC CGAGGTCGTC GACCCGGCGA ATGAAAAGCC GCTGAATTTA CAAATGTTTC ATCGCGAGGA CACCGCGGTG CCGGTGTTGT CGAGGAAATT GAAGTGGCCG AGCGAGGAGG AATTTTTTGG AATCGAAGGC GAAGACGCGC CGGGGAGGCT TTTAGACGAC GCGACGCGCG TGAGCGCGAG AGGGGCGATG ACGTGGTGGC ACTTGGATGA CTGTGGGGAG TTTGTGTGTC AAGTCGGGTT GCCCGAGGCG GGGGAGGCGG CGGAGGACGT GTTGCTCGGG CCGACGGGGA AACCCGTGGT GAAGTTGTTC ATTTTCGCCC AGAGGAAAGA CTACGCGTGG GTGGCGCAAG ACGCAGAGAT GAATAAATCT TACAAAAATT GCGCACTGGA TCTTTTCGAT ACGCCGGATC ATTATTATCC CACGGCGAGC GAGATGTGCC ACCCATCGAG CGCGCCGCTT GACGTTTCGT CGCCAAAGGC GTTCGACGGC GCCGCGACGA GCGACGACGC CGAAGATCCA TGTCCAACGT TTTGGGTCGC TCCGCTCGAG GCTGGAGGGC CACCTTTATT ATCACCTCCC AATATCATAC ACTGTGTGCT CACCGTACGC GACTGCGTGA TGTGTGAAGA GCGCCGGCTT TCGCTGGCGT ACATGGATGA AGTGTTGTAC TTTCAGCGAC GCGCGGCGAG ATGGTGCGAA CCACCCATCT TCTACGCTTT CGTTCGTGAA GATTTGAGCG ACACGGAGAA GGCTAGGTCG AACGCGATGC GGCCACTCGT GAAGATGCTG AATGACTTGA AGCGCGTTGG CGCGACAGAC GGCGACGCGT ATCGCTTTGC GCGGTGTTTA ACGTCGTTGC GAGTTTTGGC GAATCATTCG CCCGAATTCT ACGCACTCGA CGCAGATGGC GTCGCCGAGG CTCGTAAAAG CATCGACAAG CTCGAGTCTT GGTTGGCTGA CGACTCAAAC TGTGAGTTCG TCGAGAAAAT TCAAGCCGCG GTAAAGGCGG ATCCGCGAGC GGTGGAGGAC GCAGAACTCG CAGAGTCTAT GATGAGCGAA ACACTTGGCG TGCTCAATCT CGCCGACGGA CGATCGTGCG CCGTCGTTCA CGAGCGTGGT CGACCTCGCT 
GGGGCCCGGT GCGCAACTCC AAGTCGCTCG TAGACAAGGA CCGAAAAGAT ATGAAGAATG CGATTCGTTC GGGAACGCTC GACGCGCTTC TCCTCGCGTA TCGCCGCGAC ATCATCTAAT CTAGAAACTA GCGA
|
Protein sequence | MSARRDVDAR HASTSEKDSE RKPSRTTTAD IASAPTFRPT LEEFADPIAY LSSIEARARE AGICKVIPPR GAAPRWNGEA WRRDDARFET KLQNVHSLSE GRTFQFGKEY AKGEYEAMAK AYEERWAKER PDVDANDANA LERAFWDMVE TRSEQARVEY GNDLDTKIFG TGFGVDENGE KHPWDFEHLY SHPLNLLRVV EHDIPGLTKP WLYLGMLFAT FCWHVEDHFL CSLNYLHRGA AKTWYGVPGS DAEAFENCAR ATVPRLFEQA PDILHQIVTI VPPGVLVDHG VKVVHTVQQP GEFVVTFPRA YHAGFSHGFN VAEAVNFGHV NWLDFGRRAI DVYSTGSFKR NAVFAHHRLV SRAAETFVEV LGKNARLVKS KAMGAIVSTL RKELETILSD EEIYRASLVR RGLNIEIVQA PNEDDDACCI RCKAMPFLSV VRCKCLPTAV RCLRHAMDAC DCAAGERTLE IRVVDSRLRE LIKALFFGDG IQTKNDAAKA RVDFSANVNR VAVNRAPPPK PKVVLPKPKT VKPPPTRAVL ASPPPTRIVA SKADDAFTAR GLPRKRAKCE TRRRWTAEMV ADFEVAVERL GGVDAATGKK LAEALSAHDV TRDQCASRLQ KHREKIKSNA DARATFMASK RPRGDATDAS TPRARVSEED ARAPVSLKSL LERWNLGDVV NTAGVSRRIK CRLVPVCRTK DDERARIARG EPVICASAEY ASEVFDAIGT ARDGLAWATR SSFSNFDAER GKVAMRSGWG APGTHVLSNR DMTIVTSFAE VVDPANEKPL NLQMFHREDT AVPVLSRKLK WPSEEEFFGI EGEDAPGRLL DDATRVSARG AMTWWHLDDC GEFVCQVGLP EAGEAAEDVL LGPTGKPVVK LFIFAQRKDY AWVAQDAEMN KSYKNCALDL FDTPDHYYPT ASEMCHPSSA PLDVSSPKAF DGAATSDDAE DPCPTFWVAP LEAGGPPLLS PPNIIHCVLT VRDCVMCEER RLSLAYMDEV LYFQRRAARW CEPPIFYAFV REDLSDTEKA RSNAMRPLVK MLNDLKRVGA TDGDAYRFAR CLTSLRVLAN HSPEFYALDA DGVAEARKSI DKLESWLADD SNCEFVEKIQ AAVKADPRAV EDAELAESMM SETLGVLNLA DGRSCAVVHE RGRPRWGPVR NSKSLVDKDR KDMKNAIRSG TLDALLLAYR RDII
|
| |
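The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any computation. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_content` are hypothetical, and the record does not say how its 61% GC content value was derived, so this is an illustration rather than a reproduction of the database's method.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence, ignoring whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence shown above.
sample = "CCGCCGCGAC CTCCATGAGC"
print(len(clean_sequence(sample)))  # 20
print(gc_content(sample))           # 0.75
```

To apply this to the full record, paste the complete Gene sequence value into a string and pass it to `gc_content`; multiplying the result by 100 gives a percentage comparable to the GC content field.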