Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_28551 |
Symbol | NURF-140 |
ID | 7201969 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 774580 |
End bp | 778431 |
Gene Length | 3852 bp |
Protein Length | 1023 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | imitation switch isoform 1, alias nucleosome remodeling factor 140 kDa subunit |
Protein accession | XP_002181260 |
Protein GI | 219121827 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0148713 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATGAGGAGA TCGATTCCGA CGTTGAAATG GAGGGCATGG ATCTGGAGGA CGACGACGGC CAAGTCGACG AAGCGGGAGA CGACGAGGAA GCGACCGAAT CGGACAATGC ACCCGTTGAA AACGAAGAAG ACTTTGCGGC TTTGGCGACG GAAGAAGCTC AGGAGATGGA AGAAGCGCGT CGGGAACGTA CGGAACTCAT GGCAGCCGAG CAGAAAAAAG CCATGGGCAG CAATCCACAG CCCTTGACAG CGGCGGAGCG ATTGGAATAC ATTCTCGCAC AATCGGACGT CTTTGCGCAT TTTCTGGCCG GTACGTGAGA GGACAGTGGG AGGAAACGGG GAGGCCAGTC ACCCGCAACG ACACCGTACT CGCGTGTGCG TGTGTGTGTG TGTGTGTACT AACCATCCCA CACTACTCCT GACGAGTGTA GATTCGCTCA CTAAAAAAAA TCATTGTTCT TTCTTTTACC AACGACAGGA TCCGTCGCAG CGGGCAGCAA AAAAGGCAAA GGTTCGCGAG GCAAAAAGGG ACGCATGACC GAGGCCGAAG AGGACGCCCA GCTTCTCAAG TCGGCGCAGT CCAAGCGTCG GGTGATCCGG GTCGACCAGC AGCCCTCCAA CCTAGCCCCG CACTGTCGTA TGCACCCGTA TCAGCTAGAG GGACTCAACT GGCTCATTAA ATTGCACGAT CACGGGATCA ACGGAATACT GGCGGACGAG GTACGTCAAT CGTCGAAACT TCCAGGCGTC GGTCGTTCCC GTCGAAGCCT ACTACTCACT CCGATATCTC CCTTTTCGAG TATAGATGGG TCTTGGCAAA ACGCTCCAGA CTATTTCGCT TTTGGCCTAC CTTCGGGAAA GCCGCGGAGT GCGGGGGGCA CATATGGTCA TTGTCCCCAA ATCGGTCGTT GGCAATTGGA TTCGGGAGTT CAAAAAGTGG TGCCCCTCTA TCAAGGCCAT TCGGATGGGA GGAACCAAGG ACGAACGACA GAAATTTGTC ACGGAAGACT TGCCTTTGGA TCCCAACACT GGTAAACGAA AGTTTGATGT CCTCGTCACC TCGTACGAAG GTTTGCTCCG GGAAAAGGGC AAACTCTCCA GGATTCCGTG GAAATACGTC ATGTAAGTAC GACCGCAGCG CGGCACCCAC GGTGCTAATA GGGCGGTCCA AAACTCTTTA TTTCACGTTG TGGTACGCTC ACGCAGTCCT GTCTTCTTTC CTCACAGCAT TGACGAGGCG CACCGTATCA AGAATGAGAA TTCGTCGCTT AGCAAGGTCG TCCGGACCAT GAAGACCGAA TTTCGCTTGC TCATTACGGG AACTCCTTTG CAGGTACGTT CGTGTGTTTG AGTGTGTGTG TGTGTGTGTT TGCGTGTAAA AGTCGAAGCG TCACCACGCG TGCGCTTATA GTGACGAACA GGAGGACCGT CACTAACATT TGTACCCTAT TTGATTCACA GAATAATCTG CGTGAGCTAT GGGCGCTTCT CAACTTTTTA ATGCCCGATA TCTTTGGAGA CGCTGAGCAA TTCGACGAAT GGTTCAGTTT AACGGACGCG TCGGGCAAGG AAAACGTAAT CAAGAAGCTC CATACCATTT TGCGACCGTT CATGCTTCGG CGCGTCAAAA AGGACGTCGC CACTTCGCTC CCGCCCAAGA AGGAAACGAA ACTCTACATC GGGTTGACAA AAATGCAACA GGAATGGTAC GTTCGTTGTC TCCAGAAAGA TGCGCACGAA CTAAACAAAC TTGGTGGTCC GGATCGCAAT CGCCTTTTGA ACGTTCTGAT GCAGTTGCGA AAGGTCTGCA ATCATCCGTA CCTATTTGAC GGAGCCGAAC AGGGCCCACC CTATATTGAC GGGCCTCATC TCTGGGAAAA TAGTGGCAAG ATGCAACTTA TGCACAAGCT ATTGCCCAAG CTCCAAGCGA AAGGATCTCG CGTTCTCATC TTTTGTCAAA TGACCCGCGT TCTTGATATA CTCGAGGACT ACTTTCGATT GACAAAATTG GAATACTGTC GCATCGACGG TAACACGGAC GGTGAGCGTC GGGATTCTCA AATGGACGAG TTCAACGCCG AAGGCTCAAG CAAATTTGCC TTTCTTTTGA GTACTCGTGC GGGGGGTCTC GGTATCAATT TGGCGACCGC CGACATTGTT ATTCTCTACG ATTCCGATTG GAACCCCCAA GTGGATCTCC AAGCAATGGA TCGTGCCCAC CGCATAGGAC AAACCAAACC GGTACAGGTC TTCCGGTTCG TGACGGAGGG TACCGTGGAA GAAAAAATCA TTGAGCGCGC CGACCGCAAG CTATTCCTCG ATGCCGCAGT TATTCAGCAA GGGAGATTGG CCGAGCAGCA TTCGTCGTTG GAAAAGGGCG ATCTCATGAA GATGGTACGG TTCGGTGCCG ATCAAATTCT CAGTGGAACA GGGGGGACCT ACACTGACGA AGATATTGAC GCTTTGATTG CTCGTGGAGA AGAGCGTACC ACCGAAATGC AAGCCAAATT GCAGACTGAC GCCCAACACA ATTTGGCCAA CTTCAGCTTG ATGGCCGAGG ACGAGGGTGG GACCGACACA TTCTCTTTTG GAGGAAAGAA TTATCGCGAT TCGGAAAAGT CGGCCGGAAA TTTTATCAAC TTGCCACAGC GACAAAGGAA GCGCAACTAC GATCTGAGCG GCTCTACGGG CAATACTGGC TTGTCAGGCA GCATGAAAGC GCACGCCGCT GATGCGGCAG CCAAGAAGAA ACGCAAGGGA CCAGCGTTAC ATGACTACCA ATTGTTTAAC ATGTCACGAC TTCATGCATT GATCGAAAAA GAGCGCGCTC TGGCGTCTGC GAAAGAGCAG GAGGTAAAGC TTATTGGCGA GTTGCGTCAA CGTGCGATTG AAGCTCCTCC GTTGGGAAGT GGTCACGCTC CGGGGTACAG TCGCGAAGAG CTTTTGCAAC TGGCTCATGC GAAAGAGCAA GGTCTTGATA CAAGCTTTGT TTTGTCCTCG AAAGAAGAAG CCGAACGAAG GAGCCTCTTG GCGGAAGGAT TTCCGGATTG GAGCCGAAAG GACTTTCGCT CCTTCTGTTT GAGCTTGGAA CGCCACGGAC GGTACGATTT CAATAGTATT GCGCAGGACG TTTTGCGTGA AACGGATAAA GATTTGCAGG AAGTCCAACG CTATTTTGTG GCCTTTTTTA CAAACTACAC GCGAATCAAT GACTGGCAAA AGATTCTTGA CAAGATTGAA CGCGGGGAAA AGAAGATTCT GCGACTGCGA CAAATCCGTG ACGCTATTCA GGAGAAGGTG GAGCGCCATT TGGAAGACGC GTTTGGTCCA CACTACGGAG AGAACGCTAA AAACGGAGAC GAATCCCAAA AGTTGCCGCC TGTATCTGAA TTGCTGCACT ACTCGTGGCC CCAGATGAGG ATAAGCTACG GTTCTAGTGG CCGAGGTCGG GGATATCAGG AAGAGGAAGA TGCCTTTCTT GTTTGCATGA TGTACCGGCA TGGCTATGGA GCGGCAGAAC GCATTCGTAT GGAAATCCGC CGCGCTTGGC AATTTCGATT CGACTGGTAC TTTAAATCTC GTTCGGCGCA AGAAATCCAG AAGCGATGTG ACATGTTGGT GCGCGTCGTG GAACGCGACA ACGCTGAAGT GCGAGAAAAA GAAGCAGAAG AAGAACGTAA GCAAGAACGG GACAGCTTTA CCGTCCCCAC TCCACAATCG GCAAAGGATA GTTTCCAAAA GGTTCCTGAA GGAGCGGTTG CCCAGTTTGC GTCTAACTAA GCAAGTCTGG CGAGATCTCG AACCGACAAC AGAACGGTAA ATGACCAAAA GTTATCGAAT GAATGTATAC GT
|
Protein sequence | MEGMDLEDDD GQVDEAGDDE EATESDNAPV ENEEDFAALA TEEAQEMEEA RRERTELMAA EQKKAMGSNP QPLTAAERLE YILAQSDVFA HFLAGSVAAG SKKGKGSRGK KGRMTEAEED AQLLKSAQSK RRVIRVDQQP SNLAPHCRMH PYQLEGLNWL IKLHDHGING ILADEMGLGK TLQTISLLAY LRESRGVRGA HMVIVPKSVV GNWIREFKKW CPSIKAIRMG GTKDERQKFV TEDLPLDPNT GKRKFDVLVT SYEGLLREKG KLSRIPWKYV IIDEAHRIKN ENSSLSKVVR TMKTEFRLLI TGTPLQNNLR ELWALLNFLM PDIFGDAEQF DEWFSLTDAS GKENVIKKLH TILRPFMLRR VKKDVATSLP PKKETKLYIG LTKMQQEWYV RCLQKDAHEL NKLGGPDRNR LLNVLMQLRK VCNHPYLFDG AEQGPPYIDG PHLWENSGKM QLMHKLLPKL QAKGSRVLIF CQMTRVLDIL EDYFRLTKLE YCRIDGNTDG ERRDSQMDEF NAEGSSKFAF LLSTRAGGLG INLATADIVI LYDSDWNPQV DLQAMDRAHR IGQTKPVQVF RFVTEGTVEE KIIERADRKL FLDAAVIQQG RLAEQHSSLE KGDLMKMVRF GADQILSGTG GTYTDEDIDA LIARGEERTT EMQAKLQTDA QHNLANFSLM AEDEGGTDTF SFGGKNYRDS EKSAGNFINL PQRQRKRNYD LSGSTGNTGL SGSMKAHAAD AAAKKKRKGP ALHDYQLFNM SRLHALIEKE RALAREELLQ LAHAKEQGLD TSFVLSSKEE AERRSLLAEG FPDWSRKDFR SFCLSLERHG RYDFNSIAQD VLRETDKDLQ EVQRYFVAFF TNYTRINDWQ KILDKIERGE KKILRLRQIR DAIQEKVERH LEDAISYGSS GRGRGYQEEE DAFLVCMMYR HGYGAAERIR MEIRRAWQFR FDWYFKSRSA QEIQKRCDML VRVVERDNAE VREKEAEEER KQERDSFTVP TPQSAKDSFQ KVPEGAVAQF ASN
|
| |