Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_54330 |
Symbol | Dph1 |
ID | 7199631 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 774434 |
End bp | 777962 |
Gene Length | 3529 bp |
Protein Length | 1010 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | diatom PHytochrome 1 |
Protein accession | XP_002179062 |
Protein GI | 219116534 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.784777 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCGCCTTAC AGTACAAGTT TGGTATTTCC CAAAGTATTC AAGGAACCAC GCCATGAGCG GGGCAAATTA TAGAGAAGCC GACTTCCCTG GTGTCCGTGC CGCGGGTCGG CACAACAATT CCATTACAAC CAAAGAGCTG ACGGAATGTG ATCGTGAGCC TGTGCACTTG ATCGCAAACG TACAAGGGGG TACCGGCCAT TTGTTGTTCA TTCACTACCC GTCTGGAAAA ATCTTGGCTC ATGATCGCGA CATCGAACAC ATTCCTTGGA TCCGGTGTCA CGAAAACAGA ACAGTTACTG CTGGCCGCAC TGGGGCGAGA ACTACATCTT CTTTCCATGG AGAACAGCAG AGTGGTGAGA GTCCTCACGA AGCTATTGGC ATATCTGGAG GCTTTTTACT GAACTGGGTT CCGCACGATT TCTACGAGAA GATTCTCGAT TTGGTCCTCG GTATTATCCA TTCCGATACG CACAGAAATT TTTATTTTTA TTCATATGAT GGTTCAGCGT ACGCTATTTC TATTTCAGCG ACGGAAATGG ACTACTCCGT GATTGGCATC GAAATTGAAA CAGTTGGTTT GGATGATGTG AGTTTTCCTG CAAATACAAG GGCAGCCTGA CGCACTTGCT TGCTAAACGC TCTCTTTTCG TCTTACTTTT TCCCTTTTTG TAGACTGCCT CCCATTTTTC ATCCTCATTG TTGCATTTGG GACGTATTGT GGAATTCTAC CAGCACGAAG CAATTGCCAA GACAGCCTGC GACACTGTTT TTCACCTGTT GGGAAAGTAT GACAGGGGCA TGGTGTACCG ATTCCACGAT GATCTGTCCG GCGAGGTCGT GCACGAGATT AAAGCAAATC ATGTGGAATC CAGCTATCTT GGCATGCGAT TTCCTTCCTC TGATATTCCT TTGCCATCGC GACAGCTTTA TATAAAAAAT GGTGTGCGGT ACATTTACGA CGTTGATACC GAGGATCTAC CGATTTTATC CCTGGACAAT GAAAAGATGG ATCTCAGTCA AATTCGCATG CGTGCTGTAG CCAAACCGCA TATTGTGTAC TTAAGAAATA TGGGGGTGGT GTCGTCGTTG AGCTTGGCGA TTGTTGTCGA CAATGATCTG TGGGGGTTGC TGGCTTTTCA TGGGTACGGC GCGAGGTACA AGCCTTCGCT CCATCAGCGA ATTGCTTGTG AAACCATAAG TGCGATGGTC TCAGTTCGTA TTGAATCTCT CATGAAAAAG GCGCAGAGTG CCCGAATTAT TAAGTTGGGC CAGTGCACTA TGAGCTTAAA GCATGACCAG AGCCTGATTC ACAATCTCTA TGAATGGGGT GAAGGCATAC TCGAAATTGT TGATGGAGAT GTGTTGGTTG CACATGTACA AGATCCTAGA GATGGCGAAG GCGACAGAAT TGTGCTGGGT GATCCTTTGT TGGTACCGAA GGATTCTTTT TGGACTAAGA TGAGTTCCTA TCAGAATCGC GAACTCTGTG TCATTTCAAC ACGCAAAGCT CTCACAGATA TCAAATTGAC ACAAGAAGAG TGCCCAGCAA GTGGAATTGT ATTTTTCCAA GAGGGTCGTA CTCAGATCAT GATTGGACGA GCAATGCGAT CCAAAGATGT CGTATGGGCG GGTAATCCTG ACGAACCAAA ACTAAGGATT GGAGGAATTT TGAATCCGCG CAACTCCTTT ACTCAATTCA TTGAAAAAGC GCGAAAGGAA TCACGAGCCT GGACTGTGCA AGATATTAGT GTGATTTCTG TGCTTCGTGA CCGTATATGT GAGCATTCGT ACGCATACAT GATGGGATTA CTGAGAGGTG ATATTCAAGA TGCAAACCGG AAATATTTGG CGGCAATTGA CCGAGCGCGG GACAATTACG AATTCTTTGC GCATATGAGG TAAGGCAGAA GCTTATCGAC GTTAGAAGAG TTTAATCCTT TCCCCTCTTA CTTTTTGCTT TTGAAACAGC CACGAACTAC GGACTCCTTT CCATGGCGTT ATGGGATGCT TAAGTATTCT GCATGAGTCA ATTGAAGATA TGCCAGCAGC GGAAGTCAGA GATGTTGTCG ATACAGCAAT AGCTTCCGGA AACCACATGA TCAATCTTCT CAACGATATT CTGGACATCT CGAAGAACAA ACACTTGTCT CATATATCGG CGCAGGATAA GGTTATTTAC CAGACGTTAG CCTTCGAGAC AATTGACTGT ATGAAGTCAC TGGCCACTTC TCGAAAGATC GAGATGAGAT CGTCAATCGA GCCGAAAGGC TTGGAAAAAG TGGTGATTGT GACGGATCGT ACAAAAATTA TTCAAATCGT TTCCAACGTT GTGAACAATG CCATCAAGTT TACGGGTGAA GGGACTGTCG ATGTTGTATT TAGGCTCGTT GATTCGCTGC AAGAGGCAAC TATGATGTGG GAGCGAGGCG CGGAAGTTCA TGCTGGATCA GTGTTTTCGA TGAAGGAGAG TGAAATGCAC ACATCGGCTG AAGAAGTAAG ACGGAGCACT ATGACGTTTA ATGAGACGCA TGATCAAAAG TGGATGACAA TGAGTGTTTC AGACACCGGA TGCGGTATGG AGCCGTGTGA ACTAGTAGAA ATGTTCTCAC CATATACCCA ATCGAGTCAT GGATCCAATC GCATTTTTCA GGGAACAGGG CTTGGGCTTT TCATTTGCGT TTCATTATGT TACCAGCTCA ATGGTTTTAT TTCTTGTGCA AGCACCCCCG ATAAAGGAAC ACTTTTTCAT ATGGGAATCC CAGTCGGATT GTTAGCTGAA GACACAGTTG AGGGAAATCA GACACTAACA GATGATACGA AGGAAACAGA GAGCGTGATC CAAATGTCGG GTCCGATTTT GATCGTAGAT GACAATGTTG TGAACGTCAA AATTCTAAAC CGGGCGCTAC TTTTGGATAT TAGAAGAGCT GGTCTTGCAA TAGAGGTCCT CACAGCAGGG GGTGGGGCTG AAGGTGTCCA GGTCTTTCGA GACAAGCGCC CCAGTCTATG CATTATCGAC TATCACATGC CCGATGTCGA TGGCATTGAA GCGACCTGCA CCATACGGAA ATACGAGCAA GAAAACAAAA TTGATCCTAC CTACATTTTG ATGTACACTG CTGATGCCAC AGAGCAAGCT AGAGCATTAA TCTTGAGCTC CGGCGTTAAC GATATCATGT CCAAGCCTCC GCCGAAGGGA TTCATTGCCG GATTGGTGCA GAGGCTGCGG GTTCCGGAAT AGCATGGTCG TTCATTCATA GAAGAATCGA CTTCTCTTCC AGATTTCGAA TGCTTGTAGC TTTCGTTTAT TCTTCGGCCG CGCAGCTGCT GCTATCAATT CACAATGTTA AAATATATAC CTTCGCTTCC ACTACATGGT AATTACAAGC GTGGCTGGAG ATAGCTCGTG TGTATTTCTC AAACACACGG CCAGAAAGTC TTTGCATCTC GGCACTAGAC TTTTCTATGG AAGAGACATC CGAAAGAATC GCCAAATACG TGATAAAATA AAATTGAATG CCCACGACA
|
Protein sequence | MSGANYREAD FPGVRAAGRH NNSITTKELT ECDREPVHLI ANVQGGTGHL LFIHYPSGKI LAHDRDIEHI PWIRCHENRT VTAGRTGART TSSFHGEQQS GESPHEAIGI SGGFLLNWVP HDFYEKILDL VLGIIHSDTH RNFYFYSYDG SAYAISISAT EMDYSVIGIE IETVGLDDTA SHFSSSLLHL GRIVEFYQHE AIAKTACDTV FHLLGKYDRG MVYRFHDDLS GEVVHEIKAN HVESSYLGMR FPSSDIPLPS RQLYIKNGVR YIYDVDTEDL PILSLDNEKM DLSQIRMRAV AKPHIVYLRN MGVVSSLSLA IVVDNDLWGL LAFHGYGARY KPSLHQRIAC ETISAMVSVR IESLMKKAQS ARIIKLGQCT MSLKHDQSLI HNLYEWGEGI LEIVDGDVLV AHVQDPRDGE GDRIVLGDPL LVPKDSFWTK MSSYQNRELC VISTRKALTD IKLTQEECPA SGIVFFQEGR TQIMIGRAMR SKDVVWAGNP DEPKLRIGGI LNPRNSFTQF IEKARKESRA WTVQDISVIS VLRDRICEHS YAYMMGLLRG DIQDANRKYL AAIDRARDNY EFFAHMSHEL RTPFHGVMGC LSILHESIED MPAAEVRDVV DTAIASGNHM INLLNDILDI SKNKHLSHIS AQDKVIYQTL AFETIDCMKS LATSRKIEMR SSIEPKGLEK VVIVTDRTKI IQIVSNVVNN AIKFTGEGTV DVVFRLVDSL QEATMMWERG AEVHAGSVFS MKESEMHTSA EEVRRSTMTF NETHDQKWMT MSVSDTGCGM EPCELVEMFS PYTQSSHGSN RIFQGTGLGL FICVSLCYQL NGFISCASTP DKGTLFHMGI PVGLLAEDTV EGNQTLTDDT KETESVIQMS GPILIVDDNV VNVKILNRAL LLDIRRAGLA IEVLTAGGGA EGVQVFRDKR PSLCIIDYHM PDVDGIEATC TIRKYEQENK IDPTYILMYT ADATEQARAL ILSSGVNDIM SKPPPKGFIA GLVQRLRVPE
|
| |