Gene PHATRDRAFT_54330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_54330 
SymbolDph1 
ID7199631 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp774434 
End bp777962 
Gene Length3529 bp 
Protein Length1010 aa 
Translation table 
GC content45% 
IMG OID 
Productdiatom PHytochrome 1 
Protein accessionXP_002179062 
Protein GI219116534 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.784777 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTCGCCTTAC AGTACAAGTT TGGTATTTCC CAAAGTATTC AAGGAACCAC GCCATGAGCG 
GGGCAAATTA TAGAGAAGCC GACTTCCCTG GTGTCCGTGC CGCGGGTCGG CACAACAATT
CCATTACAAC CAAAGAGCTG ACGGAATGTG ATCGTGAGCC TGTGCACTTG ATCGCAAACG
TACAAGGGGG TACCGGCCAT TTGTTGTTCA TTCACTACCC GTCTGGAAAA ATCTTGGCTC
ATGATCGCGA CATCGAACAC ATTCCTTGGA TCCGGTGTCA CGAAAACAGA ACAGTTACTG
CTGGCCGCAC TGGGGCGAGA ACTACATCTT CTTTCCATGG AGAACAGCAG AGTGGTGAGA
GTCCTCACGA AGCTATTGGC ATATCTGGAG GCTTTTTACT GAACTGGGTT CCGCACGATT
TCTACGAGAA GATTCTCGAT TTGGTCCTCG GTATTATCCA TTCCGATACG CACAGAAATT
TTTATTTTTA TTCATATGAT GGTTCAGCGT ACGCTATTTC TATTTCAGCG ACGGAAATGG
ACTACTCCGT GATTGGCATC GAAATTGAAA CAGTTGGTTT GGATGATGTG AGTTTTCCTG
CAAATACAAG GGCAGCCTGA CGCACTTGCT TGCTAAACGC TCTCTTTTCG TCTTACTTTT
TCCCTTTTTG TAGACTGCCT CCCATTTTTC ATCCTCATTG TTGCATTTGG GACGTATTGT
GGAATTCTAC CAGCACGAAG CAATTGCCAA GACAGCCTGC GACACTGTTT TTCACCTGTT
GGGAAAGTAT GACAGGGGCA TGGTGTACCG ATTCCACGAT GATCTGTCCG GCGAGGTCGT
GCACGAGATT AAAGCAAATC ATGTGGAATC CAGCTATCTT GGCATGCGAT TTCCTTCCTC
TGATATTCCT TTGCCATCGC GACAGCTTTA TATAAAAAAT GGTGTGCGGT ACATTTACGA
CGTTGATACC GAGGATCTAC CGATTTTATC CCTGGACAAT GAAAAGATGG ATCTCAGTCA
AATTCGCATG CGTGCTGTAG CCAAACCGCA TATTGTGTAC TTAAGAAATA TGGGGGTGGT
GTCGTCGTTG AGCTTGGCGA TTGTTGTCGA CAATGATCTG TGGGGGTTGC TGGCTTTTCA
TGGGTACGGC GCGAGGTACA AGCCTTCGCT CCATCAGCGA ATTGCTTGTG AAACCATAAG
TGCGATGGTC TCAGTTCGTA TTGAATCTCT CATGAAAAAG GCGCAGAGTG CCCGAATTAT
TAAGTTGGGC CAGTGCACTA TGAGCTTAAA GCATGACCAG AGCCTGATTC ACAATCTCTA
TGAATGGGGT GAAGGCATAC TCGAAATTGT TGATGGAGAT GTGTTGGTTG CACATGTACA
AGATCCTAGA GATGGCGAAG GCGACAGAAT TGTGCTGGGT GATCCTTTGT TGGTACCGAA
GGATTCTTTT TGGACTAAGA TGAGTTCCTA TCAGAATCGC GAACTCTGTG TCATTTCAAC
ACGCAAAGCT CTCACAGATA TCAAATTGAC ACAAGAAGAG TGCCCAGCAA GTGGAATTGT
ATTTTTCCAA GAGGGTCGTA CTCAGATCAT GATTGGACGA GCAATGCGAT CCAAAGATGT
CGTATGGGCG GGTAATCCTG ACGAACCAAA ACTAAGGATT GGAGGAATTT TGAATCCGCG
CAACTCCTTT ACTCAATTCA TTGAAAAAGC GCGAAAGGAA TCACGAGCCT GGACTGTGCA
AGATATTAGT GTGATTTCTG TGCTTCGTGA CCGTATATGT GAGCATTCGT ACGCATACAT
GATGGGATTA CTGAGAGGTG ATATTCAAGA TGCAAACCGG AAATATTTGG CGGCAATTGA
CCGAGCGCGG GACAATTACG AATTCTTTGC GCATATGAGG TAAGGCAGAA GCTTATCGAC
GTTAGAAGAG TTTAATCCTT TCCCCTCTTA CTTTTTGCTT TTGAAACAGC CACGAACTAC
GGACTCCTTT CCATGGCGTT ATGGGATGCT TAAGTATTCT GCATGAGTCA ATTGAAGATA
TGCCAGCAGC GGAAGTCAGA GATGTTGTCG ATACAGCAAT AGCTTCCGGA AACCACATGA
TCAATCTTCT CAACGATATT CTGGACATCT CGAAGAACAA ACACTTGTCT CATATATCGG
CGCAGGATAA GGTTATTTAC CAGACGTTAG CCTTCGAGAC AATTGACTGT ATGAAGTCAC
TGGCCACTTC TCGAAAGATC GAGATGAGAT CGTCAATCGA GCCGAAAGGC TTGGAAAAAG
TGGTGATTGT GACGGATCGT ACAAAAATTA TTCAAATCGT TTCCAACGTT GTGAACAATG
CCATCAAGTT TACGGGTGAA GGGACTGTCG ATGTTGTATT TAGGCTCGTT GATTCGCTGC
AAGAGGCAAC TATGATGTGG GAGCGAGGCG CGGAAGTTCA TGCTGGATCA GTGTTTTCGA
TGAAGGAGAG TGAAATGCAC ACATCGGCTG AAGAAGTAAG ACGGAGCACT ATGACGTTTA
ATGAGACGCA TGATCAAAAG TGGATGACAA TGAGTGTTTC AGACACCGGA TGCGGTATGG
AGCCGTGTGA ACTAGTAGAA ATGTTCTCAC CATATACCCA ATCGAGTCAT GGATCCAATC
GCATTTTTCA GGGAACAGGG CTTGGGCTTT TCATTTGCGT TTCATTATGT TACCAGCTCA
ATGGTTTTAT TTCTTGTGCA AGCACCCCCG ATAAAGGAAC ACTTTTTCAT ATGGGAATCC
CAGTCGGATT GTTAGCTGAA GACACAGTTG AGGGAAATCA GACACTAACA GATGATACGA
AGGAAACAGA GAGCGTGATC CAAATGTCGG GTCCGATTTT GATCGTAGAT GACAATGTTG
TGAACGTCAA AATTCTAAAC CGGGCGCTAC TTTTGGATAT TAGAAGAGCT GGTCTTGCAA
TAGAGGTCCT CACAGCAGGG GGTGGGGCTG AAGGTGTCCA GGTCTTTCGA GACAAGCGCC
CCAGTCTATG CATTATCGAC TATCACATGC CCGATGTCGA TGGCATTGAA GCGACCTGCA
CCATACGGAA ATACGAGCAA GAAAACAAAA TTGATCCTAC CTACATTTTG ATGTACACTG
CTGATGCCAC AGAGCAAGCT AGAGCATTAA TCTTGAGCTC CGGCGTTAAC GATATCATGT
CCAAGCCTCC GCCGAAGGGA TTCATTGCCG GATTGGTGCA GAGGCTGCGG GTTCCGGAAT
AGCATGGTCG TTCATTCATA GAAGAATCGA CTTCTCTTCC AGATTTCGAA TGCTTGTAGC
TTTCGTTTAT TCTTCGGCCG CGCAGCTGCT GCTATCAATT CACAATGTTA AAATATATAC
CTTCGCTTCC ACTACATGGT AATTACAAGC GTGGCTGGAG ATAGCTCGTG TGTATTTCTC
AAACACACGG CCAGAAAGTC TTTGCATCTC GGCACTAGAC TTTTCTATGG AAGAGACATC
CGAAAGAATC GCCAAATACG TGATAAAATA AAATTGAATG CCCACGACA
 
Protein sequence
MSGANYREAD FPGVRAAGRH NNSITTKELT ECDREPVHLI ANVQGGTGHL LFIHYPSGKI 
LAHDRDIEHI PWIRCHENRT VTAGRTGART TSSFHGEQQS GESPHEAIGI SGGFLLNWVP
HDFYEKILDL VLGIIHSDTH RNFYFYSYDG SAYAISISAT EMDYSVIGIE IETVGLDDTA
SHFSSSLLHL GRIVEFYQHE AIAKTACDTV FHLLGKYDRG MVYRFHDDLS GEVVHEIKAN
HVESSYLGMR FPSSDIPLPS RQLYIKNGVR YIYDVDTEDL PILSLDNEKM DLSQIRMRAV
AKPHIVYLRN MGVVSSLSLA IVVDNDLWGL LAFHGYGARY KPSLHQRIAC ETISAMVSVR
IESLMKKAQS ARIIKLGQCT MSLKHDQSLI HNLYEWGEGI LEIVDGDVLV AHVQDPRDGE
GDRIVLGDPL LVPKDSFWTK MSSYQNRELC VISTRKALTD IKLTQEECPA SGIVFFQEGR
TQIMIGRAMR SKDVVWAGNP DEPKLRIGGI LNPRNSFTQF IEKARKESRA WTVQDISVIS
VLRDRICEHS YAYMMGLLRG DIQDANRKYL AAIDRARDNY EFFAHMSHEL RTPFHGVMGC
LSILHESIED MPAAEVRDVV DTAIASGNHM INLLNDILDI SKNKHLSHIS AQDKVIYQTL
AFETIDCMKS LATSRKIEMR SSIEPKGLEK VVIVTDRTKI IQIVSNVVNN AIKFTGEGTV
DVVFRLVDSL QEATMMWERG AEVHAGSVFS MKESEMHTSA EEVRRSTMTF NETHDQKWMT
MSVSDTGCGM EPCELVEMFS PYTQSSHGSN RIFQGTGLGL FICVSLCYQL NGFISCASTP
DKGTLFHMGI PVGLLAEDTV EGNQTLTDDT KETESVIQMS GPILIVDDNV VNVKILNRAL
LLDIRRAGLA IEVLTAGGGA EGVQVFRDKR PSLCIIDYHM PDVDGIEATC TIRKYEQENK
IDPTYILMYT ADATEQARAL ILSSGVNDIM SKPPPKGFIA GLVQRLRVPE