Gene PHATRDRAFT_50157 details


Gene Information

Locus tag: PHATRDRAFT_50157
Symbol:
ID: 7198941
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011695
Strand:
Start bp: 216186
End bp: 221538
Gene Length: 5353 bp
Protein Length: 1494 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002184992
Protein GI: 219129641
COG category:
COG ID:
TIGRFAM ID:
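
The coordinates and lengths listed above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the coding region ends with a single stop codon. The function name `inclusive_length` is hypothetical and used only for illustration.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
gene_length = inclusive_length(216186, 221538)

# Coding length implied by the protein: 1494 codons plus one assumed stop codon.
protein_length_aa = 1494
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that are not part of the coding sequence
# (for a spliced gene these would be introns and any untranslated flanks).
non_coding_bp = gene_length - coding_bp

print(gene_length, coding_bp, non_coding_bp)
```

The gene span being longer than the implied coding length is consistent with the record marking the gene as spliced.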


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGATCGAATC AGCTTTCTAG CGGGTTGGCT CATTGAAAGT CCGTTCGAGA GTGAAACTAT 
CTCGCATTAT ATTTGCCAGT GTGTAGCTTT TGTCCTTCCT TTGTCGAGCC CAGCCCACGA
ACTGTTCGTC CGGTTTCGAA AATTCGAAAC CAAAACAACC TAAAGACATT TGTATCCGAA
CCTGAACACC ATGTCGTCCT ACTTTCGTCA ACAAGCTCCG GCTCCACTCC GCACCGTGCA
TCGCGACTAC GAGTATACGT CAGCAATCAA CGGAACAACT CCCACACCGA CGTCGGCTTC
GGCGCATGGT GTAGTCCAGA ACCAGTTTTA CCGCACCAGG GCGCCGCCTC CATCGCGGTC
TCCCCGGAAC GCTCGTGTCG AACCCGACGT AGACGCCGAG GAAGCCTTGC TAGATCACGC
TCATTTACGA GAGCTGCACG AAGAGGCGGA GAAAATGAAA GCACTTGGGA ACAAACACAT
GGCAGCACAG GTACGTCCCT CGCAAGCTCC AGATGCCAGT CTTGAGGCGC GCGCCATTCC
CAGCACTGGT TGCACTCGCT CACGTCCCTA CTCTTTATTT TTCTTCCGTT GTTGACAGGA
ATATGCCAGA GCCTACAATG CCTATTCGGC GGCGCTTCAG CTCGCTCCGG TCGGACCCTC
TTCCCACGTG TTTTTGTCGA ATCGAGCCGC CGCCTTATTG AGTATGAAAC GGTACGAGGC
CGCGGCCACG GACGCAAAAC GAGCCATAGT CCTGGCTCCA ACTTTCGGCA AGGCCCACGC
TCGCCTGGGA CAGGCTCTCT ACTTTTTGAA AGACTACTAG GCCGCGGTCG AAGCCTACGA
AGAAGCGGTA GCCTACGAAC CCGACAACGC CACCACGCTC ACGTATTTAG AAAAGGCCAG
GGCCAAAGGT GAGCGCTACA ACACTCGAGC CCGCGGCGAC GACGGCTCCG TGGGGGGCGA
TGCGTCGACT GCCTACTCCA TACAAAATAG TGTTGCTACG GATCACTACC AAAAGGGCGT
CGTCGAGTCT GGTTATAGAG GAATTACCAA TCAATCCGTT TTGAACGCAG CCGTCAAGTC
GCCCAGGGAG AGAGCCACCG GGTCGTATCG TGCCAGTCTT TCGCCATCGT ACCAGCAGTA
CGACATGAAC GAAGATGACC CTGACTTTGA TGAAGCCCTG CGCATTCAAC AGCGCGCCGC
TAAATTCCTC ACCAACAAGG CGTACAGGGC AGCTATTGAA GAATACACGG CGGCGTTGTT
CTTGGTTCCT GACGACCCCA ACCTTTCACC AGAATTGCAT TTGGGTCGAG CGCACGCCTT
GAACGGATCA CGACGGCACG AATCCGCCAA AAACGACGCC CGTATGGCTA TTCGGCTTAA
CCCTCAGCCC GCTGCCTTTT CGACAATGGC CAAGTCACTA TTTTACATGA AAGACTATCG
AGGTGCCGTT GAGGCCTTTG AGGAATGCGT CAGGCATCTA CCTGCAGGCG AGACCCTTGG
CATGTTTGAC AAAGCGTATT TACAAAAAGC CCAGGCTGCT CTCGATGAAG AAGAATTCAG
TTTGCGGATG GCGGGAACTC CAACGCGCCA GCCAAAAACG CCTATTCCCA AACTCCCCCC
ACCCCGTTTT GTTCCACGGG AACAAGCCAT GCAGTCATCG CCACAAGTGC CTCCCATGCC
CAAACAGTGG CCTCAGCAAT CGTCGCTCGC CCCTTCCACC CTGCGTTGTG GACCGGAACG
GCAGGTTTTC TTCTTGTCGG AAGGTCTAGG CATCAAACTG AACCGCGGAC CCGACGGTAT
TGTACGGGTC TTGTCGGTGA CTTCGAATAC TCCAGCGGCT CCGGTTGCCC GTAGAGGCAT
TATTGAAGCA GGTGACGTGG TTCGTGAAGC CGCTGGCGTC GACATTCGTC GGCCTATTAC
AAACATTATG TGGGGCGACA CGGTCGCACT CATCAAAATG GCAGCCCGGC CAATTGTGCT
CGTCGTTGCG AAAGAAGTCT CCAAAGTGCC TTTGTCGGTA TTGGAAGAAC AAATGAAGGC
CTTGTCGCCT TTTGGATCGA CATCAACAAA ATTTGGTGGG AACCACGTTT ACCGTCCGTC
GAAATCGAGT GGCGACGAGA CAGTCCGGTA TGTCTTGGAA GAATCCATGG GTACGCCAGT
AAGTATGGAG TGTTCCTGTT TCTTGTGCTT TGTGGATACA GTAGGACGAC TCCAGTTTCG
CTGACGTCTT TGTTGTTTCA CTCAGAGTAG CTAGCTGCTG GTCTGCCAGG TGAGGATATA
GTAGACGAGG AGGGCACGGG TGTAGAGATA ACTGAGGCTG GACCGTTGGA AGAAGAAGAT
ATCGAAAGCA ATGATGAGGA AGTTGAATCC GACTCCGCTT CGGACGTAGC TGTTCTAGCC
GGCGAACCTG AAAAAGAAGA CGTTAGTGAT ACTGTGGATG CTTTATTGGA CGAATTAGAG
AAGATGGAAG TGAGAAAAAG TAACTCGGAT GACGTGGAAG GCTCTGGTAC ACGCCGCATG
TCAGCCGCAG ATTATGAGCT GGAAATGCTG TGCACCGAGA TTGAAGCGAC CAACTCGGAA
AGGAAGTCAG TCGAATCACC TCCGACAGTG GATGCAATGC CTCTTGATGG CGATGACGTG
CCTCCGGAGC GTGTGATTAC TGTCAAGCCA GGCAGATGAG GGGGAAGGAA ATGGGGGAGC
TGCCAAGAGC CTATCTTTAC GTGATCGAGA AGAGCAAATG GTTGGAGGGG AGATTCTTTT
TGGCTCGGAA GCAAATTTGT CTACCGGGAG CTGGGACAAT TTGCGTTGGA TGTCCTACTC
GGGGTCCCGC AAAATACGAT TTTGTCAGAT GATTTATCGC CTTCTCACTC CAGAAAAGAA
GAACATGTTT TGGGTGACAT CGGGTAGAGC ATATGAGAAG CGGGGGCTAG CCATTTATGA
AGAGCCGCGA TTAATTCTGG TTCTGCGGAG GGTGGTAGAT ATGCAGGAGC TCCGACTACT
TCTAGGTCTA CCTGACATCG CCGAAATAGA CAACCCAGAC GTTGCTTTAA CGCGTTATTG
GGTTGTCGAA AGTGCTGTGG ACCCTGCGGT CAGCAGGCTA CGTCTGTCTC CTCTCACAAC
TCCAACATCA TGGGGAAGCG AACAAGCGGA CACCAGGGAG AAATCCTGTT TTGAACTTTT
GTCGCCGGCG GAATCGATCA TGCTCTCGGC CGTACGAGTA CGTGAAGGAA TCAAGAAGAA
AGAACGATCT TTCGTTGACA GTGGTGCTTT CCTGGAAACG ACTGCAGTCG AAACCGCTCT
CACAAAAGCT CTTTGTGATG CTAACGACCA CGCCGGTAAA ATTGGATCTC TGGATGTAGA
CATGACGTGG AAGCACCAGG TTATTTTGGG AACGCTTCAC TCGATTGTCC TCTCCGGAAA
TCTTAAAGGA TTGGAGGAGG CAATACAACG ATTGCGAGTT TCCGTGAAAG ATGGTAATGG
GTCATCAAAA TTTCTTCCAA CTCGTGTAGT CGACCCGCTT GATGAGAACG GCCGCACTCC
TTTGTACTAT GCTTGTACTT GTCGCATGAG CACTGCTGTA GCATGTCTTA TTAACTCTGG
GGCAAGGATA AACGTCAAGA CAACGTCGGG CGGTATGGCT TTGAGTCACA TTTGTGCATC
AAACCTCGAC GATAAGAGTC TTTCGATTGT GCTTTCGGCG ACACGTCCCT CTAGGCTTGA
TCCGAACGAG CTTGACACCA TGGGAAGAAC GCCGATGTAT GTAGCCCTCG TCAACGGTCG
TTCAGTGGCC GGAACACGAG ATGCCCGGGC TCTGAGTCGA TGTCTTGTCG CTTTGGCAGC
ATGGGGTGGT CGGATAATTG TGACCGAAAC GACTTCATTA GCAAACCCGG TGAAAGTGTT
AGCATCTGAG TGGCGATCAG AGGACCTTTC TGTACTTCTG GATCATATTG GTTTCCGGTA
TCCTCTTCGG AAGCCACAAT CCTCGGATCT ATCACCGATC GCGCTGTCTC TGGGTGCATT
CTATAACTTT CCAATACACA GTGCGTTGAT TTCTCTGCAT GGTCAGTTGG AAGCAGTAAC
TTGTCGAGAC GAAGCTTCTT CGTATACTGG CGTTCAGCGA ACAATCCGAA CTCTCTTACT
GAAAAGTTTC GAGCCCAATG AGCGCTTGGA TTTTTGTCAA TCAACGATGA CCGCTGCTCC
TGAGCTGGCA AATTTCGCTG GTTTTACGCC CCTCCAAATT CTCGCGGCTT CCGCTCTGCA
GCTGGACGCG GTTGAGGCGC AGATCGACGA CGACATCTAC CTTAGTCTTG TTGCTTTGCT
CGCTGAAGTT GGTGAGCTGC TGGTGAAGAA CGGGGCTCGA ATATCTCTTG ATGCGCCATC
GTTTAAGAGA ATACGTCGAA ATGCGTCTAC CGAGGGTGTT ACTACTAGCA AGAGTCAAAA
GGGCGATTCA GTTGTGGACG TTTATCGCTC ATCTTTGAAA ATTGATTCGA ATAAGAAAAT
AACTAAGATG CTGGGAGGCG CAGAAAGACT CTCACGGGCC CGCAAAGAGT TTATGCAGCT
AACAGCGGTG AATGCTTCGC CGGATATGAC CGTCAATTTA AACCTTGGTG ATGCTTTGCC
TCTGGAAGAT ACCAGTGAAG CTGGTGGTAA TAACGAAAAG TCTTGCGCCA TTTGCTGGGT
TGTTTTCGGC GCTCTCATGA ATCGCAAGCA CAAGTGTCGA GTCTCTCGAC GTTATATCTG
CGACGAATGT TCCACCAAAC GAATTCTTTG CGATGGTAAG GAATACCGAT TAAGTGACGG
TCAATTTGCT TTGGCCAGAG CAGACGCCGA CGAAGTTGCC AACGAGCGTG AAGCTGACTT
AAATGCGAGA GCGCGCGATA CGTCCATGGA GAGTCGGGTA CCGTTTGCTC AAGGATCTGA
GAGATTGCCG GAGAAGAAGC CTGCCGCCCG AAAGTCTTTA AAACAACTTC GTCTCGAAAG
GCTTGAAGCG GAGGGGGAAG CAGATCGTAA TTCGTTGTTT GGGGGAATCA TGGGATCCGC
AGCCAAATTA TTTGGTACTG AAGGAGAACC GCAAACGCCG ACTCAATCGG ACGAAGTGAA
GGGGTTAAGC GATTCGTTAG GACAAACACG TAACGCGTTG TTGGAACGCG GCGACAAATT
AGCGACACTG GACGACAAAT CAGCAAAAAT GGTGGACGCA AGCGCGGACT TTGCTCGAAT
GGCGAAAGAG CTTCGCAAAA AGTCGGAAAA ATCATGGTTC GGCTAATGTG TCAGTGGCAA
ACGTGTGAAG TATGTGAACT TTCTGTAAGC ATTCTATAGT AGACCATTGC ACATTTATAT
AGCTTCGAAA CGT
 
Protein sequence
MSSYFRQQAP APLRTVHRDY EYTSAINGTT PTPTSASAHG VVQNQFYRTR APPPSRSPRN 
ARVEPDVDAE EALLDHAHLR ELHEEAEKMK ALGNKHMAAQ EYARAYNAYS AALQLAPVGP
SSHVFLSNRA AALLSMKRYE AAATDAKRAI AAVEAYEEAV AYEPDNATTL TYLEKARAKG
ERYNTRARGD DGSVGGDAST AYSIQNSVAT DHYQKGVVES GYRGITNQSV LNAAVKSPRE
RATGSYRASL SPSYQQYDMN EDDPDFDEAL RIQQRAAKFL TNKAYRAAIE EYTAALFLVP
DDPNLSPELH LGRAHALNGS RRHESAKNDA RMAIRLNPQP AAFSTMAKSL FYMKDYRGAV
EAFEECVRHL PAGETLGMFD KAYLQKAQAA LDEEEFSLRM AGTPTRQPKT PIPKLPPPRF
VPREQAMQSS PQVPPMPKQW PQQSSLAPST LRCGPERQVF FLSEGLGIKL NRGPDGIVRV
LSVTSNTPAA PVARRGIIEA GDVVREAAGV DIRRPITNIM WGDTVALIKM AARPIVLVVA
KEVSKVPLSV LEEQMKALSP FGSTSTKFGG NHVYRPSKSS GDETVRYVLE ESMGTPLAAG
LPGEDIVDEE GTGVEITEAG PLEEEDIESN DEEVESDSAS DVAVLAGEPE KEDVSDTVDA
LLDELEKMEV RKSNSDDVEG SGTRRMSAAD YELEMLCTEI EATNSERKSV ESPPTVDAMP
LDGDDVPPER ADEGEGNGGA AKSLSLRDRE EQMVGGEILF GSEANLSTGS WDNLRWMSYS
GSRKIRFCQM IYRLLTPEKK NMFWVTSGRA YEKRGLAIYE EPRLILVLRR VVDMQELRLL
LGLPDIAEID NPDVALTRYW VVESAVDPAV SRLRLSPLTT PTSWGSEQAD TREKSCFELL
SPAESIMLSA VRVREGIKKK ERSFVDSGAF LETTAVETAL TKALCDANDH AGKIGSLDVD
MTWKHQVILG TLHSIVLSGN LKGLEEAIQR LRVSVKDGNG SSKFLPTRVV DPLDENGRTP
LYYACTCRMS TAVACLINSG ARINVKTTSG GMALSHICAS NLDDKSLSIV LSATRPSRLD
PNELDTMGRT PMYVALVNGR SVAGTRDARA LSRCLVALAA WGGRIIVTET TSLANPVKVL
ASEWRSEDLS VLLDHIGFRY PLRKPQSSDL SPIALSLGAF YNFPIHSALI SLHGQLEAVT
CRDEASSYTG VQRTIRTLLL KSFEPNERLD FCQSTMTAAP ELANFAGFTP LQILAASALQ
LDAVEAQIDD DIYLSLVALL AEVGELLVKN GARISLDAPS FKRIRRNAST EGVTTSKSQK
GDSVVDVYRS SLKIDSNKKI TKMLGGAERL SRARKEFMQL TAVNASPDMT VNLNLGDALP
LEDTSEAGGN NEKSCAICWV VFGALMNRKH KCRVSRRYIC DECSTKRILC DGKEYRLSDG
QFALARADAD EVANERLKRR GKQIVIRCLG ESWDPQPNYL VLKENRKRRL NRTK
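
The sequences above are printed in space-separated blocks of 10 residues, 60 per line. A minimal sketch of how such text could be joined back into a single string and checked for length and GC content follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "CGATCGAATC AGCTTTCTAG CGGGTTGGCT CATTGAAAGT CCGTTCGAGA GTGAAACTAT "
seq = clean_sequence(first_line)

print(len(seq))          # number of bases on this line
print(gc_percent(seq))   # GC percentage of this line only, not of the whole gene
```

Running the same functions over the full gene sequence would give a value to compare against the listed gene length and GC content.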