Gene PHATRDRAFT_50272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50272 
Symbol 
ID7199114 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011696 
Strand
Start bp157425 
End bp160809 
Gene Length3385 bp 
Protein Length1043 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185220 
Protein GI219130119 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.102491 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTCGCC TTTTGAACAA AAAGCGACGT CGTCGCAAGA TATCTTTAGA AGTGCACCGA 
ACCTACCGCT CCCCAACGAA GTTCTACGAC GAATGCTTCG TCCGGGTGTT GGAACCAGCC
ACCAAGGCAG GGCGCCGTGC GGCTCGCCGT CGCGCTCGCC ACCGCACTTC TGGTCAGCCT
GAAGCGTCTC TGGCTACCAT TGACGCGGAG GATGACGAAG ATGATGGAGA CGAAGATTCT
ACCGTTGTCC CGGCCGAGTA TCTGGACGGG GCCCGGGATG AAGAAAACAC GGGTATTGAG
GGGTGGAGAG CCAAGTATCG TTTCCCAATC AGCGGGTTAA AGGTCAAAAA TACTTACAAA
GATGCTGTTA TTGTCGCGAT TCTGCTGGGA AAACTCAAGC AGACTCGAGA ACTTATTTTT
GACAGCGTGA GTCTTGCGGA AGATTTTTGC AAGGTCTTGG AAAAGGAGCT GGAAGGCGAA
GAACAACGGG CTGAAGCTAA GGTCAAAGCG GAATTTGGCG ATATCGTCAT GCCACAGGGC
GAAATCACTT TGCTGATGGA GATTGTTTCG GGTTGGAATT TGCCGATTGG CGACTTCGAC
AAATCAGACC CGTTCGTGAT CTGTATGTTG AACGGCAAAG AAGTTCACCG GACCAAATAC
ATCTCCAAAA CGTACGTTTT GCGAACGCTG GAACACTACA CGTTTTCGAG GATTCTTGCT
TACCACTTCG TCTCTCTGCT GACAGGTTGG ATCCTATTTG GACACTCAAG ACGGGATCTT
TGTTCCTACT CACAATAAAT CCGAAGGAGC TCTTCCTCAG CGAAGGCCTG CTCTGCCTCG
TCATGGACTT TGACAAGGTC GGAAAGAACG AGAAGCTTGG GGCTATTACA ATTCCTCCCC
GAGTTCTTTT TGATTCAAAA GGTGACCGTA TGGAATTCAA ACTTGGGCCC CCTCCTGGGA
AAACTGGAGA AGTTGACGGT CATCTGGCTA TACGCTGTCG ACGGGCGTCC GAACACGACA
TCCACTTCTT GAAAGACTAC GCAGATTCGC AGAAACGCAA TGCCCTGCAA AACCGCTTTC
ACAAAGAACC CGCACAGAGT ACAGACACGA AAGGAGGCTC TGGCAATATT GCCTCCTACT
TTCGCAGGCA AAGTCGAACG GTCAAGGATG GCAACGAGGA AGTCAAGGAG TACAAAGTTC
GACCTGGTCC ACATCCGAAG CGCAAAGATC AAACAACATG GATGACTCAT GATCAGGTGG
AAAAGGAGAG TTTGAAGGAA TCAGAGGAAT GGATTGATAC AGGCAGTGGG AAACTCGGGC
GACTATTTGT CGAAATTATT GGATGCGACG ACCTTCCAAA CCTTGATACC GGCGGACGCA
ACAAAACGGA CACTTTCGTC TCGATTGTGT ACCAAGATTC TGTAGTTTCA ACAGACATCA
TCGATGACTG TTTGAGTCCT CGATGGATGC CCTGGACTAA GCGTGCGTTC ATTTTCCACA
TCATGCATAG CAGCAGCCAA CTCTTTCTTG GAGTTTTCGA CTTCGACGAA GGTATCAATC
CAACTGACGA TCACGACTTG GTTGGACGTG TTTCGGTCGA TTTGACGAAC TTACGCAAAG
ACACATTATA CACTTTAAAG TATAACATCT TTACGACAGC CCGCATGGCA GATCGCAAGC
GGCGGGGGTC GATCACGGTA CGATTGCGTC TCGAAATTGA AGACGACCGA AAACTCTTAC
TAAGCAACCT GGAGCCTCCG CCTGACATGT ATGTCAACGT GAAAAAGCGC AAAGATTTCC
GAGTAGTGCG ATACACATGC ACGGGAAAGT ATGACATGAC GAAGTATGAT ATGAAGTACA
TCAATTCGTA AGTTTGTCCT GTAAAGTCGT TCGACCATAT CGTCGGGTGA TTTTTTCTGA
TAAACTATCT TTGTCTTTTT CAGGTACATC GAAGAGCTAC TGTCAATCCA GCATGTCCTA
TATTACCTAC AGGATGCACT CATGGTATTG ATCTTGTGGC GTGGAACGCT ACCAATTGAA
ATTCGTGGCG AAACGTACAA GTTCCCTGTT CACTCTCTAT CTGCTTTTAT TGCTGCAGTT
TTGTTGGTAG AGCAGCCCCA ATTGATCCCC TCGCTGTTTT TTGGGTGTAT TGCTTGGTTA
ATTATAGCAA TCATGGACTA CCGCCAAAAC CTACCAGACC TCTGGAGTCG TTGCAAAACT
TTTCGTGAGT TCATATACAT ACTTTTTGTC GGAAAATCAC CTATTTCACC TCACAACATC
AAGCAATACG AGCAATACGA AGAAGCCAAA AAGTTTCTCG AGGACCAGCA AAAACGTATT
GAAGAGTCGG AGAAGGCAGC TGAAAGAGCG TACGAAGAGT CTGTCAAAGC GCAGGAAGAG
TACGAGCGAG AAATGGAAGA AATTGGAGAA GCAGATGTGG ACATTAGTAC GAAAACAGGT
GGAGTATCCC TCGACCCTTT TAAACCTATT TTATTTCCCG TCCAGCAAAA TCTTGCGCTG
ATTTGTCGAT ATCTGCGACA TGTTCGCTAC GTTCTGTTTT GGGAAGAATG CTACATTGCA
TTCTGGGTTT CTGCCGGATG TCTTCTGCTG TCAATTATCT GTGTGTTTAT TCCTTGGTTC
TTTCTGATCA AATGGACGTC TCGATTTTTG GTTTGGTTCA CATTTGGTCC TTGGATGAAG
CTTGTGGATG TCTACTATGT TGGCAAGATA AAGCCACCTA CCGAAGCGGA GATTCAGGAG
AAGAAGAAAC TGGATCGAGA AAAGCGCCGT CTTCAGACCT CCGCCGCCGC TGCAAAGGCT
CGGGTTAAAA GAGAGAACGC AACAAAGCTT AAGGCTATGA AAAAGTACAT GTTTGGGAGG
TACATTGCCA AGGTTCCAAT TCTAAAAGAA GACCGTTACC GTGACCTTCC ATTACCCTCC
TCCACTGCCG TTCCATACCG CCCTAAGCCG CTACCGTTGT CTGAACTCGC GATGCAGGAA
GCAGGCTATC ATCGAACTCG CCTTCCCGGC CAACATCTAG TTGGTGACAT GATTCCTAGG
GTGAGTCTGA CATAAAGTCT CGTCTTTTAT GAAAGTGGAA CTGATACTAA CACATCCATT
TGATTTTAGG CGGAAACTCT AGGTTTTACA GAAGCTCCAA TAGGCCAAGC AACTGCACAC
CCACGTCTTG TGGATAAAAA GCGGCCCGGT GGTAATATCT CTTCAGGTTT GGAGTCGACC
ACTAGTGCTT ATGCCAAGAT TGGATCGCTC ATTGTTGCGG CTGGTTTGAT TAGCTGGTTT
TGTGTCCCTG TATTTGCCGC GATGGCCGAG AAAGTTATCA ATTTCTTCTA GCCTTCTAAA
ACTTTTTTAA AACATAGGCA CTTGA
 
Protein sequence
MPRLLNKKRR RRKISLEVHR TYRSPTKFYD ECFVRVLEPA TKAGRRAARR RARHRTSGQP 
EASLATIDAE DDEDDGDEDS TVVPAEYLDG ARDEENTGIE GWRAKYRFPI SGLKVKNTYK
DAVIVAILLG KLKQTRELIF DSVSLAEDFC KVLEKELEGE EQRAEAKVKA EFGDIVMPQG
EITLLMEIVS GWNLPIGDFD KSDPFVICML NGKEVHRTKY ISKTLDPIWT LKTGSLFLLT
INPKELFLSE GLLCLVMDFD KVGKNEKLGA ITIPPRVLFD SKGDRMEFKL GPPPGKTGEV
DGHLAIRCRR ASEHDIHFLK DYADSQKRNA LQNRFHKEPA QSTDTKGGSG NIASYFRRQS
RTVKDGNEEV KEYKVRPGPH PKRKDQTTWM THDQVEKESL KESEEWIDTG SGKLGRLFVE
IIGCDDLPNL DTGGRNKTDT FVSIVYQDSV VSTDIIDDCL SPRWMPWTKR AFIFHIMHSS
SQLFLGVFDF DEGINPTDDH DLVGRVSVDL TNLRKDTLYT LKYNIFTTAR MADRKRRGSI
TVRLRLEIED DRKLLLSNLE PPPDMYVNVK KRKDFRVVRY TCTGKYDMTK YDMKYINSYI
EELLSIQHVL YYLQDALMVL ILWRGTLPIE IRGETYKFPV HSLSAFIAAV LLVEQPQLIP
SLFFGCIAWL IIAIMDYRQN LPDLWSRCKT FREFIYILFV GKSPISPHNI KQYEQYEEAK
KFLEDQQKRI EESEKAAERA YEESVKAQEE YEREMEEIGE ADVDISTKTG GVSLDPFKPI
LFPVQQNLAL ICRYLRHVRY VLFWEECYIA FWVSAGCLLL SIICVFIPWF FLIKWTSRFL
VWFTFGPWMK LVDVYYVGKI KPPTEAEIQE KKKLDREKRR LQTSAAAAKA RVKRENATKL
KAMKKYMFGR YIAKVPILKE DRYRDLPLPS STAVPYRPKP LPLSELAMQE AGYHRTRLPG
QHLVGDMIPR AETLGFTEAP IGQATAHPRL VDKKRPGGNI SSGLESTTSA YAKIGSLIVA
AGLISWFCVP VFAAMAEKVI NFF