Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50272 |
Symbol | |
ID | 7199114 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | - |
Start bp | 157425 |
End bp | 160809 |
Gene Length | 3385 bp |
Protein Length | 1043 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185220 |
Protein GI | 219130119 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.102491 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCGCC TTTTGAACAA AAAGCGACGT CGTCGCAAGA TATCTTTAGA AGTGCACCGA ACCTACCGCT CCCCAACGAA GTTCTACGAC GAATGCTTCG TCCGGGTGTT GGAACCAGCC ACCAAGGCAG GGCGCCGTGC GGCTCGCCGT CGCGCTCGCC ACCGCACTTC TGGTCAGCCT GAAGCGTCTC TGGCTACCAT TGACGCGGAG GATGACGAAG ATGATGGAGA CGAAGATTCT ACCGTTGTCC CGGCCGAGTA TCTGGACGGG GCCCGGGATG AAGAAAACAC GGGTATTGAG GGGTGGAGAG CCAAGTATCG TTTCCCAATC AGCGGGTTAA AGGTCAAAAA TACTTACAAA GATGCTGTTA TTGTCGCGAT TCTGCTGGGA AAACTCAAGC AGACTCGAGA ACTTATTTTT GACAGCGTGA GTCTTGCGGA AGATTTTTGC AAGGTCTTGG AAAAGGAGCT GGAAGGCGAA GAACAACGGG CTGAAGCTAA GGTCAAAGCG GAATTTGGCG ATATCGTCAT GCCACAGGGC GAAATCACTT TGCTGATGGA GATTGTTTCG GGTTGGAATT TGCCGATTGG CGACTTCGAC AAATCAGACC CGTTCGTGAT CTGTATGTTG AACGGCAAAG AAGTTCACCG GACCAAATAC ATCTCCAAAA CGTACGTTTT GCGAACGCTG GAACACTACA CGTTTTCGAG GATTCTTGCT TACCACTTCG TCTCTCTGCT GACAGGTTGG ATCCTATTTG GACACTCAAG ACGGGATCTT TGTTCCTACT CACAATAAAT CCGAAGGAGC TCTTCCTCAG CGAAGGCCTG CTCTGCCTCG TCATGGACTT TGACAAGGTC GGAAAGAACG AGAAGCTTGG GGCTATTACA ATTCCTCCCC GAGTTCTTTT TGATTCAAAA GGTGACCGTA TGGAATTCAA ACTTGGGCCC CCTCCTGGGA AAACTGGAGA AGTTGACGGT CATCTGGCTA TACGCTGTCG ACGGGCGTCC GAACACGACA TCCACTTCTT GAAAGACTAC GCAGATTCGC AGAAACGCAA TGCCCTGCAA AACCGCTTTC ACAAAGAACC CGCACAGAGT ACAGACACGA AAGGAGGCTC TGGCAATATT GCCTCCTACT TTCGCAGGCA AAGTCGAACG GTCAAGGATG GCAACGAGGA AGTCAAGGAG TACAAAGTTC GACCTGGTCC ACATCCGAAG CGCAAAGATC AAACAACATG GATGACTCAT GATCAGGTGG AAAAGGAGAG TTTGAAGGAA TCAGAGGAAT GGATTGATAC AGGCAGTGGG AAACTCGGGC GACTATTTGT CGAAATTATT GGATGCGACG ACCTTCCAAA CCTTGATACC GGCGGACGCA ACAAAACGGA CACTTTCGTC TCGATTGTGT ACCAAGATTC TGTAGTTTCA ACAGACATCA TCGATGACTG TTTGAGTCCT CGATGGATGC CCTGGACTAA GCGTGCGTTC ATTTTCCACA TCATGCATAG CAGCAGCCAA CTCTTTCTTG GAGTTTTCGA CTTCGACGAA GGTATCAATC CAACTGACGA TCACGACTTG GTTGGACGTG TTTCGGTCGA TTTGACGAAC TTACGCAAAG ACACATTATA CACTTTAAAG TATAACATCT TTACGACAGC CCGCATGGCA GATCGCAAGC GGCGGGGGTC GATCACGGTA CGATTGCGTC TCGAAATTGA AGACGACCGA AAACTCTTAC TAAGCAACCT GGAGCCTCCG CCTGACATGT ATGTCAACGT GAAAAAGCGC AAAGATTTCC GAGTAGTGCG ATACACATGC ACGGGAAAGT ATGACATGAC GAAGTATGAT ATGAAGTACA TCAATTCGTA AGTTTGTCCT GTAAAGTCGT TCGACCATAT CGTCGGGTGA TTTTTTCTGA TAAACTATCT TTGTCTTTTT CAGGTACATC GAAGAGCTAC TGTCAATCCA GCATGTCCTA TATTACCTAC AGGATGCACT CATGGTATTG ATCTTGTGGC GTGGAACGCT ACCAATTGAA ATTCGTGGCG AAACGTACAA GTTCCCTGTT CACTCTCTAT CTGCTTTTAT TGCTGCAGTT TTGTTGGTAG AGCAGCCCCA ATTGATCCCC TCGCTGTTTT TTGGGTGTAT TGCTTGGTTA ATTATAGCAA TCATGGACTA CCGCCAAAAC CTACCAGACC TCTGGAGTCG TTGCAAAACT TTTCGTGAGT TCATATACAT ACTTTTTGTC GGAAAATCAC CTATTTCACC TCACAACATC AAGCAATACG AGCAATACGA AGAAGCCAAA AAGTTTCTCG AGGACCAGCA AAAACGTATT GAAGAGTCGG AGAAGGCAGC TGAAAGAGCG TACGAAGAGT CTGTCAAAGC GCAGGAAGAG TACGAGCGAG AAATGGAAGA AATTGGAGAA GCAGATGTGG ACATTAGTAC GAAAACAGGT GGAGTATCCC TCGACCCTTT TAAACCTATT TTATTTCCCG TCCAGCAAAA TCTTGCGCTG ATTTGTCGAT ATCTGCGACA TGTTCGCTAC GTTCTGTTTT GGGAAGAATG CTACATTGCA TTCTGGGTTT CTGCCGGATG TCTTCTGCTG TCAATTATCT GTGTGTTTAT TCCTTGGTTC TTTCTGATCA AATGGACGTC TCGATTTTTG GTTTGGTTCA CATTTGGTCC TTGGATGAAG CTTGTGGATG TCTACTATGT TGGCAAGATA AAGCCACCTA CCGAAGCGGA GATTCAGGAG AAGAAGAAAC TGGATCGAGA AAAGCGCCGT CTTCAGACCT CCGCCGCCGC TGCAAAGGCT CGGGTTAAAA GAGAGAACGC AACAAAGCTT AAGGCTATGA AAAAGTACAT GTTTGGGAGG TACATTGCCA AGGTTCCAAT TCTAAAAGAA GACCGTTACC GTGACCTTCC ATTACCCTCC TCCACTGCCG TTCCATACCG CCCTAAGCCG CTACCGTTGT CTGAACTCGC GATGCAGGAA GCAGGCTATC ATCGAACTCG CCTTCCCGGC CAACATCTAG TTGGTGACAT GATTCCTAGG GTGAGTCTGA CATAAAGTCT CGTCTTTTAT GAAAGTGGAA CTGATACTAA CACATCCATT TGATTTTAGG CGGAAACTCT AGGTTTTACA GAAGCTCCAA TAGGCCAAGC AACTGCACAC CCACGTCTTG TGGATAAAAA GCGGCCCGGT GGTAATATCT CTTCAGGTTT GGAGTCGACC ACTAGTGCTT ATGCCAAGAT TGGATCGCTC ATTGTTGCGG CTGGTTTGAT TAGCTGGTTT TGTGTCCCTG TATTTGCCGC GATGGCCGAG AAAGTTATCA ATTTCTTCTA GCCTTCTAAA ACTTTTTTAA AACATAGGCA CTTGA
|
Protein sequence | MPRLLNKKRR RRKISLEVHR TYRSPTKFYD ECFVRVLEPA TKAGRRAARR RARHRTSGQP EASLATIDAE DDEDDGDEDS TVVPAEYLDG ARDEENTGIE GWRAKYRFPI SGLKVKNTYK DAVIVAILLG KLKQTRELIF DSVSLAEDFC KVLEKELEGE EQRAEAKVKA EFGDIVMPQG EITLLMEIVS GWNLPIGDFD KSDPFVICML NGKEVHRTKY ISKTLDPIWT LKTGSLFLLT INPKELFLSE GLLCLVMDFD KVGKNEKLGA ITIPPRVLFD SKGDRMEFKL GPPPGKTGEV DGHLAIRCRR ASEHDIHFLK DYADSQKRNA LQNRFHKEPA QSTDTKGGSG NIASYFRRQS RTVKDGNEEV KEYKVRPGPH PKRKDQTTWM THDQVEKESL KESEEWIDTG SGKLGRLFVE IIGCDDLPNL DTGGRNKTDT FVSIVYQDSV VSTDIIDDCL SPRWMPWTKR AFIFHIMHSS SQLFLGVFDF DEGINPTDDH DLVGRVSVDL TNLRKDTLYT LKYNIFTTAR MADRKRRGSI TVRLRLEIED DRKLLLSNLE PPPDMYVNVK KRKDFRVVRY TCTGKYDMTK YDMKYINSYI EELLSIQHVL YYLQDALMVL ILWRGTLPIE IRGETYKFPV HSLSAFIAAV LLVEQPQLIP SLFFGCIAWL IIAIMDYRQN LPDLWSRCKT FREFIYILFV GKSPISPHNI KQYEQYEEAK KFLEDQQKRI EESEKAAERA YEESVKAQEE YEREMEEIGE ADVDISTKTG GVSLDPFKPI LFPVQQNLAL ICRYLRHVRY VLFWEECYIA FWVSAGCLLL SIICVFIPWF FLIKWTSRFL VWFTFGPWMK LVDVYYVGKI KPPTEAEIQE KKKLDREKRR LQTSAAAAKA RVKRENATKL KAMKKYMFGR YIAKVPILKE DRYRDLPLPS STAVPYRPKP LPLSELAMQE AGYHRTRLPG QHLVGDMIPR AETLGFTEAP IGQATAHPRL VDKKRPGGNI SSGLESTTSA YAKIGSLIVA AGLISWFCVP VFAAMAEKVI NFF
|
| |