Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47755 |
Symbol | |
ID | 7202920 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 740505 |
End bp | 743381 |
Gene Length | 2877 bp |
Protein Length | 749 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181967 |
Protein GI | 219123304 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.373674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGTCTC CTTACGGAAA CAAGAAAAAG GATCTAGCTA GTTCTAGCAC TATGCGTGTT CCTGCGTCAA AGACAAAAGA GTTGCGTCTA TTCGGACTCA CGATTTTGAT TTCCCTTTTG AACGCGATTT ATTGGTCATC GGAAATGTTG GGTTACCGGT TGAGCGACGG GGAGCCACGG TTCCTGACGC TCCCTAGCAC TTCATCGTTA CTGTCTTCCC CATCCTTTAC ATACCCGGCA ACATCTACCA AAAAACTTCC TGACGCGCCT TGGACAATCT TTTATAATGC GTACCTGCCG CTGGAGAACA CCACTGAAGC CTTTGCGATC ATTGAGGAGC AGTTGCAGCA GATCAATAAC TCTCACGGTG CTACAAGACC GGGACTTGCC CCAGTTAAGA TAAACTTACT CACGATCGGC GATCCGATTG GAGCGGAGAA GGTGCGAGCC TACTGTTCGA CGGAGACTAA TCTAGACTGC GAGCATTTGC AACACCATGA GGACGGCAGT TTCGAAGACG TTACCCTGTC TCGGTTATAC GACTTTTGTA GCAACCAGGA GACCGATGAT AATAAACTCG ATCCCAGCAC CTCGCCACAC GATCCGATTG TTGTCTACCT CCATTCGAAG GGAACTTATC ACCCCAGCGA AAGGAATGAT CACTGGAGGC GGTACATGAC AGACGCGGCG ATGAGTCAAG AATGCTTGGA GTTTCAAGAT CAACACGGCC GAGAGATTGC CCCAACGACA AACATTTCTT CTACTCATAA CACTCAGTGT AACGTGTGCG GATTTCTATT TCATCCGATA TGGACACCTT TTTTTCCAGG CAATATTTGG TCCGCCAAGT GCAGCTATAT CCGAAAGCTG ATGCATCCAC AAGTCTTTAA AAACCAATCC GAGTCGACGG CTGACAAGGC TGTGGCTCTC ATGAGAGAAG GACGTTTCAC TATGAATTTG CTATCCCCTT TTTTGGAAGT GGGCGGTTAC TTGGGACGTG AGCGATTCGC AGATGAGCAT TGGGTGGGGA GTCATCCCTC ATTGGTACCT TGTGACTTGG CTCCTAGGCC GTACCTGATG AAGTGGGTCC ACCCCCGACT ACGGAACAGT TTTCGTGCTC CGCCACTCGA ATGGTCAGTA GCGCCTCGCC ACGGAATCAA CGAAGATTGG TTGTTCTTGC AGGGTCTTCA AAAGAAGTAT CGGCTGGTGG CCGACCCTGA TTGGCGTCAT CGAGAGTTCT TCTTGTTGCC GGGAGCGCTG TGGAAGTGGA TCACCTGGTA CGACAAGGTA CCGCCGGCGT CTTCGTATGT GTGGAAATGG TTTCCGGATG GACTAGAATG GCTGGGACGC GTGGAAACAT TGGGTACGCA AGGTCTGCTG CAGTTTTTGA CTGAAGACAC CTTTCTATAC CCGTCAGATG AAGCGTCGAG TGTGAATCCG TTTCAGATCG TCGAGAAAGA TGATAAAGAA GGTTCATCGC GAACATTCTT CTTCCACATA CAGATTCCGA AGCAGCTTGC TGACGAGAAG AGCTTTCGTG GAGTTGTGCT TCGACGGCTT GAATCGATTG GCGAGCAATC CCCGGGAGCA ACGGTTTTCT TCAACACAGT AGGGGAATCG GGTATACTGG ACGTGGAAGA GATGAAAAAC TTTTGTAGAG AGAAGTTTCA TCTAGATTGT GTGCACATGG AGCACTTGGA TGCCGGGATG GATCTTGTCA CTTTGAATCG AGTACATGAG TTCTGCATGA GCCACAATAC ATCACGCGTT GGGTTTGTTA GGACAACAGG ATGGCCCGAG TTGTCGGCTG CTTTTGCGAG TCAAGAACGC CTAGAGCAAA CCAGAGTTGT GGCAAATGAT AAGTGCTGGA CAAACACAGA TTGCGATGTT TGCTCACTAA AGACCAACTC CAAATCCGGT AACATTTCAC CGAAGATTCA ACGTATTGTT ATTTCGTCAC CGTCGGTTAC CCATGGTGAA GTTAACAAAG AAACACGTGG TGATAAATCT TCCCGCAATG GATTGAAAAA GCGCGCCAAG CAGGCTCAGA AGGGACGGAG TCCTTTAATG TGGACAGCGA GTTGTGCGTA CATCAATGAT TTAATTCACC CAAACGAGTT CGCCTCTCGT TTGTCGGAAG ATCGTTGGAA AGACAGCCAA GTAAGTGGTG AACGTTGGAT ATTTAACAGC ACTTCAATCA CGCACTGCCA AGTTCATACA AAGGTAACAG CAGGGCAAGC ACCGATTAGT AAGAAGTGAA ACAAAGGCTT AAAGGAGAAG GGGAAAAGGA TGAAGGAATC CATGTCAGAA GCGGAGCTCA GCTGCTAGGA TTGGGGATCT CTCTCAACAG CAGGAAATGT GTTTGTACAG AATACCAACT TTGGATAAAT TATAGTGCTC TATACTGAAT CCTTTAACGA ACACGAGATC TGGAAACACA CTGTTTCCTC TACATTTATG GACAACGAAT GAGCTGTTGT CCAAACAGAC ACTATACCAG AATGGCAACT TTCTTTGTCA AACACAGTTT CAGGAATCAT GGTTCAATAA CTGTAAGCAA CCCCCCCCTC CCCCTCCCCC TTCCCTTTAG ACCATGTCTT GCTTGACAAG ATCCCTGTAG GAATCTGCTA CCTCCGATCA TGTTTCTGGA AACATGAATA TCAGCATCTT CCCATCCTCC TTGGAGACAG GGTTTTTCCA GCTCAAATGA ACACAAAGTA TACAGCTGGG CCATCGTATC TACAGTCAAT CGCTTTTCAT AAAAAAGGGA TAACTGCAAG TCTTGGCGAT TGTTGCAGGC CTTCCTCCAA GTCCTCTAAC CGAACATCGC TTCCAATCTC GAGCGACTCT GGATGGGTCG TTCTAAGGTG TGCGTGT
|
Protein sequence | MVSPYGNKKK DLASSSTMRV PASKTKELRL FGLTILISLL NAIYWSSEML GYRLSDGEPR FLTLPSTSSL LSSPSFTYPA TSTKKLPDAP WTIFYNAYLP LENTTEAFAI IEEQLQQINN SHGATRPGLA PVKINLLTIG DPIGAEKVRA YCSTETNLDC EHLQHHEDGS FEDVTLSRLY DFCSNQETDD NKLDPSTSPH DPIVVYLHSK GTYHPSERND HWRRYMTDAA MSQECLEFQD QHGREIAPTT NISSTHNTQC NVCGFLFHPI WTPFFPGNIW SAKCSYIRKL MHPQVFKNQS ESTADKAVAL MREGRFTMNL LSPFLEVGGY LGRERFADEH WVGSHPSLVP CDLAPRPYLM KWVHPRLRNS FRAPPLEWSV APRHGINEDW LFLQGLQKKY RLVADPDWRH REFFLLPGAL WKWITWYDKV PPASSYVWKW FPDGLEWLGR VETLGTQGLL QFLTEDTFLY PSDEASSVNP FQIVEKDDKE GSSRTFFFHI QIPKQLADEK SFRGVVLRRL ESIGEQSPGA TVFFNTVGES GILDVEEMKN FCREKFHLDC VHMEHLDAGM DLVTLNRVHE FCMSHNTSRV GFVRTTGWPE LSAAFASQER LEQTRVVAND KCWTNTDCDV CSLKTNSKSG NISPKIQRIV ISSPSVTHGE VNKETRGDKS SRNGLKKRAK QAQKGRSPLM WTASCAYIND LIHPNEFASR LSEDRWKDSQ ESATSDHVSG NMNISIFPSS LETGFFQLK
|
| |