Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47802 |
Symbol | |
ID | 7203045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 87630 |
End bp | 89502 |
Gene Length | 1873 bp |
Protein Length | 600 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182154 |
Protein GI | 219123693 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCGCACACA ATTGACGGCA ACGGCTAGTC TCGACAACGG CATTTCCTCG CCTTCCAATG CGCACGCAGA ATGTCGGTGG TTCCTGTCAA CGAAGGCAAG CTTGTGGGAG GCTTGACGGA AGATCATTTG CGACATCGAC ACGCAATGAA TGACGATCAT GACCGTACCA AGGCAACCGA AGGACCCGTG CCGTATGGGT TTAAAGTCAA CGACATTGGC ATTGACGACA AGTTAACGTC GACCGAGCTC GAAATGCAGC CACTGCATTT AAAGGAAAGT GATGGAAACT TGGAAGAGGA TGAACGAATA CTGTATCCGA TCACCGTCGG TACTACTAGT ATGTACGAGG GCTGGAGAAA AACCTTGGAC GATTTTCTCT TTCCTCCACA TCTACCCAGA AGTTGTCAAT TACTGCGACC GGAAAATATT GCCGTGCCAG CGTGCTACCT GCTAGTGGGG CTTCTGCAAG GTCTTTCTTC ACCACTGATT AATGTGTTTC CGCTGGATCT GGGTGCCACA GAAGCTCAGC AGACGACAAT TTCGTCGATC CGATCTCTAC CCGCATCCTT CAAGCTCGTC TTTGGGTTTA TGAGCGACAA TATCCCTATC GCAGGTTACC GAAGAAAACC GTACATGCTG ATGGGATGGC TTCTGGCTAG TCTATCACTT TTTTCGCTCA TTCTTGGTTC CAATCTGAAC ATTACCCCCC GCAATGCCGG TTGCTTCGAG TCCCAAGCCA GCGACAGCGA TTCGCCGACG ACACTGCCAG CGGACGCACC TTCTATACCC TTTTTCTCCG TCGCCCTCTT GGCCTTCGGC ACCGGCTTTT GGCTCGCCGA TGTCATGGGT GACAGCATTG TCGCCGAAAA AGCCAAACTG GAACCACCGG AAAGCCGCGG ATCCGTACAG TCCAGCTGCT ACTCGTACCG ATTTTTCGGA ATCATGGTGG CGGCGCCCTT GTCCACGTAC CTGTACGCCA CGTACGGGCC CCGGGCGGTG CTCCTGCTCC TCGCCACACT GCCCTTGTGT ATCTTGCCTT TGGTCTACCT GCTCTTTGAA GTGGAGAACG CTCCGGTCAG CTCGACGGCC GACCAGTGTC GTGAAATTTG GCGGACCGTC TGCAGTCGAG CTGTTTGGCA GCCCATGGGA TTCGTTTACG TGTACAATCT TATGCAAGTG AGCAACGCTG CGTGGCGAGA GTTTCTCGTC ACCTCCTTGC GGTTCACATC GTGTCAACTC AATCTGATCC TCATTGTGGC CTACGTGCTG TTGTACCTTG GGATTCTGGC CTACAAGTAC TACATGATGG ACTGGTCCTG GCGCAAAGTC TACTTCGTTA CCACTCTACT GAACGGATTC TTCAGTCTAC TCCAAGTCTT GTTGATTTAC AACATTACCT TGGGTTTGTC CAGTTTTTGG TTCGCCCTCG GCGACGACGC CTTTGCCGAA TTTATTGGTG GCATTCAGTT CTTACCGACC ACGATTATGA TGGTCCATCT CTGCCCCACC GGCAGCGAGG GTGCTTCGTA CGCCATGTTT ACGACCGTCA ACAATAGCGC TCTGACCTTG TCCAGTGCCA TTTCCACCCA ACTGTTGCGC ATTTGGGACG TGTCCCGCAC GGCCTTGGCG GCGGGGGACT TGTCCGGCAT GGTCCGACTG ACCTACCTCA CGACCGTGGT CCAAGTGGCA GCGATTGCCT TTGTTTCGTG GCTACCCCAC ACCAAGGAGG ATCTGGTGCA ATTGAACGAG CAGTCGTCCC GGAGTCGCGT GGGGGGTACC GTATTTTTGG TGGTCACGTT CGGCTCGATT CTGTACGCCG TGGGAGTGGG TCTGTTGAAC ATTGTGGCAC CAGGATGGAT GGGAGAATCG TAA
|
Protein sequence | MSVVPVNEGK LVGGLTEDHL RHRHAMNDDH DRTKATEGPV PYGFKVNDIG IDDKLTSTEL EMQPLHLKES DGNLEEDERI LYPITVGTTS MYEGWRKTLD DFLFPPHLPR SCQLLRPENI AVPACYLLVG LLQGLSSPLI NVFPLDLGAT EAQQTTISSI RSLPASFKLV FGFMSDNIPI AGYRRKPYML MGWLLASLSL FSLILGSNLN ITPRNAGCFE SQASDSDSPT TLPADAPSIP FFSVALLAFG TGFWLADVMG DSIVAEKAKL EPPESRGSVQ SSCYSYRFFG IMVAAPLSTY LYATYGPRAV LLLLATLPLC ILPLVYLLFE VENAPVSSTA DQCREIWRTV CSRAVWQPMG FVYVYNLMQV SNAAWREFLV TSLRFTSCQL NLILIVAYVL LYLGILAYKY YMMDWSWRKV YFVTTLLNGF FSLLQVLLIY NITLGLSSFW FALGDDAFAE FIGGIQFLPT TIMMVHLCPT GSEGASYAMF TTVNNSALTL SSAISTQLLR IWDVSRTALA AGDLSGMVRL TYLTTVVQVA AIAFVSWLPH TKEDLVQLNE QSSRSRVGGT VFLVVTFGSI LYAVGVGLLN IVAPGWMGES
|
| |