Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47785 |
Symbol | |
ID | 7202945 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 25026 |
End bp | 26925 |
Gene Length | 1900 bp |
Protein Length | 477 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182309 |
Protein GI | 219124015 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.183468 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCTCA AGGGCGACAA AGGCCGTAAT ACAACTGGAC ACATGGCATT GTGGGAGACG AGAATGGTCA TCCTGAATCG AGTGAAAATG ACTGAATTAA TTTGATAACT AATGAAGATA GTTAGATTTT CCCAGACCGT GATAGGCAAC AAAATGCCAA GTCATCAACG AGTCTTTTGT TGCTTTATCA ATTGTCTAAG CGAAGAAAGT CAAATAGAGT TTGAGACTTA TAGCATTTAT CAAAACGGGA TCATGTGGTC TTATGCCTGT AACTTTTCGG TTGCTGGTGC TGTCGACTCA CAGTCAGGTT GACTGTGAAA ACAAATTCAG GACACAATTT ATCGAAGTCA TGAAGTTCGC TTACCTTCAA TTGTTCGCGC TCCTTGGAGG TCGAATGGCG CAAGCCAGCC AAATCTTAGT TCGTAAGTAG CTAGCGGTGG TCGCCAAGTC ACTGAGAGAA GGTACTAATT TGCCTAATTA CCTTGTATAT TGTAATTCCA AAGCTGGTTG TGGCTTGGTC GAGCCCGGCA AAGGTGACGA CATTTGTCCA ACCGCATCTC TGTGGATGGG TCAGTATGAC TTGATTACAT TAGAAGATTT CTCCAGCGAT TCGGACGTTG AAGGAACTTC CTTTGTTGGC GGCAACATAA TCTCCTCTCA ATCGGCAAAC TTCGCCATCC ATATGACTAG CCTCGACCCG AAGCTGCCGG GACTAGAAGT GGCTGGTTCC ATCGCTGACG GAAGTCCTCT CAATGTCAAC CGTGGGAGTC TCGTAATCGG TACGGGCTCA ATCACGGGAA GCGGGCCTGT TCCGTACAAA GTTGGGAATC GTCGGATAAA TGCAAACGCT GGAAATGAAG GAGCAAAAGT TAGAAAGGAT ATCACCCTCC GAGACAAGGC CGGCCAAATT GAGCGGGAGT TGAAAAAATG GTCTATTGAA CTCAACAATC TTTCAGCCCC CAACACATAC ACCCGTCCCA CAAATCAGCA TGGGCCTTTC ATTCTCAACG CAAAGACAAA AGATATTAAC GGAGTTGCAG TGTTTTGGAT CACAGCCAAT CACTTGTTCG GCAACGACAA AGTTCAACAA GTTGAGCTAC GAAACGACGT GAACGCTGCA ATGGTCGTCA TCAATGTTGC AGGCAAAACA ATTAACTGGA ATTCTGGCAA CATGGTTGGT GCTTTCAATA ACAAGTCCGT TCGCGGAACG ACGGTCTGGA ACTTTTTCGA AGCCGAGAAT ATCGACTTCG GATCGAAGAA CTTTATGGGG GCATTGTTGG CTCCCCTAGC ACACGTCAAA ACTCGAGGCA ACATCGACGG CGCCACAGCC GTTCGTATGC TGACAATCAA TGCTGAAGTT CATCTCCCTA CGCTTTCCTC TGTTTGTCCC GACCACGCGG GAGCAAGTGC TCAAATATCT CCGACCGTGC CAATCTCTGA CAGCGCACCA AAACAATTGC CAGCTGCATC CGTTACCAAA GCTCCTACGC GTGCACCTGT GGTACCTCCA ACGCCTGCTC CGGTAGCAAC ACCTACAGGA GGAAACAGTG GGCAGTGCGT TGCTGATTGG CAGGATTGCT CTAAGAGCCC GTCCTGTTGT GGGACTTCAG TCTGTGTCTT TCACAACCAG TGGTATAGCC AATGCAAGCC CAAACCTTCA AGCTTTTCTG GGCAATGTGT TTCCTTATGG GGAGATTGTA CCAGCAATCC CAGTTGTTGC GATGGTTCCA CATGCAATTA TTCTAACCAA TGGTACAGTC AGTGTAAGCC CGCCGCCTCC ACAAACAACA GTGGTGGGCA ATGTGTGGCT CTCTGGGGCG ATTGTTCAAA GAATCCCTCG TGCTGCGGCA GCAGTAAGTG CATCTACAAA GATCAGTGGT ACAGTCAGTG CAGAAATTAA
|
Protein sequence | MPLKGDKGRN TTGHMALTQF IEVMKFAYLQ LFALLGGRMA QASQILVPGC GLVEPGKGDD ICPTASLWMG QYDLITLEDF SSDSDVEGTS FVGGNIISSQ SANFAIHMTS LDPKLPGLEV AGSIADGSPL NVNRGSLVIG TGSITGSGPV PYKVGNRRIN ANAGNEGAKV RKDITLRDKA GQIERELKKW SIELNNLSAP NTYTRPTNQH GPFILNAKTK DINGVAVFWI TANHLFGNDK VQQVELRNDV NAAMVVINVA GKTINWNSGN MVGAFNNKSV RGTTVWNFFE AENIDFGSKN FMGALLAPLA HVKTRGNIDG ATAVRMLTIN AEVHLPTLSS VCPDHAGASA QISPTVPISD SAPKQLPAAS VTKAPTRAPV VPPTPAPVAT PTGGNSGQCV ADWQDCSKSP SCCGTSVCVF HNQWYSQCKP KPSSFSGQFS VSPPPPQTTV VGNVWLSGAI VQRIPRAAAA WYSQCRN
|
| |