Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_33342 |
Symbol | |
ID | 7204403 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 652665 |
End bp | 653852 |
Gene Length | 1188 bp |
Protein Length | 371 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186390 |
Protein GI | 219113613 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0652338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAAAA CTGCTTCCCC CATTTTGACA GCAAGTCTTC CTCGCAACAG CGCTTCCGAC GCATCGAAAC CGGTCACCAA CACCTTTTTT GCTCAAACCG CTCTCGATGA GAGCGGTGAT AAAGGAGAGT TCAAACGTGT GGATGCTTCA TGGAGAAATT GGGTCAAGAA AGGTATGTTT ACTTTCCTTC GTAACGAGAA GCTAACGAAA CTTGCTCACC AATAATTTCT TTTCCTGGTC TTAGAACCTG ATGCACAGTT TCCAGCCGAA AAGGATAGAT ATCACTTATT CGTTGCGTAC GCTTGTCCCT GGGCCCACCG TACATTGATG ACGCGAGCCG TCAAAGGTCT TGATGATACA ATCGCCGCTA CCGTCGTCCA CCCAATTTGG CAGAAAACAA AGCCTGACCA AGACGAGCAT TCGGGTTGGG TATTTGGAAA CGCCGAGGGA GAAATGCTGA CGAATACTGA AGGTAACGGT GGTCCTTTCC CTTCAATCTT CCCACACAAC GAACCGGAAC CATTCTTTGG ATCTCAAAGT ATTCGCGAGC TGTACGAAAA GGCTGGCGAT ACCGATGGGA AGTACAGCGT TCCAATTCTC TGGGACAAGA AAAGGAACAC GATTGTCAGT AACGAATCCT CCGAGATCAT CCGAATGCTC AACTCGGAGT TCAATGACTT TGCAAAAAAC CCTGATCTAG ACCTTTACCC TATTGAGATG CATGTCGCCA TAGACAAAGT GAACAGTTGG GTCTATCCAA CGATTAACAA TGGAGTTTAC CGGTGCGGCT TTGCCAAATC CCAGGAAGCA TATGATACAG CGATCACTGA GCTGACGGAA TCGTTTGATC GGATAGCGGA TATTCTACAG AAGCAGCGCT TTATTGCAGG AAACAAATTC TCAGAGGCTG ATATTCGTCT TTTTGTTACC CTTGTGCGGT TTGATGAGGT TTACACGGTC TATTTCAAAA CAAACACGCG CTCTGTGGCC CACACTCCGT CTATTTTGAA CTACTGCCGT GAAATTTACC AGATGCCGGG AGTGAAAGAC ACTGTGAACA TGGAACAGAT TAAGGCCCAC TACTATTGCT CACACCCCAT TCTTAATCAT TTCTCAATTG TTCCTCGCGG GCCCGATTTT GTGGATTTGT TGGAACAGCC CCACAATCGC AATAATAGTT TAAACTAG
|
Protein sequence | MSKTASPILT ASLPRNSASD ASKPVTNTFF AQTALDESGD KGEFKRVDAS WRNWVKKEPD AQFPAEKDRY HLFVAYACPW AHRTLMTRAV KGLDDTIAAT VVHPIWQKTK PDQDEHSGWV FGNAEGEMLT NTEGNGGPFP SIFPHNEPEP FFGSQSIREL YEKAGDTDGK YSVPILWDKK RNTIVSNESS EIIRMLNSEF NDFAKNPDLD LYPIEMHVAI DKVNSWVYPT INNGVYRCGF AKSQEAYDTA ITELTESFDR IADILQKQRF IAGNKFSEAD IRLFVTLVRF DEVYTVYFKT NTRSVAHTPS ILNYCREIYQ MPGVKDTVNM EQIKAHYYCS HPILNHFSIV PRGPDFVDLL EQPHNRNNSL N
|
| |