Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45650 |
Symbol | |
ID | 7200440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 818119 |
End bp | 819345 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179921 |
Protein GI | 219118286 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00296757 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAAAC GGCGAAATGA GCAGGCGCAG ATGAGCAAAG AAGATTTCGA AGCACAAATG GACCGCGAAG ACACCGACAG TGTCCCTGCC GTGTTCGAAC AGGCCTCGGC CGAACAACTC GAGCAGCGTC GTATTGTTCG TCGATCGACA CGCCCCCCAT CCATGTCACC AGCCACGAAT ACGCCGGTCT CATCGTCTGT GCCGCTGTCG TCCAATCCTT TCGCTGGCGT CAAGCTATCT TCCGATTGCA AACCTCTTTT TTCGTTCGGA TCAAAGCCAA CCGACGGCAC CAGTGTCCAC GACAACCAAG AGCAACCGAA GCCATACGCT ACACCTTCCC CCTTTTTATT TGGTACTTCT GCTCCCGCGC CAAAAATCAG CACTATCAGT TTCGCCCCAC CTCCAAGTGG TGGCATGTTC CAATTCTCTC CAGCAGCCAG TACAACATCG GCATCCGCCA TATCGACGAT TTCAACGACC AATTGGGGGG AAAAGAGTGC CGAGCTGGAT AGGACCTTTC GTGAAACGGT GCTGTCGTCT AGCTGGGACG GTCCCCAATA TCATAGCAAA ATAGACAAGG CCTATCTTAA GGACTGCTTC GCCCAGGAAA CGGCGTACGT CAAGGAGAAA GAAGCGGCTA CAGCAACCAT TCGATACTCG CCATCCAAAC CAGCCACTTC CGAGTCTACG GCATTTGGTA CGTTTGGAAA GTTTGCACAA TCCACCAGTA GCAACTTTTC CAAAACACCG ACTCCAACTG CGTTTGCGCC AATCCCCGAC TCGTCTACGA CAGACAACGA CAACAACGCT GGAGAAGCCG AGGACGAAGA CACAATGATC CAACCAGCGT CAGATCCCGA CTGGGAGATG GACTCCGAAT TTGGCCGCGT TTTTTTCTAC CACCTGGTGG ATACCAAAAA GCCGGAATCT GGATACGCGG GGTTTGGCTC CGGCACGTTG CGTATCCAAA AGAACAGCAA GACCGGAAAA TATCGCATGC TGATGCGAAA TCCCGCCGGC ATCAAGGTTT TGATCAATAT TCTGATCACC TCCGATATGA CGTTCAAGTT GACTGCGTCT AAACGAAAGG GTCAAGACGC CACCGAAATT TCTTTTTTTG CCACGACTAG TGTGGATCGA GGCTACGAAC AGTTCCGGGT AGTATCGCTT GCCGAGACGG GAAAGAAGTT GCACAAAAAG CTCGAGTCGT TGGCCTCCGT AAGCTAA
|
Protein sequence | MGKRRNEQAQ MSKEDFEAQM DREDTDSVPA VFEQASAEQL EQRRIVRRST RPPSMSPATN TPVSSSVPLS SNPFAGVKLS SDCKPLFSFG SKPTDGTSVH DNQEQPKPYA TPSPFLFGTS APAPKISTIS FAPPPSGGMF QFSPAASTTS ASAISTISTT NWGEKSAELD RTFRETVLSS SWDGPQYHSK IDKAYLKDCF AQETAYVKEK EAATATIRYS PSKPATSEST AFGTFGKFAQ STSSNFSKTP TPTAFAPIPD SSTTDNDNNA GEAEDEDTMI QPASDPDWEM DSEFGRVFFY HLVDTKKPES GYAGFGSGTL RIQKNSKTGK YRMLMRNPAG IKVLINILIT SDMTFKLTAS KRKGQDATEI SFFATTSVDR GYEQFRVVSL AETGKKLHKK LESLASVS
|
| |