Gene Information |
Locus tag | PHATRDRAFT_49474 |
Symbol | |
ID | 7195825 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 349357 |
End bp | 351854 |
Gene Length | 2498 bp |
Protein Length | 561 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184235 |
Protein GI | 219128048 |
COG category | |
COG ID | |
TIGRFAM ID | |
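The listed gene length can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the values in this record but is not stated by the source. The function name `gene_length` is a hypothetical helper for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the feature length in bp.

    Assumption: start_bp and end_bp are 1-based, inclusive coordinates
    on the replicon (not stated in the record itself).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above (Start bp, End bp).
print(gene_length(349357, 351854))  # 2498
```

Under that assumption the result matches the Gene Length field (2498 bp). The strand value does not affect the length calculation.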
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATATGA CTACACAAAG CAAAAACCCA CTCTCAGCAC AGACGTCGTT GAATCCTCTC AAAGGTGTCT TGTATCCAAT TCAGTTGCAA TTGGGTCAAG GTGTTGTGGT GCTTAGTATC GCCACGAGCG TTTTTCTTTG GGAGGTATCT TACTACGCCT TTTGGATTAC CATGACGTGT TTTGCCGGAT CTCTTGTGAT TTGTTGGGTT CCGTGGGGCT GGGTTCTGCG TTGGATCTTT AGAATTGCAG TGGCTTGTCA TCTCCGGGCC GTGGATGATG ATTGTGGATC AATACTAGTT GTTTGTGGAA GACCCGAACC TAACGGAGGA GGAAAGGGAG AATGCAGTTA ACTGTAAGGC CCGTCTCTAG TCCCGCTACA CGGCTGTCTT GCAGGCTGCA ACAAACTACC AGACTCGCAA AGAACGGGCA ATCAAACTGA AAGACATGAA GCGCTACATG TTCGGAAAAT ATCGTTTGCG TGTTCCGCGT TTTATGGAAG ACGAGAGGGT TCACCGATAT TCCGTTGCCT ACTTTGTATG CGGAAGCCTA CGATCCAGAC ACTGCTACTC CAGTGACTAT CACAGAGCGC AAGTTTGGAC AAAACTTGTC GGGCGACATG ATCCCTGTTC GTGAGGTTCA GGCGGCCGAA GCGAAAAAGG AGAGTACTGG TTCGTCGGAA AGCAAGAAGC GAGGGTTTTT ACGATCTCGT CGCCGGAACG GTGCTTGCAC GGAGTGGTCT GGTACGTTGA TTTGTCTCGA AACCGTCAAA GGTCGATCGT CGATAGTTGT GTCCGTAGCA CCCGAAACCA AACTCCCCCA GAACTAACCA TCTCTCTCCC TCTCGCTTCG GACACTCTCA TCAACTTTCG TCTACTCTAC AGGAGTCCCC TGGTCAAAAC ACAAGTGCTA CACACCGTGA CACTGCTGGT GTCCGGGGTA CGCGACCCGT CCGCGTTATA CTTTATTTTA TCCCAAAACT GTATCAATGC ATTGATTGCC TGTACTTTAC CGCCAGCACT GGAGACCACA CCAGCACTCT ACGTTGGCCT CATTAATGTA AAACCCAGTC GTTACAGCTG GGTGGATCAC CACCCCAGCT CTTTCCTTTC TTGGCAGTCC ACGAACCCAC GAACGATCAC GTCCATTTTC CGCTCTCTTC CGTGACACTT GAGACTGCCA CGAGCGCGTG CGCGTACGCT CAGTCCAATG CCTACGTATA CTTTACCTGT CTTAATCTGA TTGTCGACTT GATGCAAAAT CAGTACGCAC CTATCCGGGA ATGGATCCGT GGAGCGGAAC GAGAACAATG TCTCTTAGCC AGTCACCTGG CCGCGTTGCT CCGACGGATG TATCAAAGGA TTGCCAATCT TGCAACCGGA CCAGTCGTGG ACGACGTGCG CAGTAACGCC ATTGTTAGCC AATTGACAGT GAACGGCCTA CACGAAGAAC TTGACGCCCT GAACGACGTC TTCTTTTGCG GCATTCAAGG ATTGAATGTT CGGCTTTGCG AAGATTTCCT GCAGACGGTC GTTTTTGTAC TTTTTTGTCT GGACTGTTAC CATCACAAGA AGGCAAAGTC TTTGATGGTT GGAATTGTGG ATGCCGATGT CATCCCCGAG AAGGAACCGC AGGCTCAAGT TACGGTGTTC GTCTCGGCCA AGCTTTTTCA ACGACTCGAA TATGCACCCT TGGTGCATAT GCTAGCTGTC GCTCTCCTGC ATTCCCGATC TGCGCCATTC TGGACGAGCA CCCAACCACA GGGAACACGT TCCCCTGACT ACATCTTTAC CTCTGCCTTG 
GACTCAATGG GTTCTTCAGG GGTCAAAATG TTCTTCTTTG TTCCGTATTT TGTAATGACA TAGATCGTCG AAAATGTATC GCCGGGTTTT GATGAGTGCG ATGCCGAAAA GCTTGACTCG TTTCATCGCT TGGCCACCTG CTTTCCAACA GGTATTGCCT TTGAAGATTG CAAATTCCAC CAGCATTGAG AATTATTTGG TCTTTTTGGT GTCAGTCGTC TGACTCTTTA CTCTCTCTCT CTCTCTCTGT TTCAAGGCCT CATCAATACG GATGTTATCG GCGAAAGTCT CTCGTCCCGC TGTCACACTG GTGGACTCTT TGACATTGCG CGCTCGACAA ACTTCCATCA AATACGTCCC ACGCGACGAG ACAACAGTGT TGTTACGAAT GTGGTGTATA CTCGTTCAAA AGTCTACACG TTTTGCAGTT AGTCAACTCT CCTTTGCATC TGCTCTGAAC CATTTCGGAC AGGTAGAAAT ACGTGATCTC TGTATTGTCT GTCCTGAATG AACACATGTG CATCTCCCAC ACGATCAATG GGACATCGAT GACGGCGTTT GCAGGATTGC CGAGAATCAG CCAACAATTG AGAAATGCCG TAGTCTTCTC GCAGCACTGT ATCCCACAAA GAAAAGACTG CTTTGCTGGT CATTTAGTAT TGCCTGGTAC GAAGTGCGAC AGAGCGGAGC AATCTGAT
|
Protein sequence | MDMTTQSKNP LSAQTSLNPL KGVLYPIQLQ LGQGVVVLSI ATSVFLWEVS YYAFWITMTC FAGSLVICWV PWGWVLRWIF RIAVACHLRA VDDDCGSILV VCGRPEPNGG GKGECRFTDI PLPTLYAEAY DPDTATPVTI TERKFGQNLS GDMIPVREVQ AAEAKKESTG SSESKKRGFL RSRRRNGACT EWSELTISLP LASDTLINFR LLYRSPLVKT QVLHTVTLLV SGVRDPSALY FILSQNCINA LIACTLPPAL ETTPALYVGL INLGGSPPQL FPFLAVHEPT NDHVHFPLSS VTLETATSAC AYAQSNAYVY FTCLNLIVDL MQNQYAPIRE WIRGAEREQC LLASHLAALL RRMYQRIANL ATGPVVDDVR SNAIVSQLTV NGLHEELDAL NDVFFCGIQG LNVRLCEDFL QTVVFVLFCL DCYHHKKAKS LMVGIVDADV IPEKEPQAQV TVFVSAKLFQ RLEYAPLVHM LAVALLHSRS APFWTSTQPQ GTRSPDYIFT SALDSMGSSG IVENVSPGFD ECDAEKLDSF HRLATCFPTG IAFEDCKFHQ H
|
| |
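The gene sequence above is printed in space-separated blocks of ten bases, so it needs light cleaning before any computation. As a rough sketch, the hypothetical helper `gc_percent` below strips whitespace and reports the percentage of G and C bases. How the record's own GC content field was computed is not documented here, so treat this as one plausible definition rather than the database's method.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) is removed first, so a sequence can be
    pasted in the grouped layout used in the record.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two ten-base blocks of the gene sequence from the record.
print(gc_percent("ATGGATATGA CTACACAAAG"))  # 35.0
```

Passing the full Gene sequence field (both lines) to the same function gives a value that can be compared with the GC content listed in the Gene Information block.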