Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_3062 |
Symbol | myst |
ID | 7196255 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 388740 |
End bp | 390156 |
Gene Length | 1417 bp |
Protein Length | 349 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176575 |
Protein GI | 219109641 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0291461 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TACTACGTGC ACTACAAAGA CTTTAATCGA CGGATGGACG AATGGATCAG CGTAGATCGC ATCGTGTCGC CTCCCTCAGT TGGCAACGCC AAAGCTCGGG CACTAAAAAA AGAAGAGGAA CGAGAGAAGA AAAAGAAGCA GAAAAAGGAA GAAGAAGAAA AGGCGCTTAT GGATTGGACA CAACCTCGAA GTCGAAGGCG CGCTTCGGCT GTGGACATTA GCAAGGACGA TATGAATGGG CGCAGAAGAC GTCTGTCTCG TAAAAAGAGC TTCAACACGG ACGATGACAA CACTGTTGTC GCGTCGAACG TTGAAGAAGA CGACGAAGAA TATCAAGATG AGAAAGAGAA GTCCAATGAA GAGATGTTGG CACTACCGAC GGATACTGTG ACCACGCACA CAGTAGGGGA GCATGTCGTA GCGACGGTCA AAGCCCAGGA ACTTGATGAA CATGAAGGCT TGGATGAGGC GTCGTTGCGC GAGCATGAAG AGGTAACGAA GGTGAAAAAT GTTGGTTTCA TGGAGCTTGG GCAATAGTAA GTAATGCTTC GCTTTACGAC CTTGTTGGTG ACTGTCATAC TGTCTCTTTC TTCTTGTGAC TCAATAATCG TTTAGTCAAA TGGAGACCTG GTATTTTTCT CCCTTGCCAA AAGAGCTATT GAATGCAAGC GGCTTTATCG ACGTTCTGTA TGTTTGCGAG TTTTCCTTTG GGCTGTATGC CCGCAAGAGT GAGCTTCAAA GGTTCCAAGC ACGTCTTCCG AGTGAAAGGC GCCATCCACC TGGCAACGAA ATCTATCGAA ATGGCGATTT GGCAATGTTT GAAGTAGATG GAATGGAAGA GCGAATCTAC TGTCAAAATC TATGCTATCT CGCAAAACTC TTCCTTGATC ACAAGACGCT TTACTATGAT GTGGACCCTT TTCTGTTTTA CGTTCTGTGT GAAGTGGACA ATCGAGGCTT TCATCCTGTT GGCTACTATA GCAAAGAAAA GTACTCCGAT GTTGGCTACA ATTTGGCTTG TATTTTAACT TTTCCTGCGC ATCAACGCAA AGGGTATGGA CGCTTTCTCA TTGCTTTTTC GTATGAACTT AGCAAAAAGG AAGAGAAGGT TGGATCACCA GAAAAGCCTA TGTCAGACCT CGGGCAGCAA GCCTATAAGC CCTACTGGGG TTCAACGGTG GTTGATTATT TGTTGAATCA GTCCAATGAA TCTTCGTTGA GCATTATGGA CGTTTCGAAA AGGACATCAA TCATGGCCGA AGACATTGTT TTTACGTTGA ATCAACTAGG GATTTTGAAG ATCATCAACG GTATATACTT TATCGCAGCC GAAAAGAGCC TGCTTCAGCG ATTGGCAGAA AAATACCCCG TAAAGGAACC TCGAGTGGAT CCATCCAAGC TTCATTGGAC TCCCTTT
|
Protein sequence | YYVHYKDFNR RMDEWISKEK SNEEMLALPT DTVTTHTVGE HVVATVKAQE LDEHEGLDEA SLREHEEVTK VKNVGFMELG QYQMETWYFS PLPKELLNAS GFIDVLYVCE FSFGLYARKS ELQRFQARLP SERRHPPGNE IYRNGDLAMF EVDGMEERIY CQNLCYLAKL FLDHKTLYYD VDPFLFYVLC EVDNRGFHPV GYYSKEKYSD VGYNLACILT FPAHQRKGYG RFLIAFSYEL SKKEEKVGSP EKPMSDLGQQ AYKPYWGSTV VDYLLNQSNE SSLSIMDVSK RTSIMAEDIV FTLNQLGILK IINGIYFIAA EKSLLQRLAE KYPVKEPRVD PSKLHWTPF
|
| |