Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50050 |
Symbol | |
ID | 7198743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 250746 |
End bp | 252290 |
Gene Length | 1545 bp |
Protein Length | 482 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | homeobox protein |
Protein accession | XP_002184846 |
Protein GI | 219129334 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGTCTC CAAACGGCAA TAATTGCTCA ATCAATCCGG CGACCCCGGT ACGATCGACT TCCAGCACGG CCCCTCCTAG CAAACGCCGC TCCGTCCGCC GTCCCAACGA CGATCTCGAG CTCATGCTTC AGCACGAAAA CTTTCCTCAT CTCTTGAATT TGGCACACAA AATCAAGACC GCGCATGTTC GAATTCAAAG CGTGACCACC GGCTTTGAGC CTTCCCAAGC GCTAGCCAAA TATAGTTTGG CCCCATCGGA AGTCATGGAA GCCGTTACAC GACTACAGAA TCAATTCGTC TCCAGCGAAA CCTCGCGTGA GACGATTCGG CCTGGAGTTT ATCCCATGAC TGATATTCTC GTCATGGATT CCATAGAAAC CTTGCAGCGT GAGTGGGAGC ACTGCCAGAA ATGGCTGGAA GCGTCGGAAA GTACTCGCCT TGACAAGGAT CCCACTGCCG AGCCTGCTGT ATCTGAGCCT CCGACCCTTC CAAAGAAACG AAAAGCTTCT GGCGCCAACA GCGGTAGCAA AAAGGAAGCT ATTGCGGTGA AGTACTCCAA ATGGCAGACA GATATTCTCA TGAACTGGAT GATCCAACAC GTCGATGAAC CCTTTCCGAA ACAAGGCGAG ATTCATCAGT TGATGGACAT GACCGGGCTC ACGCAATCAC AGGTCATCAA CTGGACAACA AATGTTCGCA AGCGCAACCG CAAGGCCACA TGCCAAAACG GAAAGAAGCC CCACCACTTT ATCGACTTTG TATTCCTGGC ACACGATCGC GAACGAAGAG CACGCAAGGC GTCTTCCATT GCGACGGCCC ACCCATTAAT GACTGGTTTG GACTCTTTTC AGAATGCCCC CAGGGAGAAC ATATTGGCGG TCCCGCCATC ACCGAGTGCA TGCGCTGTTC CTACCCGGCA AGCGGCCAGT TCGTACTCGT ACCCTTCTCC TCCTTCCTAT CCACAGCCAA CGTACAATCA TACAAGTCTA AGCTACTTTG CCAACCAAAC GCCAGATTTC AAGTGCGTGC ACACGGTGCC GTTTTCACCA GGTACAATGG CATCTTCACC TCCGCGTTGC GCGGAAGACT CGGCAATTCA GGAACATAGC CAGACACTCA TGCAAGAAGA AATGTGGGAG ATACACGATG ATTTTGATCC TGTACCAATG GAAGAAGAAT CAGACGAATT CATAATGGAA GAATTTGCCA AATCTTGGTT GTTTGAAAAC CCGATGGACG TGAACGATCC GGACTCCTTG ACAGTGCAGC AAGCTCCCTG CAGCACAATG CCTCGTCTGA ACGACCTGGG TTTGCTGCCC AGTGTTACTG AGGACAGTCA CGAAAAATTA CACCGTAACC GAACAGCTAG TTTTGATCTC GGAGAACTGG AGGACGAAGA CATTGACGCC TGGGCCGCCG ACATGGGACT GACGATTGAA ATTCAGTAGC CGCGCGGGTC TATCATGTGA GGGTCACATG ATGGAACTAG GTTGTACATT CGAAACGCGA CATAAATCTT TAGCTTAAAC AACGTAAATG AAACC
|
Protein sequence | MQSPNGNNCS INPATPVRST SSTAPPSKRR SVRRPNDDLE LMLQHENFPH LLNLAHKIKT AHVRIQSVTT GFEPSQALAK YSLAPSEVME AVTRLQNQFV SSETSRETIR PGVYPMTDIL VMDSIETLQR EWEHCQKWLE ASESTRLDKD PTAEPAVSEP PTLPKKRKAS GANSGSKKEA IAVKYSKWQT DILMNWMIQH VDEPFPKQGE IHQLMDMTGL TQSQVINWTT NVRKRNRKAT CQNGKKPHHF IDFVFLAHDR ERRARKASSI ATAHPLMTGL DSFQNAPREN ILAVPPSPSA CAVPTRQAAS SYSYPSPPSY PQPTYNHTSL SYFANQTPDF KCVHTVPFSP GTMASSPPRC AEDSAIQEHS QTLMQEEMWE IHDDFDPVPM EEESDEFIME EFAKSWLFEN PMDVNDPDSL TVQQAPCSTM PRLNDLGLLP SVTEDSHEKL HRNRTASFDL GELEDEDIDA WAADMGLTIE IQ
|
| |