Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_55150 |
Symbol | |
ID | 7198835 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 129805 |
End bp | 132481 |
Gene Length | 2677 bp |
Protein Length | 406 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | homeodomain transcription factor |
Protein accession | XP_002184972 |
Protein GI | 219129600 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.57662 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACCGACGGT CAACTGCAAC CATCGATCTA CCCATCAACT CGAATAAAAC GGTTGCAATC ATAAGGTAAG ACAAGCTGTT ACTGCACAGT GTCGGGTAGC TTCGTACCGC CGCGTTGTTA CTTTCGTGCT GGTTGATATG ACTAAACATC CACGCCGCCC AAAGCTGGCA GCGCTCCGAG TGAGAATCGA CGTGAAGGTA ATATTGACAA TCCGTGTCGG TCGTGATGTA AGCGAGCCGC TGTATTGCTT GCGCATAGCA GGGGAAACTC GTGTGGCATG TTGACCGACT CCATGGACGC GTATATATTG CAGTACGAGC TTGCCTTGCG AGAATAACGT TGCGTGACGA ATGTCAATCC GTCTCGATCT GCTTCCAATG AGAGTCGAGG CTTTCCTGCT TTGATGGCTG TTAGATTCCC GGGATCAGAA GTCGGCCTGC CACGACTATT TCTAAGGTAG TGCAGTTTGG CATTTGAATC CCAACGACTG CTCGTTCGTC CTATTCATGG ACACCCAGCT AGTTCCGAAA GAAAAAAAAA ACCACTCCGT CCATGGACGA GCCTTCATGT TGGACCGCGA ACAACGCGCG ATATCGTGTG ATGGGTTTCC TTCCCGCTTT TACCGATGTC GTCCCGGAAC TGGGACAACG ATACAAAATT TCGGTTTTAG GATTCGCAAG TTTCCGGGTT GGTGCTTACG TGGGCCAGCA AACCACACGC CTCGTCGCCC ACCTCATCGC CTTGGAAGGT TTTGATTGGT CGCCATCATA TGGCTCGTGA GGCGTCCTCC CTTTCAGACA CCCACGCGTT CGTGTCTCTC TTGGTTGTTG CCGTTCCAAT GGGAAAATAT CCATCCATCA GAAACCAATC CACTGCCCAT CGGCACTCAC ACTTTTTGCG TTCCGCGAAT CCATCACCAG CACAGATCGA CTGTATCATC GATACCAAGT TTCTCACTAA CAATCGAAAA TTGTCACAAT GATGGAAGGA TCAGTCTCTG AAGCCCCCAA CAAACACGGG TCGACAAGTA ATCCCGTAGC GACACGTGTC CGCGCTGTTT CGGTTTCCAA CGCGTCATCA ACCTCCGCCA GTAAAACCAA CAAACGCAAG TCTACCAGTC TGCCCACCGA GACTGTCGAA TACCTAAAAG CATGGATGAT GAGCCCGGAA CACATTGCTC ATCCATATCC GACGGAACAG GAAAAGGCCA AGATCATGGC GGACACCTGT ATCGAATTGA AGCAACTCAC CAACTGGTTC GTCAATAATC GCAAGCGCTA CTGGAAGCCA AGAGTCGAAG CCCGCGTCCA GACGCAAGCT TCCTTCAAGA CTGCCCCATC CAACACTAGC AAAAACACTG CTGATACGGA GACCGCCCCG GTTGATCTTT TCGACCCTGA TTTGGCGATT GTGGCAACGA CAACCGGCTC GGCGAACCAA CCAGAGATCT CCGCTCCGAG TGACGTTCCG TTGGCCAAGA TTGTTTCTGT TGCCAGCTTT CAAGCTCTTC TGGAAAAGGC TTCTGTCAGT CTATCGCCGA TCAGTACTCA CAACTCCGTC TTTCACCACG TGGAATCACT CCTCAATGGT CCTTCTCGAC TGGTTTCGGA TGCGTCCGAC AGCAGCTCGA TTGGAACCGA GGACGAAACC TCGAGCAACA CTAGTACCGC AGGGTCTGTC ACAACGCCAC AGTCGGAGGG CATCAAGACG GAACAAACAT CGGTTCACAT ACTGCGTCAT ACCGGACAAG TCCCGACAGT CGAAGATGTT ACGATCCTTA CCAATGTTCC GGCGGAACGC ATTCTCCGAA CCTTTGATGA TTGCGTCTTG AGTTACCGCG TGCCCACGTC CGGGAATCGG AAAAAGGTAG GTCGTGGGAG TCCTGGAGCT CGGAGCACTT TGCGGCGATA TCTCTTGATT GGTAATAGTG TCTCACAATC TCCGTTCGTT TGGTTTCGCA GAGTCAAAGT CGTCGGGACG CCGAAATCGT ACGCACCAAG AAGCACTACC TCAAGGTTTA CATGGCAGAA GTTGCGGAAG CGAGCAAGGC TCGCAACGAG ATCCGTGCTA CGGCTTGGAC CAAACGAGCC CGATCACCAG CGGGAAATGA GGCTACCGCG GAAGACCCTT TGTCGCCCCG CGCTAAGTTT CGCCGGGTCA GTGTCCAACT TTGGCAGGAT GCCTGCCGAT CGGCCTCGCA CGGGTATGAC CACGATGCGT TACCAACCTT GGAAGAAGCT TCGCAGCTTT TTGGCTACAG CAATAGCGAT TGCGTCGACT GTGTATCCAA CTAGTCTCAA GACGTACCGG GGCCTTCTGT GTTTTTTGGC TCTTTTATCT GCGCACCTTC ACCCGCTCAA TCCACCACTC GCTCGCGGAC TGCCTGTCAC ACATTCACGT CTTTCCTTCT GTCTCGGCTC GACGAGGCAT CGCATCACCA CTTGATTTAC ACTGCCGGCC TGCCGTCAAT CAAGGCAACG GCGTCTACCT AGATTACGAA TGGCGCCGTG ACAAAGCTAC ACTCGCGGCA CTTGCAATTC GAGGCCGGAT TGCACCCGTT TTCTCCTTTT TGTATTTGAT CCATCTACTC ATCTTATTCG CAAGGGTATG TTTCCTCAAT GCCTGTTTTC GCACAAAATA ACCACGTTAT AACGATAGTC CACCAAGGTT TAGCATC
|
Protein sequence | MMEGSVSEAP NKHGSTSNPV ATRVRAVSVS NASSTSASKT NKRKSTSLPT ETVEYLKAWM MSPEHIAHPY PTEQEKAKIM ADTCIELKQL TNWFVNNRKR YWKPRVEARV QTQASFKTAP SNTSKNTADT ETAPVDLFDP DLAIVATTTG SANQPEISAP SDVPLAKIVS VASFQALLEK ASVSLSPIST HNSVFHHVES LLNGPSRLVS DASDSSSIGT EDETSSNTST AGSVTTPQSE GIKTEQTSVH ILRHTGQVPT VEDVTILTNV PAERILRTFD DCVLSYRVPT SGNRKKSQSR RDAEIVRTKK HYLKVYMAEV AEASKARNEI RATAWTKRAR SPAGNEATAE DPLSPRAKFR RVSVQLWQDA CRSASHGYDH DALPTLEEAS QLFGYSNSDC VDCVSN
|
| |