Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_51026 |
Symbol | |
ID | 7202110 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 195087 |
End bp | 196681 |
Gene Length | 1595 bp |
Protein Length | 426 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | histone deacetylase 1 isoform |
Protein accession | XP_002181323 |
Protein GI | 219121958 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTCTTTGT CGCTCAACAG ACAGGAAGGC ACATACATTA TACATCAAAG CTATTTTGCA ACCATTTTTG ATACTACTTT AATATCGCCT GGATTCAATA GGTCCCAATT CGATAGTAGA AACATACAAT GGGGGACTCT AGGCGTGTAT CGTACTTTTA CGATGCCGAG ATCGGAAACT ACCATTACGG CCAAGGTGGG TTGTTGTGCT TCGGATACGT AGATTTGTCC CGCTACTGGG TTATCCATGA ATAACAAACA GCACAATTGG AGTATTTCAA AAATCACCAT GGGTTACTTG ATATCTGGGG CTGTGTTCCA ATTGACAGAG AACGACTTTC TCTTACTCAC ACACCCTTGG CTTTCTCCCG TATTTGCGTA GGCCATCCGA TGAAGCCTCA TCGCGTGCGC ATGACGCACA ATCTCGTCGT CAACTACGGC CTCTACCGCA AGATGGAGGT CTTTCGTCCT CGTCTTGTTT CGCCCACCGC CATGACGCGT TTTCACAGCG ACGACTACAT CAACTTCCTC CGAGTCATTA CCCCCGACAA TATGCAGGAT TACATTCGTC CCCTCCAGCG CTTCAACGTG GGAGAAGACT GCCCGGTCTT CGACGGTTTG TTCGAGTTTT GTCAGCTCTA CACATCCGGA TCGATTGGCG GCGCCGCTCG GCTTAACGAA AATCGCGTCG ATATTGTTAT CAACTGGGCT GGCGGTCTGC ATCACGCTAA AAAGGCCGAA GCCTCCGGAT TCTGTTACGT CAATGACTGT GTCTTGGCCA TTCTCGAGCT TCTCAAGAAA CACGAACGAG TTCTATATAT CGACATTGAT ATTCATCACG GAGATGGAGT CGAAGAAGCC TTTTACTCGA CCAACCGCGT CATGACGGTC AGTTTTCACA AGTTTGGCGA GTACTTTCCA GGAACCGGGG ACGTCCTCGA TGTGGGCTAC GCCCAGGGCA AGAACTACGC CATTAACTTC CCGCTCAACG ACGGTATGGA TGACGATTCG TACGAATCCA TCTTTCGTCC AGTGATTGGC AAGATCATGG AAGTATTCGC ACCGGGCGCT GTTGTCTTGC AATGCGGCGC CGATTCGTTA TCGGGCGATC GTCTCGGCTG CTTTAATCTT TCGGCACAGG GCCACGCTAA CTGTGTTGAG TTTGTCCGCT CCTTCAATAT TCCAATGCTG GTGCTAGGAG GCGGTGGTTA TACGCTACGC AATGTACCGC GCTGCTGGAC GTACGAAACG TCAGTCCTTA CGGGAGAAAA AGTTTCGGAC GAATTACCCT TTAACGATTA TTTCGAGTAC TTTGGACCGG ATTATCGGCT CCATTTGCCC GTCTCGAACA TGGAAAATCT TAACTCGCGA GCCTATTTGG ATAAAACCAA GAATCAGCTT TTGGATATAC TGAGTCAAGT TGAGCCCGTG CCCAGCGTAC AAATCCAAAC CGGACAGATA GATTCACAGA CCAACCCTCG CTCTATGGCG ATGGAAGTGG ACGACGAGCC TCCAGCGGAG GAATCAAATC CGGATGCGCG CGTAACGAGG GAAGACACTG GGCGAAAGGA ACACGCGTCC GAGCTGGCGG CGTAA
|
Protein sequence | MGDSRRVSYF YDAEIGNYHY GQGHPMKPHR VRMTHNLVVN YGLYRKMEVF RPRLVSPTAM TRFHSDDYIN FLRVITPDNM QDYIRPLQRF NVGEDCPVFD GLFEFCQLYT SGSIGGAARL NENRVDIVIN WAGGLHHAKK AEASGFCYVN DCVLAILELL KKHERVLYID IDIHHGDGVE EAFYSTNRVM TVSFHKFGEY FPGTGDVLDV GYAQGKNYAI NFPLNDGMDD DSYESIFRPV IGKIMEVFAP GAVVLQCGAD SLSGDRLGCF NLSAQGHANC VEFVRSFNIP MLVLGGGGYT LRNVPRCWTY ETSVLTGEKV SDELPFNDYF EYFGPDYRLH LPVSNMENLN SRAYLDKTKN QLLDILSQVE PVPSVQIQTG QIDSQTNPRS MAMEVDDEPP AEESNPDARV TREDTGRKEH ASELAA
|
| |