Gene Information |
Locus tag | PHATRDRAFT_49800 |
Symbol | |
ID | 7198371 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | + |
Start bp | 357532 |
End bp | 359791 |
Gene Length | 2260 bp |
Protein Length | 426 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | histone deacetylase 1 isoform |
Protein accession | XP_002184530 |
Protein GI | 219128670 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAGTTGCTCC CTTCAGCCAA CGAGGGACGC CTGAGGTAGG CTCGAAGGCC CCAGATCCTT CAGCAAAGTG GTTTCGGCTT CTGCCTTGCT TGTCGGTATC TTGAATCGTT GCGCTTCAAT TGTGAAATAG AACGATGACG GGAAAGTCGC GCGTCTCCTA TTTCTATCAT CCGACTTGCG TGAGTACTAG ACCAAGACAA CGCGGATCTG CCCAAATTTC CTCGTCCTTT GTCTAAGAGA ACCCACCCTT CACAAGCTCT TCTCGATCTA CTAATTTCTT TTGTTCAATT TTCTTTCCCT CTAAACGCAT GTTGCTAACA CTTGGCCCGT TCTGATTCCC AGCCTCTGTT CTACTATGGA CCGTCACACC CCATGAAACC GCATCGCCTA AAGTTGGCAC ATCATCTCAT TCTGACGTAT GGCCTTTACA AGGAGATGGA CTGCTACCGG CCTCACCCTG CAGCAGCAGG TGAAATGACG CAATTTCACT CGGAGGATTA CGTCAACTTT ATGAGTAAGG TTACTCCTGA TAATCTACGA CAGTATTCGG CATCGATGCA ACGTTTCAAT GCTGGCGATT CCACTGATTG TCCCGTCTTT GACGGTCTCT TTGAATTTAC ACAGCTTTAT ACGGGCTCTA GTCTCGACGG AGCTTTACAG CTTTGCCAGG GAAATACCGA TATAGCCATC AATTGGAGCG GTGGCTTGCA TCACGCCAAG AAGGGTGAGG CTTCGGGGTT CTGTTATATC AACGATATCG TACTGGCGAT TCTGGAACTC CTCAAGGTCC ACGCGCGTGT GCTATACGTC GACATTGACG TTCACCACGG CGACGGCGTG GAGGAAGCTT TCTACACTAC TGACCGTGTC ATGACGTTTT CGATTCACAA GTACGGAGAC TTTTTTCCCG GCACAGGGCA TATCAGTGAT ACGGGTGCCA AAGATGGCAC CGGCTTTAGT GTGAATGCAC CGTTGAATAG CGGTATTACG GACGAAACCT ACTTTCACGA TTTGTTCAAA CCGGTCATGG AGAAAATCAT GCAGGTATTT AATCCGGGCG CTGTGGTCTT GCAGTGTGGA GCGGACAGCT TGACGGGAGA TCGGCTTGGA TGTTTCAATC TAACGTTGAA GGGGCACGCA GCCTGTGTAG AGTACGTCAA GAGCTTCGGC GTGCCGACAT TGGTACTGGG AGGTGGAGGT TACACAATCC GCAACGTAGC ACGGTGTTGG GCCTACGAGA CGGCCGTTCT ATTGGACAAG AAGGACATTC CAAACGAAAT TCCATACAAT GATTATTACG AGTATTATGC CCCAGACTAC GAGCTTCATT TGACTCCGAC GCCGGAAGAG AACATGAATG GAAAAGATGC CCTTGAAGAT GTACGGACAG AATTGCTTCA GCAGTTGCAA GATTTGCAAG GAGCTCCCTC AGTAGCGATG CAACAAGTCC CCCCATCTTT TCAGCGAGCA GAAGCCACTG AAGAGGATCC GGATGTCAGG GAAGGGGCTA GCAAGACCAG ATCGGGTGAC GGGGTCCGTA AGCAGCATCC AAGCGAGCTT TACGACGAGG TAGATTAAGA GTAAAAGGCT TTACCGAACG ATGGCGAAAT ATCAGTCACC TCCGACCTCG TCGGCATTAG TTCGTAAAGC TCTTTCGTAT ATTTCTTCCC AGTCTTTGGA ACTCTTGCTC CGAAAAATTT CCTCGTTATG CTGCGCAATG GCAGCCGTTG CAGAGGGGCC CGCTTTGTCA ATAGTAGAGG TCGCAACGAT GGACACCGTT GAGTAAGGAC TGCGCTTTTC 
TAGCTGTATG GGATTCTCAT CTTCACCGGC AAACAACCAG GCAGGATGAA AAGGCGCCAG TGTGACAGCA TTTCCAACAT GGTCTGGATC TTCGTCTATA GCATCCAAAT ATTGCTCCTC CTTTTCAAGA AACCAGTCGT AAAAGGATTC AAAATCCCAT CTCTCTTCGA CTTCTTCGGC CAGTATTACA AACGCAATTG CGGTATTTGA ATCCAGGGTC CCCGTTTTCA TTTCCAGGCA GAAACGTTCT GCAACCGCAT CTATGGCCTT TTCAAATAAA GTTGTATCCT TGGTCAAGTA CAGTCGTATG GCACCTTGAG TGTTTACCGA GGACGAAGCC CAGGGGCATA GATTGTGCGG TACCACAAAG TTGGCGCACC AGCTCCAGGT CCGGACAGAA GCCTTTTGCA ATGAAGGACT GGGAGGGGTC GATCCAAAGC GAACGACTTC GTCAAACGAG
|
Protein sequence | MTGKSRVSYF YHPTCPLFYY GPSHPMKPHR LKLAHHLILT YGLYKEMDCY RPHPAAAGEM TQFHSEDYVN FMSKVTPDNL RQYSASMQRF NAGDSTDCPV FDGLFEFTQL YTGSSLDGAL QLCQGNTDIA INWSGGLHHA KKGEASGFCY INDIVLAILE LLKVHARVLY VDIDVHHGDG VEEAFYTTDR VMTFSIHKYG DFFPGTGHIS DTGAKDGTGF SVNAPLNSGI TDETYFHDLF KPVMEKIMQV FNPGAVVLQC GADSLTGDRL GCFNLTLKGH AACVEYVKSF GVPTLVLGGG GYTIRNVARC WAYETAVLLD KKDIPNEIPY NDYYEYYAPD YELHLTPTPE ENMNGKDALE DVRTELLQQL QDLQGAPSVA MQQVPPSFQR AEATEEDPDV REGASKTRSG DGVRKQHPSE LYDEVD
|
| |
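The length fields in this record can be cross-checked against the coordinates and the displayed sequences. The sketch below is a minimal illustration, not part of the source database: the helper names `span_length` and `ungroup` are hypothetical, and it assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the reported Gene Length of 2260 bp.

```python
# Coordinates copied from the Gene Information table in this record.
START_BP = 357532
END_BP = 359791


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a range that counts both endpoints (assumed 1-based, inclusive)."""
    return end_bp - start_bp + 1


def ungroup(sequence: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in blocks of ten."""
    return "".join(sequence.split())


if __name__ == "__main__":
    # Matches the Gene Length field of the record.
    print(span_length(START_BP, END_BP))  # 2260
    # The first two displayed protein blocks hold 20 residues once ungrouped.
    print(len(ungroup("MTGKSRVSYF YHPTCPLFYY")))  # 20
```

Passing the full Protein sequence field through `ungroup` and taking its length gives a value to compare with the Protein Length field in the same way.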