Gene OSTLU_119545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_119545 
SymbolHda2 
ID5000186 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp521520 
End bp523110 
Gene Length1591 bp 
Protein Length487 aa 
Translation table 
GC content47% 
IMG OID640415607 
Producthistone deacetylase 
Protein accessionXP_001416418 
Protein GI145343628 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGA AAATTGCATA CTTTTATGAC CAAGAAGTGG GAAACTTCTA CTACGGACAG 
GTGAGAAACG AAGAGGGATC ACGCAACAGC TTCTGATTGT TACGTTCGTA GGGACATCCA
ATGAAACCGC ACCGTATGAG AATGACACAT AATCTGCTCT TACACTACGA TTTGTACAAA
GACATGGAGG TACGTTGTTG CGATGATAGC GGTGGACTAG CCGTTGAAAA TCACGTGTTC
CATGATCTTG ACTCTCCACG TGTAGGTGTT TCAACCCACG CCCGCGCAAG CCGATGACAT
GACGCAATTT CATAGCGACG AATACATTGA ATTTCTACGC CTTGTCACCC CTGATAATCA
GCATGAACAC ATGCGCCAGT TGAAGAGGTT CAATGTTGCT GAGGACTGCC CGGTATTTGA
CGGACTGTTC CGCTTCTGCC AACTGTACAC AGGCGGCTCT GTTGGAGGGG CTGTCCGCTT
GAATCATGGA CTATCAGAAA CTGTCATAAA TTGGTCTGGC GGGCTTCATC ACGCAAAAAA
GAGTGAAGCA AGTGGGTTTT GCTATGTTAA CGATATTGTA CTTGCGATAT TGGAGTTACT
CAAGCAACAT CAGCGGGTTT TGTACATTGA TATAGATATT CATCACGGTG ACGGGGTTGA
AGAAGCGTTT TACACGACAG ACAGAGTAAT GACGGTTTCG TTTCACAAAT TCGGCGAATA
TTTCCCAGGG ACTGGGCACT TGCAGGACAT CGGCCAACAT GCTGGCAAGT ACTATAGTGT
CAACGTACCC CTAAAGGATG GAATAGATGA TGAAAGCTAC GAGCTTCTCT ACAAGCCGTT
GATGTCAAAA GTCATGGAGA TCTATCAGCC CGATGCAGTT GTATTTCAAT CTGGGGCAGA
CTCTCTTTCT GGAGACCGTT TGGGTTGTTT CAATCTGTCC ATCAAAGGTC ACGCAGAGTG
CCTCAAATAC ATGACTACGT TCAACGTACC TTTACTTGTA CTTGGGGGCG GTGGTTACAC
GATACGGAAC GTAGCCAGAT GCTGGGCATA CGAAACGGGT TGCTTACTTG ATCGAGAACT
GGTAGATGCT ATGCCACAAA ACGACTACTC AGAATATTTT GGCCCAACTC ACACACTGCA
TATCCAACCG AGCAACATGG AGAATCAAAA TACTCGCGAA TATCTTGAAG GGGTTCGAGC
ACATCTTTTG GAAAACCTGT CGAAGATGAC CTGCAAACCC AGTGTGCCTT TCCACGAAGT
ACCACGTGAT TCGACTAATA CTCGTAATGT CAGTGTTGAC GTCGAGCATA TTAGCGAAAA
GAGCGAAAAG GGCTTCTCAG CCAGCCTTGA TAAATCTCAA TACGAGAATG AGCGTCACGT
CGCGGCTCTG CGTCGTCAAC AAAGTATGGT TGTACGAGAT GATATCCCGA GCTCGACGAT
CTCTATTATG GAGAACACAC CGAGCTCTGA GAAGAGTCAT GATTTACCGA TATCGGCCAC
AGGTCTGCCG CAAACACATA CTCCACGGAG CGACGAAGCA ACGGCGATGA CGATGTTCCC
AAGCGTTTCG AATAAAAAAG AAGGAATTTA G
 
Protein sequence
MKKKIAYFYD QEVGNFYYGQ GHPMKPHRMR MTHNLLLHYD LYKDMEVFQP TPAQADDMTQ 
FHSDEYIEFL RLVTPDNQHE HMRQLKRFNV AEDCPVFDGL FRFCQLYTGG SVGGAVRLNH
GLSETVINWS GGLHHAKKSE ASGFCYVNDI VLAILELLKQ HQRVLYIDID IHHGDGVEEA
FYTTDRVMTV SFHKFGEYFP GTGHLQDIGQ HAGKYYSVNV PLKDGIDDES YELLYKPLMS
KVMEIYQPDA VVFQSGADSL SGDRLGCFNL SIKGHAECLK YMTTFNVPLL VLGGGGYTIR
NVARCWAYET GCLLDRELVD AMPQNDYSEY FGPTHTLHIQ PSNMENQNTR EYLEGVRAHL
LENLSKMTCK PSVPFHEVPR DSTNTRNVSV DVEHISEKSE KGFSASLDKS QYENERHVAA
LRRQQSMVVR DDIPSSTISI MENTPSSEKS HDLPISATGL PQTHTPRSDE ATAMTMFPSV
SNKKEGI