Gene Dret_0223 details

Gene Information

Locus tag: Dret_0223
Symbol:
ID: 8418027
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 279225
End bp: 280565
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 61%
IMG OID: 645036788
Product: Histone deacetylase
Protein accession: YP_003197103
Protein GI: 258404361
COG category: [B] Chromatin structure and dynamics; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein
TIGRFAM ID:
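
The start, end, gene length and protein length listed for this gene are mutually consistent. The short sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive (an assumption, though it reproduces the listed 1341 bp) and that the unspliced CDS ends in a single stop codon; the function and variable names are illustrative and are not part of the database.

```python
# Values copied from the record for Dret_0223; names are illustrative only.
start_bp = 279225
end_bp = 280565


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                           # 1341
print(expected_protein_length(length_bp))  # 446
```

Both results agree with the Gene Length and Protein Length fields of the record.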


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.195671
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAAAG TCGCCAACAG TCTCGGCGTT GTTTTTTTCC CGGCTTTTGA CTGGGCTATC 
TCCCCAACGC ACCCCGAGCG GCAGGAGCGG CTCCTGTACA CCATGGATCA ACTGCAGGAG
GAGGGGGTCT TCGATATCCC GGGGATCGCC GAATATAAGC CGGATATTGC CAGTCTGGAG
GACGTGGAGC GGGTCCACTT CGCGTTTCCG CGCACCGAGG ACGTCCTCAC CGACTCCCAT
CTTATTTCGG CCGGGGGCGC CATCCGCGCC GGTCAGATGG TCCTGGACAA GGAGCGTGAC
AAGTCGTTCG CCCTGGTGCG CCCGCCGGGG CACCACGCCA TGAAGTCCGT GCACGGCGGC
CGCGGCTTTT GCAATGTGAA TATGGAAGCG ATCATGATCG AGCGGTTGCG CCGGCAGTAC
GGGGTGAACC GGGTGGCGGT GGTGGATACC GATTGCCATC ACGGCGATGG CACCCAGGAC
ATCTATTGGC ACGATCCGGA GACGCTGTTC ATCTCCCTGC ACCAGGACGG ACGGACCATT
TTTCCGGGCT CCGGCTTCCC CGGCGAGATC GGCGGGCCCA AAGCGGCCGG GCGGAACCTC
AATGTGCCGC TGCCGCCCGG GACAAGCGAT GCGGGATTTT TGTTGCTGAT GGACGAATTG
GTCTTGCCAG TGCTGCGGGA CTTTCAGCCG GAGCTGATTG TCCATTCCGC CGGGCAGGAC
AACCATTTTT CCGATCCCAT TACCTCCATG AATCTTTCGG CCCAGGGCTA CGCCCGCCTG
AGCCAGAAAT TGCAGGCCGA TATCGCCGTG CTCGAGGGGG GCTACGCCAT CGAAGGCGCC
CTGCCGTACG TCAATACCGG GATCATTCTC TCCATGGCCG GTCTGGACTT TTCCCATGTC
CGCGAACCAG CCCTGCGGCC CGAGAGCGTA GCGCAGGACG CGAAGATCAC CGAGTATCTC
AAGCAATTGG CCCCGGCGGT GCGGGATCTC TATTTCCATC CTCCGGAGAA GCTGATCGAC
CGGGAGAAGG AGGGGGATTT CTTCGTGCGC GACAAGGAAA TTTTTTACGA TACCGACGGG
CTCATGGAGC AGCAGCGGGA GTTTGTCCTG GATTGTCCGC ATTGCCCCGG GCTGTATAAG
GTCCAGACCT CCTCGACCAA GACCCCCTTT TGTCTGGGGA TCGAACTGGG GAGGCAGTGC
TGCGACAGTT GCGCCAGACG CGCGGAGGAG GAATTCGCCC GGGCCCAGAA AAGCCTTCGG
TATGCGGTTA TTCAATATAT TGACCGCATT CAGGATTTCT CGCAACGGGT CGTGGGCGAC
GCTATGGACA AGGAGATGTA G
 
Protein sequence
MLKVANSLGV VFFPAFDWAI SPTHPERQER LLYTMDQLQE EGVFDIPGIA EYKPDIASLE 
DVERVHFAFP RTEDVLTDSH LISAGGAIRA GQMVLDKERD KSFALVRPPG HHAMKSVHGG
RGFCNVNMEA IMIERLRRQY GVNRVAVVDT DCHHGDGTQD IYWHDPETLF ISLHQDGRTI
FPGSGFPGEI GGPKAAGRNL NVPLPPGTSD AGFLLLMDEL VLPVLRDFQP ELIVHSAGQD
NHFSDPITSM NLSAQGYARL SQKLQADIAV LEGGYAIEGA LPYVNTGIIL SMAGLDFSHV
REPALRPESV AQDAKITEYL KQLAPAVRDL YFHPPEKLID REKEGDFFVR DKEIFYDTDG
LMEQQREFVL DCPHCPGLYK VQTSSTKTPF CLGIELGRQC CDSCARRAEE EFARAQKSLR
YAVIQYIDRI QDFSQRVVGD AMDKEM
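
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following shows how such blocks could be joined, how a GC fraction could be computed, and how the nucleotide sequence could be translated with the amino-acid assignments of NCBI translation table 11. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The sketch does not handle alternative start codons, which is adequate here only because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments of NCBI translation table 11, in TCAG codon order.
# They match the standard code; table 11 differs in its permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one upper-case string."""
    return "".join(blocks.split()).upper()


def gc_content(blocks: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty input)."""
    seq = clean(blocks)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(blocks: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(blocks)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and the final row of the gene sequence listed above.
print(translate("ATGCTGAAAG TCGCCAACAG TCTCGGCGTT"))  # MLKVANSLGV
print(translate("GCTATGGACA AGGAGATGTA G"))           # AMDKEM
```

The two printed fragments match the start (MLKVANSLGV) and the end (AMDKEM) of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field of the record.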