Gene Tneu_1746 details

Gene Information

Locus tag: Tneu_1746
Symbol:
ID: 6165389
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 1537236
End bp: 1538246
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 63%
IMG OID: 641668909
Product: histone deacetylase superfamily protein
Protein accession: YP_001795110
Protein GI: 171186191
COG category: [B] Chromatin structure and dynamics
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein
TIGRFAM ID:
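
The lengths in the record follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the function name `cds_lengths` is hypothetical, not part of the source database):

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive, so add 1.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and encodes no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(cds_lengths(1537236, 1538246))  # (1011, 336)
```

The result agrees with the Gene Length (1011 bp) and Protein Length (336 aa) listed above.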


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.115562
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTGGTGT ACTTCTCAGA GGTGTTCAAA GGCCACGTGC CTCCCTACCG GCACCCCGAG 
GCGCCGGACC GGCTGGACTT CCTCATAGAG GGCGCGCGCG AGGCCGGGGC CGACATAAAA
GAGCCGAGGA TGAGGGAAGA CGTCTGGCAA CTCGTCGAGT CGGTACACGA CAGGAGCTAC
GTAGAGCTGG TGAGGCGCTT GTGTAGAAAA GGCGATGTGC AGATAGACGG GGACACCTAC
GTGTCCGCCG GGACATGCGA CGCGGCGGCG CTTGCGGTCT CTGCCGTGGT AGACGCCGTT
GATAGAAAGG AGACGGCCTT GGTCGCGGCG AGACCCCCCG GCCACCACGC CGGCTTTGCG
GGCAGAGCGC TTTCGGCGCC TAGCCAAGGC TTCTGCATAT TCAACACCGC GGCCATCGGC
GCTCTCTATG TGGGCGAGGG CGCCGCCGTG GTGGACATAG ACGTCCACCA CGGAAACGGC
ACACAGGAGA TACTATACGA CAGAGACCTG CTCTACATCT CCACACACCA GCACCCGCTA
ACCCTCTACC CAGGCACAGG CTATCCGGAG GAGGTGGGCG TGGGGAGGGG GGAGGGCTAC
AACGTGAACG TGCCGTTGCC CCCCCGCACC GGGGACGACC TCTACGCCAA GGCGGTAGAC
GAGGTGGTGG TTCCCGTCCT CAAGCAGTAC GGCCCCCGCC TAATAATTAT CTCCCTGGGA
TGGGATGCAC ACAGGGAGGA CCCCCTCGCC GACATGAACC TCACCCTCAA GAGCTACCTA
TACGTCTTCG ACGCCGTCTT ACGCCTCCAA AAGCCGACCA TCTTCCTCCT GGAGGGGGGC
TACAACCGCG GCGTTATAAA GAGGGGCACC AAGGCCCTCG TAAGACTCGT TGACGCGGGC
GAGTTCGCCC CAGGCGAAAG CCAGACATCC ACCGACGGCC ACACCGCGAA GAGGTACGAG
GAGGTCATGA GGGAGGTGAA GAGCCACGTG GGGAGGTACT GGAGGTTATA G
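
The GC content field can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a rough sketch only; the helper name `gc_percent` is hypothetical, and the database may round or count ambiguous bases differently:

```python
def gc_percent(seq_text):
    """Percentage of G and C bases in a sequence; whitespace is ignored."""
    # Join the space-separated blocks and lines into one string.
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in seq)
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above.
print(gc_percent("GTGTTGGTGT ACTTCTCAGA"))  # 45.0
```

Passing the full gene sequence instead of the two-block excerpt gives the value to compare against the GC content field.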
 
Protein sequence
MLVYFSEVFK GHVPPYRHPE APDRLDFLIE GAREAGADIK EPRMREDVWQ LVESVHDRSY 
VELVRRLCRK GDVQIDGDTY VSAGTCDAAA LAVSAVVDAV DRKETALVAA RPPGHHAGFA
GRALSAPSQG FCIFNTAAIG ALYVGEGAAV VDIDVHHGNG TQEILYDRDL LYISTHQHPL
TLYPGTGYPE EVGVGRGEGY NVNVPLPPRT GDDLYAKAVD EVVVPVLKQY GPRLIIISLG
WDAHREDPLA DMNLTLKSYL YVFDAVLRLQ KPTIFLLEGG YNRGVIKRGT KALVRLVDAG
EFAPGESQTS TDGHTAKRYE EVMREVKSHV GRYWRL
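
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is illustrative and makes two assumptions: table 11 assigns the same amino acids as the standard code, and an initial ATG, GTG, or TTG is read as methionine (table 11 permits further start codons that are omitted here). The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplification: only the most common table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq_text):
    """Translate a CDS, ignoring whitespace, and stop at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon is still read as Met
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGTTGGTGT ACTTCTCAGA GGTGTTCAAA"))  # MLVYFSEVFK
```

The printed residues match the first ten residues of the protein sequence. The gene begins with GTG rather than ATG, which is why the start-codon rule is needed to obtain the leading M.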