Gene Pars_1348 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1348 
Symbol 
ID5056397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1212303 
End bp1213313 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content59% 
IMG OID640468894 
Producthistone deacetylase superfamily protein 
Protein accessionYP_001153563 
Protein GI145591561 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.104059 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGTGT ACTACTCGGA GATTTTTAAG AAGCACACCC CTCCCTTTAG ACACCCTGAG 
GCGCCAGATA GGCTTGACTA CGTCATAAAG GGCGTAGTGG AGGCGGGAGG CGTTGTAAAG
GAGCCTAAAA TGCGGGAGGA CGCGTGGAGG CTTATATACT CGGCTCATGA TAAGAGCTAC
GTAGAGTACG TCAAAAGGCT CTGCGGCGCT GGCCAAGCAG AAATTGATGG CGATACATAC
GTCTCTGCGG GTACTTGCGA CGCAGCCGCG CTCGCCGTCT CTGCCGTAAT GGAGGCCCTC
GACAAGAGGG AGACAGCCCT CGTCGCGGCG AGGCCGCCCG GGCACCACGC CGGCTTCGCC
GGGAGGGCCC TCACGGCGCC TACCCAAGGC TTCTGTATAT TCAACACGGC GGCCATTGGG
GCCTTGTACG GGGGAGATGG CATCGCTGTG GTTGACATAG ACGTGCACCA CGGGAATGGA
ACGCAAGAAA TGCTCTACGA AAGAGATCTC TTGTACATAT CAACTCATCA ACACCCGGCT
ACGCTGTACC CCGGCACTGG GTACCCAGAC GAGGTGGGGA CAGGCAGAGG CGAGGGGTTT
AATGCCAACC TCCCCCTGCC GCCGGGGACA GGAGACGACC TGTACATCAA GGCTATTGAT
GAAGTGGTCT TGCCGTTGCT GAGGCAGTAC GACCCAAGGG CGGTAATAGT CTCGCTTGGG
TGGGACGCCC ACAAGGACGA CCCCCTAGCC GACCTCGCCT TGTCCCTAAA GGGCTACCTC
TACGCGTTGA GCGCGATCCT CAGCTTGCAG AAGCCAACTA TATTTCTCCT GGAGGGGGGC
TACAACAGAG AGGTGTTGCA GAGGGGGACA AAGGCGCTGG TGCGCCTAGT AGCGGCGGGG
GACTTCAGGC CCGAGGAAAC CCAAACAGAT TCGCCTCCCC ACGTGGCGAG GCGATACGAG
GAGATAATGC AAGAGGTAAG ACGCCACCTA GGCCGGTACT GGCGCCTATA A
 
Protein sequence
MYVYYSEIFK KHTPPFRHPE APDRLDYVIK GVVEAGGVVK EPKMREDAWR LIYSAHDKSY 
VEYVKRLCGA GQAEIDGDTY VSAGTCDAAA LAVSAVMEAL DKRETALVAA RPPGHHAGFA
GRALTAPTQG FCIFNTAAIG ALYGGDGIAV VDIDVHHGNG TQEMLYERDL LYISTHQHPA
TLYPGTGYPD EVGTGRGEGF NANLPLPPGT GDDLYIKAID EVVLPLLRQY DPRAVIVSLG
WDAHKDDPLA DLALSLKGYL YALSAILSLQ KPTIFLLEGG YNREVLQRGT KALVRLVAAG
DFRPEETQTD SPPHVARRYE EIMQEVRRHL GRYWRL