Gene Haur_0154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0154 
Symbol 
ID5732063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp184887 
End bp185924 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content53% 
IMG OID641277278 
Producthistone deacetylase superfamily protein 
Protein accessionYP_001542934 
Protein GI159896687 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000391208 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCTCG CTCTCATTTC ACACGATCAT TATCTCGACC ACGATCAGCC TGAGCACCCC 
GAAAATGCCA ATCGGCTACG GGCGATTCAT GCCATGCTGG CAGCTGATTA TGAATTGCAA
CAGCACTTAA CCCCGCTTGC ACCACGCCAT GCAACTGCTG CTGAAATCGA GGCGGTGCAT
GTGCCAAGCC ATTTGCCAAC CCTACAACGG ATGGCCCAAT TTGGCGATTG GGCTGATGCT
GAAACCTATA TTTTGCCAGA TTCGGTCGAG ATTGCCCAAC TTGCTGCTGG CGGCGCGATT
GTCGCGACTG ATGCAGTGCT CAGTGGTCGG CATGCCAATA GTTTTGCTTT GGTTCGCCCA
CCAGGCCATC ATGCGACCGC CGACCAAGCT ATGGGATTTT GCTTGTTCAA TAATGCGGCG
ATTGCGGCGG CCTTTGCTCA ACGTGAATAT GGCCTCAAAC GAGTGGCAAT TTTGGATTGG
GACGTGCATC ATGGCAACGG AACCCAAGAT ATTTTCTACC AAAACCCCGA TGTGCTCTAC
ATCTCGACCC ATGGCTGGCC GCTTTGGCCC AACTCAGGTC ATTGGAAGGA GATGGGCGCG
AACGCTGGAT TGGGCACGAC CCTTAACTTG CCATTACGCC CATTAACTGG CGATATGGGT
TTTCATCTGG TGTTTGAACA AGCGATTGCC CCAGCTATTC GGCGCTTCAA GCCTGAGTTA
TTGATCATCT CGGCGGGCTA TGATGCGCAT ATTTACGACC CATTGGGGAA TTTGGCACTC
TCCACTGGCG GTTATGCCCA GCTATCGTCG ATCGTGTATA ATTTAGCCGC CGAGTGCTGC
GATGGACGCT TGGTGGGCTT GCTTGAGGGT GGTTACAACC TCGAAGCCCT CGCTCAAAGC
CTCACTGCAA CGCTGCAAAC ATGGGTTTCT GGTCAGCCTG CCCCGATTTT CAATCAAGAA
GTCAGTCATA CCCCAGAACC CGATGTTACC TGGCTGATCG AGCATCTCCG CCGCGAACAT
CCGCTTTTGA AGGGATGA
 
Protein sequence
MSLALISHDH YLDHDQPEHP ENANRLRAIH AMLAADYELQ QHLTPLAPRH ATAAEIEAVH 
VPSHLPTLQR MAQFGDWADA ETYILPDSVE IAQLAAGGAI VATDAVLSGR HANSFALVRP
PGHHATADQA MGFCLFNNAA IAAAFAQREY GLKRVAILDW DVHHGNGTQD IFYQNPDVLY
ISTHGWPLWP NSGHWKEMGA NAGLGTTLNL PLRPLTGDMG FHLVFEQAIA PAIRRFKPEL
LIISAGYDAH IYDPLGNLAL STGGYAQLSS IVYNLAAECC DGRLVGLLEG GYNLEALAQS
LTATLQTWVS GQPAPIFNQE VSHTPEPDVT WLIEHLRREH PLLKG