Gene Athe_1031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1031 
Symbol 
ID7409588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1125193 
End bp1126509 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content35% 
IMG OID643715397 
Producthistone deacetylase superfamily 
Protein accessionYP_002572905 
Protein GI222529023 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAAAG CAAAAAATAG TTTAGGACTC ATACTCTTTC CTGCCTTTGA CTGGAGAATT 
TCTCCAACGC ATCCTGAAAG AGAAGAAAGA CTTTTGTATA CAATTGACCA GATAGAAGAA
GAGGGACTTT TTGACTATGA AAATATAAAA ATTTACAATC CTGAAATGAT TGACTCAAAA
TATATAGAGA TGACCCATTT TTGCATTCCT ACTATTTCTT CAATTGTTAC CGTATCTCAC
AAGATAGCTG CAGGCTCAGC AATAACTATT GCTAAAAAGG TTCTGTCAAA GGAAGTAGAG
AAAGGCTTTG CTCTTATTCG ACCACCCGGC CATCATGCAC ACAGAGTAAC ATATGGCGAC
AGAGGATTTT GTATTATAAA CAATGAGGCG ATAATGGTGG AATATTTAAG AAAAGAATAT
AAAATTAAAA AGATAGCAAT AATAGACACT GACTGTCATC ATGGTGATGG AACACAGGAT
ATCTTTTGGA ACGACAAAGA TGTCTTGTTT ATTTCTCTGC ACCAAGATGG CACAACCCTA
TATCCCGGCA CTGGTTTTAC TGACGAAATA GGGGGACCAT CAGCAATTGG TTATACTTTG
AATCTGCCCT TGCCACCGTA TACATCTGAT GATGGCTTTT TATATTGCTT AGACAATCTT
ATCATCCCTG TTTTAGAAGA ATTTAAGCCT GATATCATAA TAAACTCAGC AGGGCAGGAC
AACCATTATT CTGACCCACT AACAAACATG AACTTTTCAG CACAAGGATA CGCAAAACTC
ACAGAAAAGT TAAGTCCAAA TATTTCTGTT TTAGAAGGCG GGTATTCTAT TGAAAGTGCT
CTTCCTTACG TCAATTTAGG TATAATATTT GCACTTGCAG GTATTGATTA TTCAAACATA
AAAGAACCTG ATTACAATCC TGAAAGACTT AAACAGTCTC AAAGAACAAC TGATTATATC
AAACGGCTTT GTGAACATGT ATACAGTATA TGGAAATCAA AAGAAGAAAG AGAATACAAA
ATCAAAATGG CAAATCAAGA TTATTATGTC AGAAAAAAAT CCATTTATTA TGACACAATG
GGATTCAGAG AAAATCAGCT TGAAAAAATA AAGAACTGCA AAAAATGCTC AGGTCTAATA
TTAATAGACT CTCTTTGTTT GGGATATAGG GTTTTAGCCA TAGTAATTCC ACACGACGCA
TGTAATGACT GCGAAAATGA AGGTCACAAG CTCTTTGAAA ATACTGAGGT TGGAGACTAT
TCTCATATAC TTTTGCAGGA TAAAAAGAGT TTTGAGTTTT TTAGAAGAAC CTCATAA
 
Protein sequence
MLKAKNSLGL ILFPAFDWRI SPTHPEREER LLYTIDQIEE EGLFDYENIK IYNPEMIDSK 
YIEMTHFCIP TISSIVTVSH KIAAGSAITI AKKVLSKEVE KGFALIRPPG HHAHRVTYGD
RGFCIINNEA IMVEYLRKEY KIKKIAIIDT DCHHGDGTQD IFWNDKDVLF ISLHQDGTTL
YPGTGFTDEI GGPSAIGYTL NLPLPPYTSD DGFLYCLDNL IIPVLEEFKP DIIINSAGQD
NHYSDPLTNM NFSAQGYAKL TEKLSPNISV LEGGYSIESA LPYVNLGIIF ALAGIDYSNI
KEPDYNPERL KQSQRTTDYI KRLCEHVYSI WKSKEEREYK IKMANQDYYV RKKSIYYDTM
GFRENQLEKI KNCKKCSGLI LIDSLCLGYR VLAIVIPHDA CNDCENEGHK LFENTEVGDY
SHILLQDKKS FEFFRRTS