Gene Athe_1160 details


Gene Information

Locus tag: Athe_1160
Symbol:
ID: 7408742
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1251759
End bp: 1252784
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 35%
IMG OID: 643715526
Product: Radical SAM domain protein
Protein accession: YP_002573034
Protein GI: 222529152
COG category: [B] Chromatin structure and dynamics; [K] Transcription
COG ID: [COG1243] Histone acetyltransferase
TIGRFAM ID: [TIGR01212] radical SAM protein, TIGR01212 family
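
The length fields above follow from the coordinates. The sketch below shows that arithmetic, assuming the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. Both assumptions agree with the values in this record but are not stated by it; the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(1251759, 1252784)
print(length)                     # 1026
print(protein_length_aa(length))  # 341
```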


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.276403
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAACAGA GGATTTTGCC CATATTTATT CCTCAGTATG CCTGTCCTTT TAACTGTATA 
TTTTGCAATC AAAAAACAAT ATCTGGAGAA AAAGAAGAGG TAAGCTTGGA CAGGATAAAA
AGGCAGATTG AACAAGGGCT GAAAATTAAT TCAGATGAGG AGGTTGAACT TGCATATTAT
GGTGGTAATT TTACTGCCAT CGATATAGAT TTTCAAAAAA AATTGCTTGA ACTTGCTAAT
AGTTTCGAGA GAATAAAGAG TATTCGAATT TCCACAAGAC CTGACTGTAT TGATGAGGAG
AGATTAAGAT TATTAAAACT TTACAATGTG AGGACAATAG AACTTGGAAT CCAAAGCATG
TTTGACCATG TTTTAAATGC GAGTGCAAGA GGACACACTG CTCAACATAG CAAAAATGCA
ATGGAAATGA TAAAAAAGTT TGGGTTTTTA CTCGGGGTTC AAGTTATGGT AGGGCTTCCA
AAATCAACGT CTGAGAAGGA TATTGAAACA GCAAAGATAT TGACCAGTTT TTCACCAGAC
ATTGCAAGGA TTTATCCAAC TCTTGTCATA GAGGGTACTT ATCTTGCAAA GATGTACCAA
AGAGGAGAAT ATGAGCCTCT TTCTTTAAAT GAGGCAGTGG AAAGATGTAG CCAGATAAAA
TCCATATTTA TAGAAAAGGG CGTAAATGTT ATTAGAGTTG GGCTTCAGCC AACAGAAGAG
ATAAATTACA ATGCAAAAGT TTTAGCAGGA CCTTTTCATC CTTCGTTTGG TGAGCTTGTA
GACTCAGAGA TAATATTCAG AAAGGTTGTA GAAAGATTAC AAGACCAAAA GTTTACTAAA
GTATTGTTTA TAGTACATCC TAAGAACTAC TCAAAGTTTG TAGGACAGAA AAAAAGTAAT
ATCAAGAGGT TTAAACTGCT TTTTTCTCAT GCTGAAATTG AAGTGTTATG TAGCAATGAA
GAGGTTGATA AGATAAGGAT GATTACAGAA TCTAACGAAA CTGTAGTTGA TATTAGCAGA
CTTTGA
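
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any computation. A minimal sketch of how a GC content figure such as the one reported in Gene Information could be recomputed from the block-formatted text; the helper names are hypothetical, and the rounding used by the database is not known.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(block_text.split()).upper()


def gc_percent(block_text: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(block_text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence (all lines) into the string.
# Only the first displayed line is used here for brevity.
first_line = "GTGAAACAGA GGATTTTGCC CATATTTATT CCTCAGTATG CCTGTCCTTT TAACTGTATA"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line), 1))
```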
 
Protein sequence
MKQRILPIFI PQYACPFNCI FCNQKTISGE KEEVSLDRIK RQIEQGLKIN SDEEVELAYY 
GGNFTAIDID FQKKLLELAN SFERIKSIRI STRPDCIDEE RLRLLKLYNV RTIELGIQSM
FDHVLNASAR GHTAQHSKNA MEMIKKFGFL LGVQVMVGLP KSTSEKDIET AKILTSFSPD
IARIYPTLVI EGTYLAKMYQ RGEYEPLSLN EAVERCSQIK SIFIEKGVNV IRVGLQPTEE
INYNAKVLAG PFHPSFGELV DSEIIFRKVV ERLQDQKFTK VLFIVHPKNY SKFVGQKKSN
IKRFKLLFSH AEIEVLCSNE EVDKIRMITE SNETVVDISR L
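
The protein sequence starts with M although the gene sequence starts with GTG. This is expected under translation table 11, which assigns the same amino acids to codons as the standard code but accepts alternative initiation codons (GTG among them) that are read as methionine at the first position. The sketch below illustrates this in plain Python; the function name and the handling of the start codon are illustrative, not the database's own pipeline.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(block_text: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at '*'."""
    seq = "".join(block_text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAAACAGA GGATTTTGCC CATATTTATT"))  # MKQRILPIFI
```

Passing the full gene sequence to `translate_cds` allows the result to be compared against the protein sequence listed in this record.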