Gene Athe_0799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0799 
Symbol 
ID7407986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp888376 
End bp889500 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content35% 
IMG OID643715177 
Productputative transcriptional regulator 
Protein accessionYP_002572687 
Protein GI222528805 
COG category[K] Transcription 
COG ID[COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAAGT ACAAGCTTAA AATGCTACTT GAGTCTGATG AGGGACCGAA ACTTGATTTT 
AAACAGTCTC TTTCGCTTGA GACAGATGGT GAAAAGAAAG AGCTTGTAAA AGACGTCATT
GCCATTGCAA ATTCAAGAGG TGGAAGGGGC TATATCATCT TTGGGGTTGA AGATAAGACA
AAAAGAATTG TTGGAATTAA AGACGAGAAC ATCTCTGAAG AAAAAATTCA GCAAATAATC
TCAAGTAGGT GTGACCCACC TGTATCAATT AAATTTGAGA TTGTAGAGTA CAATGATAAA
AAACTTGGCG TACTCACAAT ATACAAAAGC AGTCTAAGAC CTCATCAGAT GGTTCAAAAC
GGCGTGTTTT ATATAAGACG TGGGTCAACA ACAGACGTTG CAAGAAGAGA AGAGATAGCT
TCCATGTTTG AAGAAAGCGG CAGCGTCAAT TTTGAGATGT CTATTGTAAG AAATGCAAAT
TTAAATGACC TTGAACCTGA GCTAATCTCT ATATTTTTTA AAAGAAGTGG CATCTCATCC
GAGTGGGATA ACTTAATTTT GCTTGAAAGT TTTGGGATTG TCCAAAGAGA TAGAGAAAAT
AATAATCTTT ATCCTACCTT AGCAGGTATT CTTGTGTTTG GAAAATATCC AGAGAGATTT
TTGCCGTCAG CGTATTTGAC AATTGAATTT TTTGACCAAA TTCAGATTAT TTGTGGTAAC
ATATATAGTA TAATCAAAAA GACTATAAAT TTTTTTACTC AGAAATATCC TCAGAAGGAC
CTGTGGGCTT TGTTCGAAGC AATTGGAAAT GCACTTGTTC ACAGGGATTA TTATGACTTG
GCAAGATGCA CGGCAGTCAA GATTAGTGAG AGAAGCATTG AAGTTGCAAA TCCTGGATGT
CTTCTTGAGA GCAATATGAT ATTTAGTATG GGTAGGGAAA TTATACCAAG ACGTCGGAAT
CCGTGGATTT ATCAAAAGAT GATAATTTTA GATGACAATA ATCTCTTTTT GAAAGCTGGC
AAGGGAATAT CAAGGATAAG AAAAACATAT TCAAATGTGA AGATTATAAA TATAAACTCA
CAGAATACAT TTAAAATAAT ATTGCCACCT ATTGATAAAC TGTAA
 
Protein sequence
MDKYKLKMLL ESDEGPKLDF KQSLSLETDG EKKELVKDVI AIANSRGGRG YIIFGVEDKT 
KRIVGIKDEN ISEEKIQQII SSRCDPPVSI KFEIVEYNDK KLGVLTIYKS SLRPHQMVQN
GVFYIRRGST TDVARREEIA SMFEESGSVN FEMSIVRNAN LNDLEPELIS IFFKRSGISS
EWDNLILLES FGIVQRDREN NNLYPTLAGI LVFGKYPERF LPSAYLTIEF FDQIQIICGN
IYSIIKKTIN FFTQKYPQKD LWALFEAIGN ALVHRDYYDL ARCTAVKISE RSIEVANPGC
LLESNMIFSM GREIIPRRRN PWIYQKMIIL DDNNLFLKAG KGISRIRKTY SNVKIININS
QNTFKIILPP IDKL