Gene Athe_1856 details

Gene Information

Locus tag: Athe_1856
Symbol:
ID: 7408969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1935834
End bp: 1938215
Gene Length: 2382 bp
Protein Length: 793 aa
Translation table: 11
GC content: 26%
IMG OID: 643716228
Product: transcriptional regulator, AraC family
Protein accession: YP_002573717
Protein GI: 222529835
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
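
The start, end, gene length and protein length values above are mutually consistent if the coordinates are read as 1-based and inclusive and the final codon is a stop codon. Both readings are assumptions made for this sketch, not something the record states, and the function names are hypothetical:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop codon (an assumption)."""
    if gene_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_bp // 3 - 1


length_bp = gene_length(1935834, 1938215)
print(length_bp)                  # 2382
print(protein_length(length_bp))  # 793
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of the record.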


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.1682
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAAGAA CCATATGTAT TAAAAAAAGT GCAAATATGT TCTTAATATT TTTAGTTATG 
GTTTTTCTTT TTCATTTTCT AATGTCAGTT TTCTTATTAT ATATGACTAA AGAACTTATT
TTGAGAGACA TAATTAATGA TAAGTCACTT CAATTAGATA AACTTCGCGA AAAAATAGAC
AATGAACTAA AGAAAATGAA TGAGATAGTT ACACGTATTC TTACTGATGA TGAATTAAAA
TGGGTAGGGA ATCTAAGGGG GTTAAGAGAA AATTCTTTAG ATTTATGGGA GTACTTTGAA
TATTATAAAC ACTTTAAAGA CATAGGCATG ATTAATGGAG AGTTAAAACC TATCATTGTA
CTTTTTTTAC GCGAGGGAGA AATAGTATAT TTTGCTTCCG AAGAATTTGG CTCTTTTTTT
ACATTTGGAT TTGAAAATTT TTGTAATTAC TTTTCACCTG ACAGAATGAA CTGCCGCGTT
TGGCTCAATA AAATATTCGA TAAAAACAAC AGTAAATGCA AAAATAAGTT AATATCACAA
GAATATTCAA TTAGTGGCCA AAGAATATTG GCATTGCATG AAACATATTA CTTTCCATTT
GATAACATTG GAAATCAATT GGCAATTCTT CTGGTAATTA TTGATGTTGC TAAATTTTGT
AAGCTACTAA AGGAAAACAA ATTAAACAGC CAAGATTCAA TAATTTTAAT TTATGATAGG
TGTGAGAAAA AAATATTAAC TTCAAATAAA GCAGATATTA ATAATACAAT AAATAACATA
TTAGAAAAAC TTAGCAAAAA TAAGAAATAT ATTTCTAATC TGTATAATGT AATAACAATA
CGAAATAAAA AGTATATATT TTTACGGTTG GCTTCAAATG TATATGATTG GGATTATGTT
TATCTAGTAC CATATAACAG CATAAAAAGT GAAATATACA TTTCAAGAAC TGTTTATACA
TTGTATCTAA CGGAGTTTAC ACTTTTCATA ATTATTATAA GTTTATATGC TATTTATATC
AAATTATATA AAGTAAGGAC ATATAAAATA ACTAAGGGCT TAAAAAATGT AAACGAGGAG
AATGAGAATA AAATACTTAC TTTTAGGTGC ATAAACAAGT TAAAGGAGAA AGACCGTAGG
TTAATACACA CTAAAATTAC AAGTTACAGG ATTCTTAATA AAATAACCAG TAAACAAGAA
TTATTAAACG ATATGATAAT AGAAAAGCTT ATTTATGGTT GGTCTTATAG CAAAGGGGCA
ATAGAAGAAA AAATCCAAAG TATAGGCTTA AAAATTGGAG GTAAAAAATT TTTAGTTGCC
ATAATCAAAA TGCTCTCTCT TACCAAACAA GTTAGAACAA GAGATATAAT AAAAAATGAG
TTAGAAAGTA TTAGGGTTGA CTCAAAAATT AACTTGGTAT TTTATTATGT TTACCAACTT
GCAGATAGTG ATCTTGCTTT AATTTTAGCA TTTGATGAAG ATGAAGATGA AAAAGTAAGT
CAAAATGTAA ATCTATGGCT GAGATTGATA GGTGATAGAT TAGTAATTAA AATATGTCAC
AAATATTTAA TTGCAGTTGG TCGTATAGTT AATAACATTG ACGAATGTAG AATTTCATTT
TTAGATGCAA AAGAAATTAT TGAAGCTAAT AAATTAATCA CAACAGAGAA CGTTGAGAAT
GGCATACTGT GGTATTATAA TGTTGTTAAG AAAGATAATA ATATATGGTA TCCAATAGAA
ATAGAAGAGA GATTAATGTT GCTAGTTAAT CTTGGGAAAA TATCTGAGAT TGAAGAAATT
CTACACCTAT TATATTATAA AAACTTTGAA GAAAAAGATA TATCACTGAG TTTAAAGTAT
GTGCTAATTA GTGAGTTAAT TGGAACAATT ATTAAAATTG CAAATATGAC AAAAGTAAAT
ATTGACAATC TATTTGACAT TAAGAACTTT ATTTTTATTG AAGAAAATGA CTTTGACAGG
TTATTTGAAG ATATAAAGCA AGTATTTATT AATATAACAG AACAAATCAA GAGAAACAGG
AAGGCTTCTA AGGAAAAATT AATAGAGGAG ATATTAGAAT TTATAAACAA AAACTTATTT
AATCCAAACA TGAGTATATC CCTTGTCGCT GAAAGATTTA ATTTATCTGA ATCTTATTTT
TCTAATATTT TTAAAAATGC AGTAGGTATA AAGTTCAGTG ATTATGTAGA AAAGTTAAGA
ATAGAAGAAG CATATAAATT GATAAAGCAA AAAAAGTGGA ATTTAGACGA TATTAGTAAA
ATGGTAGGAT ATACTAACAT TAAAACCTTT AGAAGAGCTT TTAAAAGGGT AAAAGGTTGT
TTACCCAGTG AAATTTTAAA CATAAATGAA CATGATATAT AG
 
Protein sequence
MGRTICIKKS ANMFLIFLVM VFLFHFLMSV FLLYMTKELI LRDIINDKSL QLDKLREKID 
NELKKMNEIV TRILTDDELK WVGNLRGLRE NSLDLWEYFE YYKHFKDIGM INGELKPIIV
LFLREGEIVY FASEEFGSFF TFGFENFCNY FSPDRMNCRV WLNKIFDKNN SKCKNKLISQ
EYSISGQRIL ALHETYYFPF DNIGNQLAIL LVIIDVAKFC KLLKENKLNS QDSIILIYDR
CEKKILTSNK ADINNTINNI LEKLSKNKKY ISNLYNVITI RNKKYIFLRL ASNVYDWDYV
YLVPYNSIKS EIYISRTVYT LYLTEFTLFI IIISLYAIYI KLYKVRTYKI TKGLKNVNEE
NENKILTFRC INKLKEKDRR LIHTKITSYR ILNKITSKQE LLNDMIIEKL IYGWSYSKGA
IEEKIQSIGL KIGGKKFLVA IIKMLSLTKQ VRTRDIIKNE LESIRVDSKI NLVFYYVYQL
ADSDLALILA FDEDEDEKVS QNVNLWLRLI GDRLVIKICH KYLIAVGRIV NNIDECRISF
LDAKEIIEAN KLITTENVEN GILWYYNVVK KDNNIWYPIE IEERLMLLVN LGKISEIEEI
LHLLYYKNFE EKDISLSLKY VLISELIGTI IKIANMTKVN IDNLFDIKNF IFIEENDFDR
LFEDIKQVFI NITEQIKRNR KASKEKLIEE ILEFINKNLF NPNMSISLVA ERFNLSESYF
SNIFKNAVGI KFSDYVEKLR IEEAYKLIKQ KKWNLDDISK MVGYTNIKTF RRAFKRVKGC
LPSEILNINE HDI
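
As a rough cross-check between the gene and protein sequences above, the sketch below translates the first 60 bases of the gene sequence. It is a minimal illustration, not the database's own pipeline. It builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons, which is not an issue here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering. Table 11 shares these
# amino acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as listed in the record.
first_block = "ATGGGAAGAA CCATATGTAT TAAAAAAAGT GCAAATATGT TCTTAATATT TTTAGTTATG"
print(translate(first_block))  # MGRTICIKKSANMFLIFLVM
```

The result agrees with the first 20 residues of the protein sequence. Passing the full gene sequence to the same function should reproduce the full protein, ending at the terminal TAG codon.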