Gene Athe_2055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2055 
Symbol 
ID7408268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2169507 
End bp2171066 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content30% 
IMG OID643716422 
Producttwo component transcriptional regulator, AraC family 
Protein accessionYP_002573905 
Protein GI222530023 
COG category[T] Signal transduction mechanisms 
COG ID[COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTTTAAGG TGGTTCTAAT TGATGATGAG CCGATAATAA TTGAAGGACT TAAAAAGATA 
TTGGACTGGC ATGCGCTTGG GTTTGAAATA GTTGCAGTTG CATATGATGG AGTAGATGGT
TTTTCCAAAT TGCTTGAACT AAATCCTGAT GTTGCTCTTA TTGACATACG TATACCTGGA
ATTGATGGGC TTTTGTTAAT CCAAAGATTG AGAGAAAAAA ATATTTCAAC AAAAATTATT
ATTCTTTCAG GTTACTCTGA GTTTGAATAT GCCCGAAAAG CTGTGGAACT TGGAGTGGAG
AGTTATCTTC TAAAACCTAT TGACAAACAA CTTCTTGAAG AGAAACTTAT GGCAATCAGG
GAAAAACTGG AGGAAGAATT TAAAATAAAC CGGGCATTTT CAGCTGCTAA GAAACTCACA
AGGGAAAAAG TAGTTGAAAA ATTAGTATTG GGTACTTTGA AAGATACGGA GATAGAGTAT
ATGAATAAAT TCTTTGAACT TCAACTTCCC TGGAAAAAGT ATCAGGTTGC CATAATTCAG
CTGCTAAATG AAAATAGGAA TTCTTGTGAG ATAAATCAAA CAGTCTTGCA ATTAAAAGAA
AAGGTGGATT TGTTTTTGAA TAAAAACTCT TGCGGTTTTT CGACAATTAT AAACAACAAT
ATCTGCATAC TTTTCAAAGA CTTTTGGTAT CCCTTCAATA GTAGAAGCAT TAATATTTTA
AAGGATATGC TCATGAAATA TACGGACGGT CAGATTATTA TTTCAATTGG AAGTGAAGTA
GAAGACTATA GAAATATTAA AAAATCGTTT GAAGAAGCCA ATGAACTTTT AAAAAAGAGA
TTTTTATTGG GCTACAAAGG TTTAATCTTT ATAAAAGAAG CTATTTTTCG ATATGATAAA
ACTGAGAAAG AGTTTGATGA TAAGGAGAAT GCTTATGCAT TAGCAGTAGC AATTGAGTTT
GAGAATTTTG AAAGGATTAA CAATATATTG GAAAATAAGG CAGATAACTT GATAAAGAAA
AATGCATCTG AAGATGAAGC AAAAAGCAGT TTTTACAATT TCTTTGTTGA TGTTTTGTAT
AAACTTTCTC AGAATCAAGA ATACAAACAA ATTGTCGAAA AGTATCTTAC ACAAGAAATT
TTTAAAAACT TATTTACCCA GAAGACTTTA ACTGAGTTAA AAGGACTTAT AAAGTATTAT
TTTACTTTAA TTGCAGAACA AATAAAAAAA CTTCATTCAG ACAATTTCAA AGTTCAGGTT
GAAGAGTTTA TAAAAAGAAA TTATTTTATT GACTTAAAGC TTGAAACATT GGCAGAAATA
TTTGGCTACA ATTCATCCTA TTTTAGTAAA CTTTTCAAAA AAACATTTGG TGAGAACTTT
TCATCTTTTA TCGAAAAAGT CAGAATTGAG AAAGCAAAAG AGCTATTAGA AAATGGGAAG
AAAGTTTCAG AAGTTGCCAA AAAGGTTGGA TATGAAGATA TGGACTACTT TTGTTTAAAA
TTTAAAAAGT ATGTTGGATG TTCGCCTAAG AGCTATAAAG AAAGTTTAAA AAGAAAATAA
 
Protein sequence
MFKVVLIDDE PIIIEGLKKI LDWHALGFEI VAVAYDGVDG FSKLLELNPD VALIDIRIPG 
IDGLLLIQRL REKNISTKII ILSGYSEFEY ARKAVELGVE SYLLKPIDKQ LLEEKLMAIR
EKLEEEFKIN RAFSAAKKLT REKVVEKLVL GTLKDTEIEY MNKFFELQLP WKKYQVAIIQ
LLNENRNSCE INQTVLQLKE KVDLFLNKNS CGFSTIINNN ICILFKDFWY PFNSRSINIL
KDMLMKYTDG QIIISIGSEV EDYRNIKKSF EEANELLKKR FLLGYKGLIF IKEAIFRYDK
TEKEFDDKEN AYALAVAIEF ENFERINNIL ENKADNLIKK NASEDEAKSS FYNFFVDVLY
KLSQNQEYKQ IVEKYLTQEI FKNLFTQKTL TELKGLIKYY FTLIAEQIKK LHSDNFKVQV
EEFIKRNYFI DLKLETLAEI FGYNSSYFSK LFKKTFGENF SSFIEKVRIE KAKELLENGK
KVSEVAKKVG YEDMDYFCLK FKKYVGCSPK SYKESLKRK