Gene Athe_2550 details


Gene Information

Locus tag: Athe_2550
Symbol:
ID: 7409501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2667562
End bp: 2669070
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 33%
IMG OID: 643716914
Product: two component transcriptional regulator, AraC family
Protein accession: YP_002574391
Protein GI: 222530509
COG category: [T] Signal transduction mechanisms
COG ID: [COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain
TIGRFAM ID:
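
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2667562, 2669070)
print(length)                     # 1509
print(protein_length_aa(length))  # 502
```

Both results agree with the Gene Length and Protein Length fields in this record.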


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGTACA AAGTGTTGAT AGTTGAAGAT GAGGTATTTA TGAGAGAAGG ACTAAAAAAC 
CTAATTGACT GGAAAGAACT TGGGTTTGAG ATAGTGGGCG AGGCAGAAGA TGGTCTTTCG
GCATTTGAGT TTTTGAAAAA AATGCAGGTT GACGTGCTCA TCAGCGATAT CAAGGTTCCG
CTATTAAGTG GACTTGACCT TATTGAAAAG GTTAAAAGGG AAATAAGAAA CCCTCCTGAG
ATAATAATCA TCAGCGGGTA TGCAGATTTT GAGTATGCAA AAAAAGCTAT CCAGCATGGA
GTTGTAAATT ACATATTAAA GCCTATTGAA GAAGAAGAGC TTGTTGATAC GCTTCTTAAA
ATAAAGTCAA AACTTAAAAA AAGGGAGTTA ATTAAGGAAG GCGAGAGTCT ACTTGCACTT
GAGAGAAGCT TTAAAGAAAA GATAGAAAGT AACAATATCA AAATAGAAAA TGGAGTTTAT
GTTGGAATAG TCTATTTTAA AGAAATGTCG AGCTGGCTTA GTCTTTATTT TGATGAGGGA
ATAGAAAATA TATTGTCAAG GATAAAGGGT TGTTTTGAAC AACTAAAAAA AGAGAGACTT
ATGTATGAAT TTTCAGAAAT TGATGGCAAG TATTATGTAG TTACAACATC TGAAGAATAT
ATAAAAAAGG CGTATCAGAG CATAAAAGAG ATGGTGGACT TTGCGACAGA GATTGTATTT
GCTTTTTCAA GAAGGATTAC AAAATGGGCT CAATTTTTTG ATGCTATATA CGATGCAGCA
TACTCTTTAA ACTTTGCACT TTTTTATAAT CAAACAGGAA TTACATTTTA TAGTACTAGT
CTGCCCAGTT CAAAAGAGCT TTTGTACTCG AAAGAATATG ACAAAAATCT CATTTTGGCT
ATCGAAGAAG AGAATAGCCA GAACCTACAA AAGATAATTA AAGAAATGTT GGAGGATAGT
AAAAAAGGAA GATATCAGCT TGACTTTTTG AAAACATATT TGAGCTTTGT AGTGGTATCT
ATAAGTTCTT ATTTTAGCAG GATAGGACTC AATTTAGAAG ATGAGATAGA GTGGTTTTCA
AGCTTGAGGC TGGAATTTTC AAACGTTGAG GATATAGAAA AAAATATGCT TAAGTTTTGC
GAAGGGATAC TTAACAAGTT CAGACAGTGG AAAACAAGCC TATCTAACGG TATCATGAGT
GAGCTTGAAA AGTATATAAA AGAAAACTAT AACAAGAACC TAACGCTGAA ATCTGTTGCG
CAAAAGTTTT ATCTAAATCC TGTATACTTG GGTCAGCTTT TCAAAAAGCA TTATGGGATG
TATTTCAACT CTTATTTGCA AAAGATACGC GTTGAAGAGG CAAAAAGGCT TTTGATGTCA
ACAAATATGA AGATATACGA AATTTCACAG GCAGTTGGGT ACAACGACAC AGACTATTTT
ATCCAGTGCT TTACAAAATT TTGCAATATG ACGCCTAATC AGTTTAGAAA AAAGTACAGA
AAAGTATAA
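
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be cleaned before any statistic such as GC content can be recomputed from it. The following is a rough sketch of one way to do that; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence in this record.
print(gc_percent("ATGCTGTACA AAGTGTTGAT"))  # 35.0
```

Passing the full gene sequence to `gc_percent` would give a figure to compare with the GC content field of the record.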
 
Protein sequence
MLYKVLIVED EVFMREGLKN LIDWKELGFE IVGEAEDGLS AFEFLKKMQV DVLISDIKVP 
LLSGLDLIEK VKREIRNPPE IIIISGYADF EYAKKAIQHG VVNYILKPIE EEELVDTLLK
IKSKLKKREL IKEGESLLAL ERSFKEKIES NNIKIENGVY VGIVYFKEMS SWLSLYFDEG
IENILSRIKG CFEQLKKERL MYEFSEIDGK YYVVTTSEEY IKKAYQSIKE MVDFATEIVF
AFSRRITKWA QFFDAIYDAA YSLNFALFYN QTGITFYSTS LPSSKELLYS KEYDKNLILA
IEEENSQNLQ KIIKEMLEDS KKGRYQLDFL KTYLSFVVVS ISSYFSRIGL NLEDEIEWFS
SLRLEFSNVE DIEKNMLKFC EGILNKFRQW KTSLSNGIMS ELEKYIKENY NKNLTLKSVA
QKFYLNPVYL GQLFKKHYGM YFNSYLQKIR VEEAKRLLMS TNMKIYEISQ AVGYNDTDYF
IQCFTKFCNM TPNQFRKKYR KV
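
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons with translation table 11, the table named in this record. The sketch below builds the codon table from the standard TCAG ordering. It assumes the gene sequence is given in coding orientation and it does not handle the alternative start codons that table 11 allows, which is not an issue here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order. Translation table 11 uses the
# same amino acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # strip block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGCTGTACA AAGTGTTGAT AGTTGAAGAT"))  # MLYKVLIVED
print(translate("AAAGTATAA"))                         # KV
```

The two outputs match the first ten and last two residues of the protein sequence listed above.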