Gene Athe_2056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2056 
Symbol 
ID7408269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2171070 
End bp2172833 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content31% 
IMG OID643716423 
Productputative sensor with HAMP domain 
Protein accessionYP_002573906 
Protein GI222530024 
COG category[T] Signal transduction mechanisms 
COG ID[COG2972] Predicted signal transduction protein with a C-terminal ATPase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAT GGCTGACAAA CCTTAAAATG CGTAAGAAGT TCATATTAGC ATTTATCATA 
TCAGCTTTTA TTCCCCAAAT TGTTTTGGGG ATTATTCTTT TTTTAAATCT TCATGCTATA
GCACTGGAAA ATGCTATAAA CAATACAAAA AGAAATGTTC AGGACGTGAA GAAAAACCTT
TTAGATGTTT TGCAAAATGC TGTAGATATT TCAAATAAGC TTTATCTTGA CAAAAAGCTT
TTAGACATCC TTTCTACTGA ATACAGTGAT GTTTCAAAAT TATATGAAGA TTATACATCG
TATAAAGAGT TTTCGAACTT GCTTTCTATT TATAATAAGA ATATTCATCT CATAAAAGTC
TATACTTTCA ATCCAACATT ACTTGACACT GGGGAATTTG TAAAGGTAGA TAACTATATT
AAGAAACAAA GATGGTTTAT CCACGCTTTA AAAGAGGACG GTAAAATATT ATGGGAGCTT
ATATTTGACA ACAGCCCATT CAGACCTCAG TATTATTTTA GTTTGGTGAG GTTACTTAAA
AATTCTTATG GCGAAAAAGT CGGAGTGATG GTAATTTACA TAAAAAAAGA GAAAATTGAT
GAGATTCTTT CTCAAAATGA AAATACAATT GTTGTTACAG ATAAAGGAAC TGTAGTTGCA
GCAAAAGATG AAAGTTTGAT TGGTAAAACA ATAGATATCA AAGCTTTTGA GGATGGCGAT
AGATTAATAG AAAATGTGAA AATAAATCAG AGAAATCTTA TGGCTCTTGT TGGAACTATT
GCACCAAATG AGACGGGAGG GAATTATCTA AAGGTTATTT CTTTCTTCTC TAAGAAGGAG
ATTTTTAAAG TACCAAATAA GGTTTCATTT TTTGCTTTTG TGGTGATTAC AGTAAATTTA
TTAATTTCGT TATTTCTGAT GCTTCTTTTT TCAAAGTTAA TTACTGATAG ACTAACTATA
TTAAACGAAA AGGTAAATCA AATTTCTCAC GGAAAACTTG ATACCAGCAT AGAGATTTTG
GGGAAAGACG AAATCGGACA GCTTGCAGAA AATGTCAAAG AAATGGCAAA AAATATCAAA
AATCTTATTG AACAGGTTTA TTTAGCCGAG ATTCAAAAGC AGCAAATGAT CACCAAACAA
CGAGAGATTC AGTTTGAGAT GCTCTGCAGC CAAATAAATC CCCACTTTAT ATTCAATACT
CTTGAGGCTA TTAGGATGAA AGCATTTTGT AGTGGGCAAG AGGAAATTTC GCACATTGTA
TATCTTCTAA GTAACTTATT AAGAAAAAGC ATAACAGTAA GTTCAGAGCT GATTTCACTA
AAAGAGGAAA TTGAATTTGT TCAACAGTTT TTGGAGATTC AAAAATTCAG GTTTGGTGAT
AGGATAGATT TTGATATTCA GATAGATGAA GACCTTTTCA ATCAAAAGAT ACTACCTTTT
ATAATTCAGC CTCTTGTAGA GAATTCAATA AAACACGGAA TCGAACCGAA AGTTGGAAAG
GGTTATATTA GTATCAGAAT TTTCAAAAGA GATGAAAAAA TTGTTATTAG AGTTGAGGAT
AATGGAATCG GAATGAAAAA AGAGGAATGT GATAACTTAA TAACCTTACT CAAGTCAGAC
CAAAAAGATG CTCATGTAGG TCTTAGAAAT GTATACACAA GATTGAAATT GTTTTATGGT
AATGAATTTG AGTTTTTAAT CAAGAGTGAG TATGGAAGTG GAACAGTGGT TGAAATAACT
GTTCCAAGCA AGGGTGGTGA ATAG
 
Protein sequence
MKIWLTNLKM RKKFILAFII SAFIPQIVLG IILFLNLHAI ALENAINNTK RNVQDVKKNL 
LDVLQNAVDI SNKLYLDKKL LDILSTEYSD VSKLYEDYTS YKEFSNLLSI YNKNIHLIKV
YTFNPTLLDT GEFVKVDNYI KKQRWFIHAL KEDGKILWEL IFDNSPFRPQ YYFSLVRLLK
NSYGEKVGVM VIYIKKEKID EILSQNENTI VVTDKGTVVA AKDESLIGKT IDIKAFEDGD
RLIENVKINQ RNLMALVGTI APNETGGNYL KVISFFSKKE IFKVPNKVSF FAFVVITVNL
LISLFLMLLF SKLITDRLTI LNEKVNQISH GKLDTSIEIL GKDEIGQLAE NVKEMAKNIK
NLIEQVYLAE IQKQQMITKQ REIQFEMLCS QINPHFIFNT LEAIRMKAFC SGQEEISHIV
YLLSNLLRKS ITVSSELISL KEEIEFVQQF LEIQKFRFGD RIDFDIQIDE DLFNQKILPF
IIQPLVENSI KHGIEPKVGK GYISIRIFKR DEKIVIRVED NGIGMKKEEC DNLITLLKSD
QKDAHVGLRN VYTRLKLFYG NEFEFLIKSE YGSGTVVEIT VPSKGGE