Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1825 |
Symbol | |
ID | 4809809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2159911 |
End bp | 2162709 |
Gene Length | 2799 bp |
Protein Length | 932 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107239 |
Product | Hpt sensor hybrid histidine kinase |
Protein accession | YP_001038239 |
Protein GI | 125974329 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAGTG ATGAACTTGA CTTCTGCGCT TTTTTTGATA TTGAGGAACA TTTATATTGG CAGGGTGATG CAATGAAGTC AAAAAGGATG ATAAAGCAAT CCACCTTAAG TGCGATTATT TTGGCAATAT TCTTCTTTAT AGCTGTTATG CTTATTTGGA GTATATTGTA TATGAATTCC TGCATAAGAG CGGAGCAAAA TGCCGAAAGG AGGCGGACGG AATTTAAGCA GCTGGGAATT GACCTGGCGG ATGCCTCGGA CTATTTAACT GATGAAGCGC GAAAGTTTGC GGTTACAAGA AAAATAGTTC ATTTGGAAAG ATATTGGGAG GAAATAAATG TAACCAGAAC ACGGGATAAA GTTATTTCAA GGCTTCAGGA ATTGGATTCT CCCAAAGAGG AGCTGGAACT GCTGGCAAAA GCCAAAAAGT ATTCCGATGC TTTGGTGGAG ACCGAAAGAA GATCCATGCG TTTGGTTTTG GAGGCGTTGG GAGTGGATGA GTCCGACATG GTTCCGGAGG TTGCTTCCTT CAAGTTAAGT GAGGGGGATC AAAGATTAAG CAGGGAGGAT AAGCTGCTAA AGGCAATAGA AATAATGTAT GATGCCCGCT ATGACAGTGA CAAGAAAAAT ATAATGGACC CGATAGCGAA ATTTCAAAGG ATAATGAACC AGAGGCTTGA GTCGGAGTTG GAAGCGGCCA GGAGAGGTAC GGCCGGGGCG ACGGTTTTGC AGATTATCCT GGCTTTCGTT ATAATCCTGG CAATTGCGGT TCTGATTAGA ATTCTGTTCA CGCAGGTTAC CTATCCCATA AGGGATTACA TATTAAAGTT AAAAGATTTT TCCTTTGATG ATGAAGAATT CAGGCTTGTA CCAAAGGGGA CCGTGGAACT TAACATGCTT GCCGAGAACT TTAATGAGTT GTACAGATCC TTTCACAATG AGCTGGTAAG GAGAAAGAAG GCGGAAGAGA CAATGAAAGC GGCCAGAGAC GAGGCTGATA AGGCCAACAG GGCGAAAAGC GAATTTCTTG CCAGTATGAG CCACGAGATC CGGACCCCTA TAAACAGTAT TATTGGCTAT CAGTATTTGC TTAAAAACTC CGTTCTTTCT CCCAAACAAA GGGAGTATGT CGAAAATATC GGGCTGGCTG CAAAGAACCT TTTGGCGATT ATTAATCAAA TATTGGATTT TTCCAAGATT GAAGCGGGAA GAATGGTTTT GGAAGAGGTG GACTTCAATA TTGATGATGT GTTGAACGAA CTTATGATAA TTGTGGGCAT GGAGGCCAAA AGAAAGGGAA TTGAACTTAG AATCAAGGTC GATGAAGATG TTCCCAGGTT TTTAAAAGGG GATATAACAA GACTTAAGCA GGTTGTTATG AATTTGGTAT CAAACGGGAT TAAGTTTACT CATGAGGGCT ACATTTCCAT CAGGGTGGAA CTGGTTGAAA AAAATGAGGA GAATGCCTGC ATAAAATTTA GTGTAACGGA TACCGGTATA GGGATAAGTG ATGAACAGAA GAAATTACTG TTTCAAGCCT TTACCCAGGG TGATGCATCC ACTTCCAGAA AATATGGGGG CACCGGTCTT GGGCTGGCTA TTTGCAAAAG GCTGGTGGAG CTTATGAAGG GGGAAATAAA TGTAGAAAGC GAAGTAGGCA AAGGTTCTAC CTTCAGTTTT TCCCTGAGAT TGAAAATAGC AAGCTGCAAT GAGAATAGAA ATGGAAAAAG CAAATCGGTT GACACAGAGG AACAGTACAA AAACGTAAAA ATACTTCTGG TGGAAGACAG CTCCGTTAAT TTGCAAATGA CTAAAGAGAT TCTTGAAAAT ATGGGGATCG ACACCGATAC TGCCCAAAGC GGTGAGGAGG CCGTAAAAAA AGCAGAGAGC AATGAATATG AGCTGATACT TATGGATATA AGGATGCCGG GAATGGATGG ATATGAAGCT ACAAGGCGAA TCAGAAAGCT GGAGAGAGGC AGTATGCCCA TTGTGGCATT AACGGCCGAT GCAGTGGAAG GTGTGGCTCA AAAAGCAAAG GAGGCAGGAA TGAACGGCTA TCTGACAAAG CCTTTGGAGC CTGAAAAACT TTTGGAAGTT ATAAGAAGTA TGACAAATGG CGGTGGCAAA TTTAAAGAAG AAAAGAGTAA AAGCTGCGCA AAAGCTGCAG AGGCGGAAAA TCCCTATTAT GAACACAAGT ATGAGACATT GGACTTTGAC GGTGCTGTAA ACAGATTGGG AGGAAAGAGG GATAAGTATA TCGGCATTCT TAAAAGCTTT GTCGAACTTC ATAAAGATGA CGGCGTAAAG ATAAGGGAGC TTGCTGCGTC GGGAAAAAGG GATGAACTTA GGAGGTTTCT CCATTCCCTT AAGGGAAGTG CGGCAAATAT CGGAGCATTA CGGCTTAAAG ACTTGCTGGC GAGATTGGAA GAAAGCTGTA TACCTCAGAA TTGTGAGAAG AATGCAAAAT GGATGAACGA TTTGGAAAAG GAGTTTGAAA GATTAATTGA AGAGATAATG CAATATATCC GGGCTTTTGA TTCCTATAAG GGGAGTAGTC CGGAAAATCA TGAAAACAAT AATGTTCGTG AGGATTTGGA AATGCTCTGT AAACTCCTTC TTACCGGAGA TTCAGAGGCT AAGAGTTTTT TTGAAGAAAA GCTGGCATAT CTCGGTAATG TATTGCGGCC TGAGGACTAC CATGACTTGA AGAAGAAAAT TTCATGCTAT GAATTTCAAA AAGCTTTGGC TATAGTAGAC AAACTCAAAC AGGATTTCTC CGGTAAGCTT ACTAAATAG
|
Protein sequence | MPSDELDFCA FFDIEEHLYW QGDAMKSKRM IKQSTLSAII LAIFFFIAVM LIWSILYMNS CIRAEQNAER RRTEFKQLGI DLADASDYLT DEARKFAVTR KIVHLERYWE EINVTRTRDK VISRLQELDS PKEELELLAK AKKYSDALVE TERRSMRLVL EALGVDESDM VPEVASFKLS EGDQRLSRED KLLKAIEIMY DARYDSDKKN IMDPIAKFQR IMNQRLESEL EAARRGTAGA TVLQIILAFV IILAIAVLIR ILFTQVTYPI RDYILKLKDF SFDDEEFRLV PKGTVELNML AENFNELYRS FHNELVRRKK AEETMKAARD EADKANRAKS EFLASMSHEI RTPINSIIGY QYLLKNSVLS PKQREYVENI GLAAKNLLAI INQILDFSKI EAGRMVLEEV DFNIDDVLNE LMIIVGMEAK RKGIELRIKV DEDVPRFLKG DITRLKQVVM NLVSNGIKFT HEGYISIRVE LVEKNEENAC IKFSVTDTGI GISDEQKKLL FQAFTQGDAS TSRKYGGTGL GLAICKRLVE LMKGEINVES EVGKGSTFSF SLRLKIASCN ENRNGKSKSV DTEEQYKNVK ILLVEDSSVN LQMTKEILEN MGIDTDTAQS GEEAVKKAES NEYELILMDI RMPGMDGYEA TRRIRKLERG SMPIVALTAD AVEGVAQKAK EAGMNGYLTK PLEPEKLLEV IRSMTNGGGK FKEEKSKSCA KAAEAENPYY EHKYETLDFD GAVNRLGGKR DKYIGILKSF VELHKDDGVK IRELAASGKR DELRRFLHSL KGSAANIGAL RLKDLLARLE ESCIPQNCEK NAKWMNDLEK EFERLIEEIM QYIRAFDSYK GSSPENHENN NVREDLEMLC KLLLTGDSEA KSFFEEKLAY LGNVLRPEDY HDLKKKISCY EFQKALAIVD KLKQDFSGKL TK
|
| |