Gene Cthe_1825 details


Gene Information

Locus tag: Cthe_1825
Symbol:
ID: 4809809
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2159911
End bp: 2162709
Gene Length: 2799 bp
Protein Length: 932 aa
Translation table: 11
GC content: 42%
IMG OID: 640107239
Product: Hpt sensor hybrid histidine kinase
Protein accession: YP_001038239
Protein GI: 125974329
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase; [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
TIGRFAM ID:
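
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2159911
end_bp = 2162709

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with "Gene Length"
print(protein_length, "aa")   # compare with "Protein Length"
```

Under these assumptions the computed values agree with the listed 2799 bp and 932 aa.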


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAAGTG ATGAACTTGA CTTCTGCGCT TTTTTTGATA TTGAGGAACA TTTATATTGG 
CAGGGTGATG CAATGAAGTC AAAAAGGATG ATAAAGCAAT CCACCTTAAG TGCGATTATT
TTGGCAATAT TCTTCTTTAT AGCTGTTATG CTTATTTGGA GTATATTGTA TATGAATTCC
TGCATAAGAG CGGAGCAAAA TGCCGAAAGG AGGCGGACGG AATTTAAGCA GCTGGGAATT
GACCTGGCGG ATGCCTCGGA CTATTTAACT GATGAAGCGC GAAAGTTTGC GGTTACAAGA
AAAATAGTTC ATTTGGAAAG ATATTGGGAG GAAATAAATG TAACCAGAAC ACGGGATAAA
GTTATTTCAA GGCTTCAGGA ATTGGATTCT CCCAAAGAGG AGCTGGAACT GCTGGCAAAA
GCCAAAAAGT ATTCCGATGC TTTGGTGGAG ACCGAAAGAA GATCCATGCG TTTGGTTTTG
GAGGCGTTGG GAGTGGATGA GTCCGACATG GTTCCGGAGG TTGCTTCCTT CAAGTTAAGT
GAGGGGGATC AAAGATTAAG CAGGGAGGAT AAGCTGCTAA AGGCAATAGA AATAATGTAT
GATGCCCGCT ATGACAGTGA CAAGAAAAAT ATAATGGACC CGATAGCGAA ATTTCAAAGG
ATAATGAACC AGAGGCTTGA GTCGGAGTTG GAAGCGGCCA GGAGAGGTAC GGCCGGGGCG
ACGGTTTTGC AGATTATCCT GGCTTTCGTT ATAATCCTGG CAATTGCGGT TCTGATTAGA
ATTCTGTTCA CGCAGGTTAC CTATCCCATA AGGGATTACA TATTAAAGTT AAAAGATTTT
TCCTTTGATG ATGAAGAATT CAGGCTTGTA CCAAAGGGGA CCGTGGAACT TAACATGCTT
GCCGAGAACT TTAATGAGTT GTACAGATCC TTTCACAATG AGCTGGTAAG GAGAAAGAAG
GCGGAAGAGA CAATGAAAGC GGCCAGAGAC GAGGCTGATA AGGCCAACAG GGCGAAAAGC
GAATTTCTTG CCAGTATGAG CCACGAGATC CGGACCCCTA TAAACAGTAT TATTGGCTAT
CAGTATTTGC TTAAAAACTC CGTTCTTTCT CCCAAACAAA GGGAGTATGT CGAAAATATC
GGGCTGGCTG CAAAGAACCT TTTGGCGATT ATTAATCAAA TATTGGATTT TTCCAAGATT
GAAGCGGGAA GAATGGTTTT GGAAGAGGTG GACTTCAATA TTGATGATGT GTTGAACGAA
CTTATGATAA TTGTGGGCAT GGAGGCCAAA AGAAAGGGAA TTGAACTTAG AATCAAGGTC
GATGAAGATG TTCCCAGGTT TTTAAAAGGG GATATAACAA GACTTAAGCA GGTTGTTATG
AATTTGGTAT CAAACGGGAT TAAGTTTACT CATGAGGGCT ACATTTCCAT CAGGGTGGAA
CTGGTTGAAA AAAATGAGGA GAATGCCTGC ATAAAATTTA GTGTAACGGA TACCGGTATA
GGGATAAGTG ATGAACAGAA GAAATTACTG TTTCAAGCCT TTACCCAGGG TGATGCATCC
ACTTCCAGAA AATATGGGGG CACCGGTCTT GGGCTGGCTA TTTGCAAAAG GCTGGTGGAG
CTTATGAAGG GGGAAATAAA TGTAGAAAGC GAAGTAGGCA AAGGTTCTAC CTTCAGTTTT
TCCCTGAGAT TGAAAATAGC AAGCTGCAAT GAGAATAGAA ATGGAAAAAG CAAATCGGTT
GACACAGAGG AACAGTACAA AAACGTAAAA ATACTTCTGG TGGAAGACAG CTCCGTTAAT
TTGCAAATGA CTAAAGAGAT TCTTGAAAAT ATGGGGATCG ACACCGATAC TGCCCAAAGC
GGTGAGGAGG CCGTAAAAAA AGCAGAGAGC AATGAATATG AGCTGATACT TATGGATATA
AGGATGCCGG GAATGGATGG ATATGAAGCT ACAAGGCGAA TCAGAAAGCT GGAGAGAGGC
AGTATGCCCA TTGTGGCATT AACGGCCGAT GCAGTGGAAG GTGTGGCTCA AAAAGCAAAG
GAGGCAGGAA TGAACGGCTA TCTGACAAAG CCTTTGGAGC CTGAAAAACT TTTGGAAGTT
ATAAGAAGTA TGACAAATGG CGGTGGCAAA TTTAAAGAAG AAAAGAGTAA AAGCTGCGCA
AAAGCTGCAG AGGCGGAAAA TCCCTATTAT GAACACAAGT ATGAGACATT GGACTTTGAC
GGTGCTGTAA ACAGATTGGG AGGAAAGAGG GATAAGTATA TCGGCATTCT TAAAAGCTTT
GTCGAACTTC ATAAAGATGA CGGCGTAAAG ATAAGGGAGC TTGCTGCGTC GGGAAAAAGG
GATGAACTTA GGAGGTTTCT CCATTCCCTT AAGGGAAGTG CGGCAAATAT CGGAGCATTA
CGGCTTAAAG ACTTGCTGGC GAGATTGGAA GAAAGCTGTA TACCTCAGAA TTGTGAGAAG
AATGCAAAAT GGATGAACGA TTTGGAAAAG GAGTTTGAAA GATTAATTGA AGAGATAATG
CAATATATCC GGGCTTTTGA TTCCTATAAG GGGAGTAGTC CGGAAAATCA TGAAAACAAT
AATGTTCGTG AGGATTTGGA AATGCTCTGT AAACTCCTTC TTACCGGAGA TTCAGAGGCT
AAGAGTTTTT TTGAAGAAAA GCTGGCATAT CTCGGTAATG TATTGCGGCC TGAGGACTAC
CATGACTTGA AGAAGAAAAT TTCATGCTAT GAATTTCAAA AAGCTTTGGC TATAGTAGAC
AAACTCAAAC AGGATTTCTC CGGTAAGCTT ACTAAATAG
 
Protein sequence
MPSDELDFCA FFDIEEHLYW QGDAMKSKRM IKQSTLSAII LAIFFFIAVM LIWSILYMNS 
CIRAEQNAER RRTEFKQLGI DLADASDYLT DEARKFAVTR KIVHLERYWE EINVTRTRDK
VISRLQELDS PKEELELLAK AKKYSDALVE TERRSMRLVL EALGVDESDM VPEVASFKLS
EGDQRLSRED KLLKAIEIMY DARYDSDKKN IMDPIAKFQR IMNQRLESEL EAARRGTAGA
TVLQIILAFV IILAIAVLIR ILFTQVTYPI RDYILKLKDF SFDDEEFRLV PKGTVELNML
AENFNELYRS FHNELVRRKK AEETMKAARD EADKANRAKS EFLASMSHEI RTPINSIIGY
QYLLKNSVLS PKQREYVENI GLAAKNLLAI INQILDFSKI EAGRMVLEEV DFNIDDVLNE
LMIIVGMEAK RKGIELRIKV DEDVPRFLKG DITRLKQVVM NLVSNGIKFT HEGYISIRVE
LVEKNEENAC IKFSVTDTGI GISDEQKKLL FQAFTQGDAS TSRKYGGTGL GLAICKRLVE
LMKGEINVES EVGKGSTFSF SLRLKIASCN ENRNGKSKSV DTEEQYKNVK ILLVEDSSVN
LQMTKEILEN MGIDTDTAQS GEEAVKKAES NEYELILMDI RMPGMDGYEA TRRIRKLERG
SMPIVALTAD AVEGVAQKAK EAGMNGYLTK PLEPEKLLEV IRSMTNGGGK FKEEKSKSCA
KAAEAENPYY EHKYETLDFD GAVNRLGGKR DKYIGILKSF VELHKDDGVK IRELAASGKR
DELRRFLHSL KGSAANIGAL RLKDLLARLE ESCIPQNCEK NAKWMNDLEK EFERLIEEIM
QYIRAFDSYK GSSPENHENN NVREDLEMLC KLLLTGDSEA KSFFEEKLAY LGNVLRPEDY
HDLKKKISCY EFQKALAIVD KLKQDFSGKL TK
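
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this for the first 60 bases only. It is a plain-Python illustration, not the database's own procedure. The `translate` helper and the compact codon-table string are assumptions introduced here. The table holds the standard codon assignments; translation table 11 is assumed to use the same assignments and to differ only in its permitted start codons.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Hypothetical helper for illustration; whitespace in the input is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGCCAAGTG ATGAACTTGA CTTCTGCGCT TTTTTTGATA TTGAGGAACA TTTATATTGG"
print(translate(first_line))  # compare with the start of the protein sequence
```

The result can be compared with the first two ten-residue groups of the protein sequence, MPSDELDFCA FFDIEEHLYW.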