Gene Information |
Locus tag | Cthe_0806 |
Symbol | |
ID | 4810424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 972830 |
End bp | 975538 |
Gene Length | 2709 bp |
Protein Length | 902 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106223 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_001037234 |
Protein GI | 125973324 |
COG category | [K] Transcription; [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase; [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
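The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(972830, 975538)
print(length)                  # 2709
print(protein_length(length))  # 902
```

The strand (`-`) does not affect these lengths; it only determines which strand of the replicon carries the coding sequence.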
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTGCA GAGTAAAAAA CGAGGTGAGG GATGATTTGA ATGTGAACCT CGAGGTGATG GTGGCAGGAA ATGACGAGGA GTTTGCAGCA TCCCTAAAGA ACGCTCTTTT GGACGTGGGG TGCAAGCATT CCGTAAAAAG TGCTGCTACA TTTAATGAAT TTAAAAACTA TTTAAGAAAA AATAATTTTC ATATTGTTTT TTTTGACTGT TCTTCTGACA TTTCTTTTGC CGATGTGATG GAGAGTTTTG AAAATCTTAA AAAAGAAATG CCCGTTGTTG CAGTTTTGGA AGAAGACGGA AGTAGGCATG GTTTTGAAAT GATAAAACTG GGAGCCTGTG AGTATCTCGA AAAGAAAGAT TCCTCAAGGC TAAGATCCAT TATATTTAAA AGATGCCGGG AGCTTGAACG GGAAAGAGAA CGAATAAGGT TTGAAAAAAG GCTGAAGGAG GAAAATGAAA GACTGTTAAC AACTTTGGAA AGCATAGGGG ACGGCGTAAT TGTTACCGAC AAGGCGGGTT GTGTAATCAT GATGAACAAA GCGGCCCAGG AACTGACAGG GTATTTGTCC TGTAATTCAA TTGGGAAACC GCTGTCGGAA GTTTTTGTGA TAATAAACGG TGTAAACCGT GAGCCTGAGC CCGATCCCTA TGAGAGGGTA TTGTCCGAGC GGAAGACCGT GGGCCTTAGG AAAAATACCA TGCTGGTTTC AAAAGACGGT ACTTTAAGGT ATGTTTCGGC AAGCACCGCT CCGATAAAAA GTTCTGACAG GGAAATAATG GGTGTTGTTG TAGTTTTCAG AGACATAACT AGGATAAGAA AAGCGGAAGA GAAGCTTCAG AAATTATCCC AGGCAGTGGA GCAAAGTCCC GGTATCATTG TGATGGCAGA CACGCGTGGA AATATTGAAT ATGTAAATCC CCGATTTACT GAAGTTACAG GATACAGCTT TGATGAGGTA AAGGGAAAAA GCCTTTTTTT CATGGAGTTA GAAGAAAATG GCGAAGAAAA ATGCAAAGAT ATGATTCAGG CAGTGTTTTC AGGCAAGGAA TGGAGAAGTG AAATCAGGCA TAAATCAAAA AACAATGTGG TATTTTGGGA ATATGCATAT TATTCTCCGA TAATCAACTC GGAAGGTGTC GTAACTAATT ATCTTAAAGT TTCCGAAGAT ATTACTGAAA GAAAGTTAAT GTCGGAAAAC CTTTTCAAGG CAAAAGAAGC TGCCGAGGCT GCAAACAGGG CAAAAAGCGA ATTTCTTGCA AACATGAGCC ATGAAATACG GACTCCCTTA AACGGAATAA TCGGGATGAC CAACCTTACT TTACAGACAG AATTAACTGA TGAGCAAAGG GAAAACCTCA ATATTGTAAA TTCATGTGCG GAACTGCTTC TTAGGGTTAT AAACGATATT TTGGACTATT CCAAGATAGA AGCCGGCAAA ATGACTTTAG AAAATGTCAA GTTTGATTTC TTCAACCTTT TGGAAAAGAC ATACAAAGCC CATATTGTAC AGGCAAACGA AAAAGGACTG AGATTAAGCT ACACTGTGCA AAAAGGCATA CCGAGGATTT TGGTGGGGGA TCCCGGACGG CTTCAGCAGG TTTTAAACAA CCTGATTTCC AATGCCGTAA AATTTACTGA TATCGGCGAA GTAAGAATAG ATGTGGATAT TATTGAAAAA AGAGATGATT CTGTAAAGTT GAAATTTACT GTGTCAGATA CGGGAATAGG TATAGACGGT GACAAGATGA ACATGCTGTT TAAAAGTTTC AGCCAGGTTG ACAGCTCCAT TACCCGAAAA 
TATGGAGGTA CAGGCTTGGG ACTTGCAATT TCAAAACAGC TTGTAGAGAT GATGGGTGGC GAAATCTGGG TAGAGAGTCA AAAAGGGAAA GGAAGCACTT TCTATTTTAC TGCAGGTTTT AAAAGAAAAG CCAAAAGCAG TGTTAAGGCA GAGTGTACTT CAAATGCCGA TGAGCGGGTA CTTGGGAGGA AACTCAACAT TCTTCTTGCC GAAGATGACA AGGTCAATCA GCAGGTTATA GCAGGCATGC TGAAAAAAGA GCAATGTTCT CTTACAATTG CTGAAAATGG TTTTGAGGCA ATTAAACTCT TCGAAGAAAA TGAGTTTGAC CTTATACTGA TGGATGTCCA AATGCCGGAA ATGGATGGTA TTGAAGCGAC AAAAAGAATA AGAAAAATGG AAGAAGGAAC TTTCAGGCAC ATTCCCATTA TTGCGGTTAC TGCATTTGCT TTTGAGAATG ACAGGGAGAA GATTTTGGAA GCCGGAATGG ATGATTACAT TTCAAAACCC TTTTCCTTTG ATGACATTTA TGCAGCAATA AACAGAGTGC TGTACAAAGA TGAGCAATCG AGTGATGATA TGCAAAAAGA GAAAGAAGAT GAAAAAGGAA GTTTGGATCC GGATGAAAAA GTTAATGAAA CTTTATCTGA AAGTGGAACT GTAAATGAAA CGGGTCCGGA TTATAGTTTG AAAGATATAA TGGAACGACT GGATCTTTCC ATACTGAATG GAAACTTTAC GGCAATTGAC AAGAGTGCTT TGGCAGTAAA AGAATTTGCA CAAAACAGAG GATTTGACAA AGTAAAAAAT CTTGCGTTCA AAATACAACT TTCTGCCAGG AAAAATCAGC TAGAGGATGT CAAAAAACTG TATGAGGTGC TAAAGCAGGA AATCAGCAAA TTTAATTAA
|
Protein sequence | MDCRVKNEVR DDLNVNLEVM VAGNDEEFAA SLKNALLDVG CKHSVKSAAT FNEFKNYLRK NNFHIVFFDC SSDISFADVM ESFENLKKEM PVVAVLEEDG SRHGFEMIKL GACEYLEKKD SSRLRSIIFK RCRELERERE RIRFEKRLKE ENERLLTTLE SIGDGVIVTD KAGCVIMMNK AAQELTGYLS CNSIGKPLSE VFVIINGVNR EPEPDPYERV LSERKTVGLR KNTMLVSKDG TLRYVSASTA PIKSSDREIM GVVVVFRDIT RIRKAEEKLQ KLSQAVEQSP GIIVMADTRG NIEYVNPRFT EVTGYSFDEV KGKSLFFMEL EENGEEKCKD MIQAVFSGKE WRSEIRHKSK NNVVFWEYAY YSPIINSEGV VTNYLKVSED ITERKLMSEN LFKAKEAAEA ANRAKSEFLA NMSHEIRTPL NGIIGMTNLT LQTELTDEQR ENLNIVNSCA ELLLRVINDI LDYSKIEAGK MTLENVKFDF FNLLEKTYKA HIVQANEKGL RLSYTVQKGI PRILVGDPGR LQQVLNNLIS NAVKFTDIGE VRIDVDIIEK RDDSVKLKFT VSDTGIGIDG DKMNMLFKSF SQVDSSITRK YGGTGLGLAI SKQLVEMMGG EIWVESQKGK GSTFYFTAGF KRKAKSSVKA ECTSNADERV LGRKLNILLA EDDKVNQQVI AGMLKKEQCS LTIAENGFEA IKLFEENEFD LILMDVQMPE MDGIEATKRI RKMEEGTFRH IPIIAVTAFA FENDREKILE AGMDDYISKP FSFDDIYAAI NRVLYKDEQS SDDMQKEKED EKGSLDPDEK VNETLSESGT VNETGPDYSL KDIMERLDLS ILNGNFTAID KSALAVKEFA QNRGFDKVKN LAFKIQLSAR KNQLEDVKKL YEVLKQEISK FN
|
| |
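The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in its permitted start codons. The helper names are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G at each position).
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGGATTGCA GAGTAAAAAA CGAGGTGAGG"))  # MDCRVKNEVR
```

The output matches the first ten residues of the protein sequence in the record. Applied to the full gene sequence, the same function should stop at the terminal TAA codon.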