Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0511 |
Symbol | |
ID | 4808311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 623795 |
End bp | 625117 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640105924 |
Product | histidine kinase |
Protein accession | YP_001036941 |
Protein GI | 125973031 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000775208 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATAA AGAAAGATTT AAAAAAGCCA TATCCACTTA AATTAAAAAA ATCCGGAAGA AAGCTGCTTG CTGAAAGATA TTACCAAATT CTTAGAAGAA AATATAATTG CCTTTATGAG ATTTTTAACA GTATTGAACT TCCGATAGTA AGTATATCCT ATCCGAGTTT TAATATAATT GGAATTAATA ATAAAGCCAA GAAAGATGTA AAGCTTTTGC GTAAATGTGT TGCATTTGTA AGAGAGGTTA AAAAAGGTAA TAATATATTA AATGCTTTAT CAAAAGTATA TTCTTGTCGA AACCAAAAAT ATTTAGCTGA AATGCTTTTT ACAAAGTCGC CGGTATATCT TCCTAATGTT GAAATTGATA ATAATGGCAG AAAAATTTAT TATAAATTAA TGCACCAGCC GATTTTAAAT TCTCGTAATA AAATAGTAGG TTTTCTGATA ATTCCCATTG ATATAACCCA GGAGATAGAA CAGAAAAAAT ATATAGAAAA TCTGGCCAGA TTCAAAGATG AGTTTTTGTA CGGCATAACC CATGAGTTCA AAACTCCTCT GGTAGTTATA AGCTCGGCTT TGCAGGCAAT AGAGGCTCTT TGCAGAGATG AGCTTACCGA CAGGGTAAAA AAATACTTAA ACAGGATTAA GCAAAATACC TTCAGGCAAA TAAGGCTTGT AAATAATTTG CTGGATATAG CCAGAATTGA GGCCGGTCAT ATCAAAATAT TCAAGAGGAA TCTGGATATT GTTGCTTTGA CTCGTCTGAT AACCGAGTCA GTATCAATGT TTGCCGATTT AAAAGGTGTA CAGCTCTTGT TTTATTCCAA TATTGATAAA AAGATTATTG CTATTGATGA AGAAAAATAT GAACGGATTA TGCTTAACCT GCTCTCAAAT GCAATTAAAT TTACCCCCAA AGATAGATCG GTATATGTGG AAGTTTCAGC TAAGGAGGGA TGTGTGGAGA TAAAAGTCAA AGACAGTGGA ATAGGGATAC CCAAAGATAA AATCAAGACC ATTTTTGAAA GGTTCGGGCA GGTGGACAGT TCATTGTCAA GAAATGCCGA GGGTACAGGT ATAGGATTGT CCCTGGTTCG CATGTTTGTA AATGCTATGG GCGGAGAAAT TAGTGTGACG AGCGAAGAAA ACATTGGCAG TACTTTTACA GTCACTTTGC CTGATGCCGT AACTGAATGC GGATATGAAA ATGACTGTTT TGACGATTTG AGAGATGAGC GTTTGATACA AGCTATCAAA ATAGAGTTTT CGGATATATA TTTTGAGGAA AAGGAAGACA AAAAGAAAAG CCTTGAAACT TAA
|
Protein sequence | MNIKKDLKKP YPLKLKKSGR KLLAERYYQI LRRKYNCLYE IFNSIELPIV SISYPSFNII GINNKAKKDV KLLRKCVAFV REVKKGNNIL NALSKVYSCR NQKYLAEMLF TKSPVYLPNV EIDNNGRKIY YKLMHQPILN SRNKIVGFLI IPIDITQEIE QKKYIENLAR FKDEFLYGIT HEFKTPLVVI SSALQAIEAL CRDELTDRVK KYLNRIKQNT FRQIRLVNNL LDIARIEAGH IKIFKRNLDI VALTRLITES VSMFADLKGV QLLFYSNIDK KIIAIDEEKY ERIMLNLLSN AIKFTPKDRS VYVEVSAKEG CVEIKVKDSG IGIPKDKIKT IFERFGQVDS SLSRNAEGTG IGLSLVRMFV NAMGGEISVT SEENIGSTFT VTLPDAVTEC GYENDCFDDL RDERLIQAIK IEFSDIYFEE KEDKKKSLET
|
| |