Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1941 |
Symbol | |
ID | 7084409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2182851 |
End bp | 2184395 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643698966 |
Product | histidine kinase |
Protein accession | YP_002355588 |
Protein GI | 217970354 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.531292 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCTCGA TCCGCCGTCG CACCTTACTG CTCGTGCTCG GCCTGCTCGG CGTGGCGCTG GGTGCGATCT CCTACCTCGG TTATCGCGAT GCCCGCCACG AGGTGCGCGA ACTCTTCGAT GCCCGTCTCG CCCAGCAGGC GCGCCTGCTC GCGGGCATGA TCCCGGGCGG GATGGCACCG GATGCACGTG CGGCGCTGCA GGACGCGCTC GATGCGGGGG GGCTCGGTGC GGCCGTGGGG GAGGGGACGG ACGCGGAACG CGAGGACGGT GACGAGGCGC TGCCGCTGGG CCACGAGTAC GAGGGCAAGC TCGGCTTCGT GGTGGTCGAC GACCAGGGGC TGGCCTTGCT GCAGTCCGCC GCGGCGCCGA TGGGTGCGCT CGACCTGCTG CTCGGCGCAC GCACCAGGGG CGCCGAGCAG CAGGGGCTCG GCGAGGTGGG AGAGGATCTT GCGGGCTATC ACACGGTGAG TCTGCACGAT GGCGCCTGGC GGCTCTTCCT GCTCCGGGAT GCGCGCGACC GCCAGTGGAT CCTGGTCGGC GAGCGCGAGG ACGTGCGGGG AGAACTCCTG GGCAGGATCA CCTTGCGCAG CGTGCTGCCC GATCTCGTCG GGTTGCCGTT GGTCGCCGTG CTGGTGTGGC TGGCGATCGG CTGGGGCCTG CGCCCGCTCG CGCGCATCGT CGAATCCTTG CAGGCGCGCG GGCCGGACGA CCTCTCCGCC CTCGCGCTGC AGGACGTGCC GCAGGAGCTC GAGCCGATGG TGGCCGCGCT CGACCGCCTG CTGCATCAGG TCAACGAGCT GCTCGAACGC GAGCGCCGCT TCCTCGCCTA TGCCGCGCAC GAGCTGCGCA CGCCGCTGGC CGTGCTGCGC ATCCATGCCC AGAATGCGCT GCAGGCGCCT GATCCGGCCG ATCGCGAGGA GGCGCTCCGG CTGCTGGGCT CGGGCATCGA GCGTGCCACC CGGGTGGTGG CGCAGTTGCT GACGCTGGCC CGCCTCGAAC CCGACGCGAG CCGGCCCAAG AGGCTGCCGA TCGAGCTGCT CGCGCTTGTC CGCGAGCAGC TCGCCGAGCT GACCCCGCTC GCCGACGAAC ATGGTCAGGA CCTCGCCCTC GAGGCGGACG AGGGGGCCGA CTTCCACCTG CTCGGCGATG CCGGCAGCCT GGGCATCCTG ATGCAGAATC TGGTGGGCAA CGCGGTGCGG CACACGCCGC CCGACGGCTG CATCCGCGTG CTGCTTGAGG CCACGCCCGC AGCCATCGTG CTGCGGGTGC AGGACAGCGG CCATGGCGTG CCGCCGGAGC TGCGCGAGAA GGTGTTCGAG CGCTTCTTCC GCGCCGGTGG CGGGCAGGGG GCGGGTCTCG GGCTGGCGAT CGTCGCGCGC ATCGTCGAAC TGCATGGCGG CACGATCGCG CTCGACGGCT GTGCGCTCGG CGGGCTGGAG GTGCGGGTGG TGCTGCCGCG GGATGCCGCC GCGCCGCGCC GGGTGCAGGG CGATGAGGGG AAGGTGCCGC CGTGCTCCGC GTCCGGCGGA GCGGCCCGCC CTTAA
|
Protein sequence | MGSIRRRTLL LVLGLLGVAL GAISYLGYRD ARHEVRELFD ARLAQQARLL AGMIPGGMAP DARAALQDAL DAGGLGAAVG EGTDAEREDG DEALPLGHEY EGKLGFVVVD DQGLALLQSA AAPMGALDLL LGARTRGAEQ QGLGEVGEDL AGYHTVSLHD GAWRLFLLRD ARDRQWILVG EREDVRGELL GRITLRSVLP DLVGLPLVAV LVWLAIGWGL RPLARIVESL QARGPDDLSA LALQDVPQEL EPMVAALDRL LHQVNELLER ERRFLAYAAH ELRTPLAVLR IHAQNALQAP DPADREEALR LLGSGIERAT RVVAQLLTLA RLEPDASRPK RLPIELLALV REQLAELTPL ADEHGQDLAL EADEGADFHL LGDAGSLGIL MQNLVGNAVR HTPPDGCIRV LLEATPAAIV LRVQDSGHGV PPELREKVFE RFFRAGGGQG AGLGLAIVAR IVELHGGTIA LDGCALGGLE VRVVLPRDAA APRRVQGDEG KVPPCSASGG AARP
|
| |