Gene Information |
Locus tag | Tmz1t_3563 |
Symbol | |
ID | 7873069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3905964 |
End bp | 3909035 |
Gene Length | 3072 bp |
Protein Length | 1023 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643700504 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_002890534 |
Protein GI | 237654220 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
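
The coordinate and length fields above agree with one another, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the stated gene length) and that the protein length excludes the stop codon. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 3905964
end_bp = 3909035

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the script reproduces the 3072 bp gene length and the 1023 aa protein length listed in the record.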
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCACG CTCCCGCCGC CGATGCCGCC ACCCGCGGCC GCCATGCCGG CCTGCGCCTC GCCTTTCCGC TCATCGCCCT CACCGTGCTG GTGGGCGGCG TGGGTGCCTT CGGCTACCGC TCGCTGAGCG AGGAGATCCG CCGCGAGACC CAGCGCACGC TGGCGGTCAT CGCCGAGCAG AAGCGCCAGC ACATCGAAGG CTGGCTCGCC GAGGCGCGCG TGGATGTGCG GATGGTCTTC ACCGGCCATT CGCAGCTCGA GGCCTTGTTC GAGGCCTGGC TGGACGGCGG CCGCCGCGAC GATGCGGCGT TCGCGCGCAT GCGGGCGCTC GTCGAGGAGC TCGCCCGCCT GCGCGGCTGG CAGGGCGTCG CCCTGCTCGA CGCCGACGGC GCGCCTACGC TCGCGGTCGG CAGCCCCGAC CTCTCGGCCC ATGCCGAACT GATCGCCGAC GTGCTGCGGC AGCCGCGCGT CGAGCTCGTC GACCTGCACC AGGACGCCCG GGGGCGCACG CATTACGGCG TGCTCGCGCC GGTGGGCGCG CCGTCGCGCG GTGTGGCCTA CCTGACCTGG GAGGCCGAGA CCGCGCTGTA TCCGATGGTC GAGGCCTGGC CGGTGCCCAC GCGCAGCGCC GAGACTTACC TGGTGCGGCG CGAGGGCGAG GGCGTGCGCT TCCTGACCCC GCTGCGCCAT CAGGCCGATG CGGCGCTCGC GCTCGAGCGC CCGCTCGCCA CTCCCGATCT GCCGGCCGCG CGCGCGGCGC TCGGCGAGCG CGGCATCCTC TCCGGCGGGC GCGACTACCG CGGCGTGCCG GTACTGGCCT ACGCCGCCGC GGTCGAGGGC ACGCCCTGGC TGATGCTCGC CGAGATCGAC GAGCGCGAGG CCTACTCGGG CATCCGCACG CTGACCTGGG GCATGGGCGT GGTGATGGCG CTCGGCCTCA TGCTGGTGTA CTCGGGCGGC TACCTGGTGT GGCGGCGCGA CCGCGAGCAG CGCGAGCGCG CCACCCTGCA GGCCCGCCAG ATCGCCGAGG CGCGCTTTCG GGTGATCTTC GAGCAGGCGC CGCTGGGCGT GGTGCTGCTC GACCCGCGCA CACGGCGGAT CACCGAGGCC AACCCGCGCT TCGCCGCGAT CGTCGGCCGC AGCGTCGGCG AGCTGGTCGG CGTCGACCCG ATCGCGCTCA CTCATCCGGA TGACGTCGCC GAGAGCCTGC GCCAGCTCGG CCGCCTCGAC GCCGGGCGCA TCGCCGGCTA CCGCCTCAAC AAGCGCTACC TGCGCCCGGA CGGCACGCCG GTGTGGGTGA GCCTGGCCTT CGCGCCGGTG CAGGTGGCCT CGGAGGACGC ACCGCGCTAC CTCGGCATCG TCGAGGACAT CTCCGCGCGC ATCGAGATGG AGGAGCGCCT GCGCGAGGCC TCGGCCGCGG CTGCCGCGGC CAACGCCGCC AAGAGCGAGT TCCTCGCCCA CATGAGCCAC GAGATCCGCA CGCCGATGAA CGCGGTGCTC GGCCTCGCCC AGGTCCTCGA GCGCGAGCCG CTCGCGCCTG CGCAGCGCGA CATGGTCGGG CGCATCCGCG GCGCCGGCGC TTCGCTGCTG GCGATCCTCG ACGACGTGCT CGACCTCTCC AAGATCGAGG CCGGCCAGCT GCGCATCGAG CCGCGGCCCT TCGACCTGCG CGCGCTGCTC GCCAATCTCG ACAGCCTGAT GGGCCAGGCC GCGCGTGCGA AGGGGCTGGC GCTGCGCATC GAGCCGCCGG CGCTGCCGCC CGGGCAGCTG CGCGGCGATG GGCTGCGCAT CGAGCAGATC 
CTCATCAACC TGGTCAGCAA CGCGATCAAG TTCACCGAGC GTGGCGAGGT TTCCCTGCGC GTGCGCGCCG ACGAGGTGGG GGATGTGCGA CTGCGCCTGC GTGCGGAGGT GCGCGACACC GGCATCGGCA TCGCGCCCGA GGCGCAGGCG CGCCTGTTCG CCCCCTTCAC CCAGGCCGAT GCCGGCATCG CGCGGCGCTT CGGCGGCACC GGGCTGGGCC TGTCGATCTG CAAGCGCCTG GTCGAGCTGA TGGGGGGCGC GATCGGTGTG CATAGCCAGC CCGGGCTGGG CAGCACCTTC TGGTTCGAGC TGCCGCTGGA GCGGGTTGCC GGCGGCGAGC CGGCGAGCGT CGGAGTGGTC GCGGCGCCGG AAGGCGATCG CGCGGCCGGG CCGCGGCTGG CCGGCATGCA GGTGCTGGTG GTGGACGACA GCGCGATGAA CCGCGATCTG GTGCAAGGCG CGCTGGCGCT GGAGGGCGCG TGCGCCACCC TCGCCGCCGA CGGCCAGCAG GCCATCGAGT TGCTGCGTGG CCGGCCGCAG GCCTTCGACG CGGTGCTGAT GGACGTGCAG ATGCCGGTGC TCGACGGCCT CTCCGCCACC CGCCGCATCC GCGACGAGCT CGGCCTCGCC GCGCTGCCCG TGATCGCCTT CACCGCCGGC GTGGGCGAGG ATCAGCAGGC CGCCGCGCGC GCCGCCGGCG CCGACGACGT GCTGCCCAAG CCGATGGACC TGGAGCAGAT GACGCAGCTG CTGATGCGCT GGGTAATGCC GCAGTCGGCT GCGGGCCTGG CCGAAGCGAC GCCCGCGGCC GGCGCGGGTG ATCGGCCTGC GCACGCGGTC GCTGCCGCAC CCATGCCGGC CGCCGCGCCC GTGCCCCCGC CCGCGTCGCC GGCGCAGAGC GCAGCAGCGC CGCCTGCCGC GGCTGGCGAC GATTTCCCCG CGCTGCCCGG CATCGACCGC GAGCGTGCGA TGCAGCGCCT CGGCAAGGAT CGCGACATGT TCATCGGCCT GCTCGGGCTC TTCATCGAGG ACAACGCCGG GGTGGTGGCG GCGACCCGCG CCGATCTCGC ACGCGGCGAG CGCGAATCGG CGGCGCGCCG CATGCACACG CTGCGCAGCA ACGCCGGCTT CATCTGCGCG CTCGCGATCA TGCAGGCGGC CGCAGCGCTG GAGAAGGCAA TCGCGCAGGA CGAGCCCGAC GTGGCGGCGC GCCTGGACGA ACTCGCGGCG GACATCGCCG GGCTGGTGGA GGCCGGCCGT GCGTTCTTGT GA
|
Protein sequence | MNHAPAADAA TRGRHAGLRL AFPLIALTVL VGGVGAFGYR SLSEEIRRET QRTLAVIAEQ KRQHIEGWLA EARVDVRMVF TGHSQLEALF EAWLDGGRRD DAAFARMRAL VEELARLRGW QGVALLDADG APTLAVGSPD LSAHAELIAD VLRQPRVELV DLHQDARGRT HYGVLAPVGA PSRGVAYLTW EAETALYPMV EAWPVPTRSA ETYLVRREGE GVRFLTPLRH QADAALALER PLATPDLPAA RAALGERGIL SGGRDYRGVP VLAYAAAVEG TPWLMLAEID EREAYSGIRT LTWGMGVVMA LGLMLVYSGG YLVWRRDREQ RERATLQARQ IAEARFRVIF EQAPLGVVLL DPRTRRITEA NPRFAAIVGR SVGELVGVDP IALTHPDDVA ESLRQLGRLD AGRIAGYRLN KRYLRPDGTP VWVSLAFAPV QVASEDAPRY LGIVEDISAR IEMEERLREA SAAAAAANAA KSEFLAHMSH EIRTPMNAVL GLAQVLEREP LAPAQRDMVG RIRGAGASLL AILDDVLDLS KIEAGQLRIE PRPFDLRALL ANLDSLMGQA ARAKGLALRI EPPALPPGQL RGDGLRIEQI LINLVSNAIK FTERGEVSLR VRADEVGDVR LRLRAEVRDT GIGIAPEAQA RLFAPFTQAD AGIARRFGGT GLGLSICKRL VELMGGAIGV HSQPGLGSTF WFELPLERVA GGEPASVGVV AAPEGDRAAG PRLAGMQVLV VDDSAMNRDL VQGALALEGA CATLAADGQQ AIELLRGRPQ AFDAVLMDVQ MPVLDGLSAT RRIRDELGLA ALPVIAFTAG VGEDQQAAAR AAGADDVLPK PMDLEQMTQL LMRWVMPQSA AGLAEATPAA GAGDRPAHAV AAAPMPAAAP VPPPASPAQS AAAPPAAAGD DFPALPGIDR ERAMQRLGKD RDMFIGLLGL FIEDNAGVVA ATRADLARGE RESAARRMHT LRSNAGFICA LAIMQAAAAL EKAIAQDEPD VAARLDELAA DIAGLVEAGR AFL
|
| |
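
The sequences above are displayed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that grouping and translate the opening of the gene sequence. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ only in which codons may act as start codons). The helper names `clean_sequence` and `translate` are hypothetical, and only the first 30 bases of the record are used here for brevity.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for display grouping and uppercase the result."""
    return "".join(raw.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence in the record.
excerpt = clean_sequence("ATGAACCACG CTCCCGCCGC CGATGCCGCC")
peptide = translate(excerpt)
gc_fraction = sum(base in "GC" for base in excerpt) / len(excerpt)

print(peptide)
print(round(gc_fraction, 3))
```

The ten codons in this excerpt translate to MNHAPAADAA, which matches the first block of the protein sequence in the record. Passing the full gene sequence through the same two helpers would allow the whole translation and the overall GC content to be compared against the listed values.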