Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2051 |
Symbol | |
ID | 7083811 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2321259 |
End bp | 2324282 |
Gene Length | 3024 bp |
Protein Length | 1007 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643699078 |
Product | Hpt sensor hybrid histidine kinase |
Protein accession | YP_002355695 |
Protein GI | 217970461 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR02956] TMAO reductase sytem sensor TorS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.442026 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCGCCC CATCTTACGT GAAACCCCGC CACGAACGCC GCTTCGGCCT GCGCGAACGC CTGCTGCTGG GGCTGCTCGT CGCCGCGCTC GGCACCGTAC TGGTGGCGGT GGTGGGCTGG GTCAGCTTCC AGCGCGTGGT GAGCAGCCAG CAGGCGATCG TGCGCGACAC CCTGCCGGCC GCCGACGCGC TGCACGAGGC GGTGCGCGCC AACGCCCGCC TCGCGGCGCT TGCGCCGCGG CTGCTGCGGG CGGATTCGGC GGCCGAGCTC GATCAGTTGC GCGCCGCGCT CGGTGTCGAC GTGCCGTTGG TGCGCGACCG CCTGGCGGCG CTGCGCTCGC CGCACGTCGA GACGGAGCTG CGCGAGCGCC TGCTGGCGAC CGGCGACCGC CTCGCCGCCG GCCTCGAAGG CATGAGCGCG GCGATCGACC AACGCCTGCG CCTGCGTGCA GCCCGCCTCG AGCGCATCGA CACGCTGCGC GCGGCGATCG ACGGACTCGA CGAGCTTGCC CGCATCCACG CGGATAACGC CACCGCTCAG CTCGTGTCCA CCCTCACGAC CCTGCTGCAG CCCGACCCCG GGCAGGGCGC GCCGAGCTTG GCCGAGCGCG AGGCTGCGCG CGAGCGTGTG CTCGATCTCG ACATCGATAG CCTCGAACGC ATGCACGAAC TCAACCTCAC CGTGCATGCG CTCGGCTTTC TCATCGGCCG CCTCGACGAG CTCGACTCGG GCGCACGGCT GCAGGCGGCA CGCGTCGAGT TCGCCGCGCA CCTCGCACTG CTGGCGCGCC GGCTCGCGGA TATCGCCGAC CCGGGTGGGC GGGAGCAGGG CGAGCGGGTG CTCCGATTGC TCGCCTCGGC GCTGGACGAG GAGGGTACCT TCGCACTGCG CGCACGCGAG ATCGCCCTGC GCGCGCAGGT GGAGACCTTG CAGGGCGTGG TCGGCGAACT CACCACCGAG CTCGACGCCC TCGCCGGCGA GCTGATCCAC CGCGGCGGCC GCATCCTCGC CGCCGCCGGA TCGGCCGCCG AACGCAGCGC GACCTCGGGT CTGATCGCCT TCGGCGTGAT CGCCGCGGCG CTGCTGCTGG TGACCCTGGG CGTCACCGTC CATGCGCTGC GGCGTCACAC GCTCGGGCGC CTGCGCGCGC TGGAAGAGGC CACCCTCGCA CTCGCGTCCG GCCGACGCGA GGTGGCGATC GACACCGCGG GCGACGACGA GCTGGCCTCG CTCGCGGTCG CGCTCGAACG CTTCCGCGCC AACGCGATCG AGCGCGACCG CTTCGCAGAA GAGCTTCGGC AGCAGCAGCA GGAGCTCGAG AATCAGGTGC TGGCGCGCAC CGCGCAGCTG CGCGAGGCCA ACGCCGCGCT CGCGCGCGAG ACCGCCGACC ACGCTCGCGC CCGTCACGCC GCCGAGCAGG CCGACCGCGC CAAGACCGCC TTCCTCGGCA CCGTCAGCCA CGAGCTGCGC ACCCCGCTCG CGGGTATCCT CGGTCTGCTC GAACTCGTCG AGGACGCCAG CTCGACCACC GAGCGCAAGC AGTACCAGGC GCAGATGCGC GCCGCGGCCG TGCTGCTGCT CGAGTTGCTC GAGGACATGC TCGACTTCGC CCGCATCGAA GCGGGCGGCG TGCAGGTCGA CAAGGCGAGC TTCAGCCTGC GCGACACGGT GAACGATGTC TTCGCGGTGC AGGGCACGCG CGCCGCGGTA CGCGGGCTCG CGCTGATCGC CGAGGTCGAT CCCGCGCTGC CCGACGCGGT GCAGGGCGAC CGCCGCAAGC TCAGCCAGAT CCTGCTCAAC CTGGTCGGCA ACGCGATCAA ATTCAGCGAC ACGGGCGCGG TGACGGTGCG CGTGCGCGCC GGTACGCAAC CCGACCACCT GCACTTCGCC GTCGAGGACC ACGGCATCGG CATCGAGCCC GCGCGCCAGC AGGAGGTCTT CGAGCCCTTC GTGCAGGTGC GCGACAGCGG CCGCCACCAC GCCGGCACCG GCCTCGGGCT GGCGGTGTGC CGCCGCCTGG TCGAGGCCAT GGGCGGCGCG ATCGAGCTGA AGAGCGCGCC GGGGCAGGGC ACCACGGTGA GTTTCGAGAT CCCCCTCCCG GCGACCGCAG CGGCGGCCGT CGAGGCCTTG CCTGCCGCGA CGGAGCCGAA GCGGACCGCG CTGCCCCCCG GCCACCGCGT GCTGGTGGTC GAGGACGACG AGGTCAATCG CATGGTGGTC GAGCGCTTCC TCGACGCCCT CGGTCAGCAG GCGGTGTGTG CGCCCGACAT CCAGAACGCA CTGCGCCTGC TGCAGGCGCG CCCGATCGAC CTCGCCCTGA TCGACATGAA CCTGCCCGAC GGCGACGGCC GCGAGCTGCT CGCCCGCCTG CGCGCGCTGC CGGCGCACGC GCACACTCCG GCCGTGCTGA TGTCGGCCCA CATCCCGCGC CGAGAGGTCG ATGCGCTGCT CGAAGCGGGC TTCGCCGCCT TCCTGTCCAA GCCCTTCGCG CGCGAGCGCC TGCGCGCGCT GCTCGCCGAT CTCCTTGCCG AGCCAGCGGC CGCAGTGCCC ACGCGAGGCA CGGCGGGTGT CGCGGAGGTA CTGCAGGCGA CAGACTGGAT CGACGCCGAT TTCCTGCGTG CGGAGCGGGA GGCGCTGGGC GAGGAGACCG TGGCCGACAT CGTCGGCGTC TTTCGCACGC AGGGCCAGCC GCTGATCGAC GCGCTGCTCG CCGCCGCCCG CGCCGGCGAG CACGAAGCCT GCGCACGCCT CGCGCACAAG CTGCGCGGCG CCGCGGGCAA CGTGGGGGCA GGGCGGCTGG CGGAATGCGC GGGCGCGCTG GAGGAGGCGC TGAAACCGGA CCCGACGGGG GACGCGCCGC GCGTGGGCGC GATCGGCGAC CTGGCCGTGC GCGCGGGCGA GCTGGGCGAG GCCTGGTCGT ACACGCTGCA GGCGCTCGAC GCCTTGCGTT CTGCGGCGGA CGACGCCGAA GCGCAAAGGC CGGATCAGAC CCCGCCAGGC TCGACGTCGG CAGCCAGCCG ATAG
|
Protein sequence | MTAPSYVKPR HERRFGLRER LLLGLLVAAL GTVLVAVVGW VSFQRVVSSQ QAIVRDTLPA ADALHEAVRA NARLAALAPR LLRADSAAEL DQLRAALGVD VPLVRDRLAA LRSPHVETEL RERLLATGDR LAAGLEGMSA AIDQRLRLRA ARLERIDTLR AAIDGLDELA RIHADNATAQ LVSTLTTLLQ PDPGQGAPSL AEREAARERV LDLDIDSLER MHELNLTVHA LGFLIGRLDE LDSGARLQAA RVEFAAHLAL LARRLADIAD PGGREQGERV LRLLASALDE EGTFALRARE IALRAQVETL QGVVGELTTE LDALAGELIH RGGRILAAAG SAAERSATSG LIAFGVIAAA LLLVTLGVTV HALRRHTLGR LRALEEATLA LASGRREVAI DTAGDDELAS LAVALERFRA NAIERDRFAE ELRQQQQELE NQVLARTAQL REANAALARE TADHARARHA AEQADRAKTA FLGTVSHELR TPLAGILGLL ELVEDASSTT ERKQYQAQMR AAAVLLLELL EDMLDFARIE AGGVQVDKAS FSLRDTVNDV FAVQGTRAAV RGLALIAEVD PALPDAVQGD RRKLSQILLN LVGNAIKFSD TGAVTVRVRA GTQPDHLHFA VEDHGIGIEP ARQQEVFEPF VQVRDSGRHH AGTGLGLAVC RRLVEAMGGA IELKSAPGQG TTVSFEIPLP ATAAAAVEAL PAATEPKRTA LPPGHRVLVV EDDEVNRMVV ERFLDALGQQ AVCAPDIQNA LRLLQARPID LALIDMNLPD GDGRELLARL RALPAHAHTP AVLMSAHIPR REVDALLEAG FAAFLSKPFA RERLRALLAD LLAEPAAAVP TRGTAGVAEV LQATDWIDAD FLRAEREALG EETVADIVGV FRTQGQPLID ALLAAARAGE HEACARLAHK LRGAAGNVGA GRLAECAGAL EEALKPDPTG DAPRVGAIGD LAVRAGELGE AWSYTLQALD ALRSAADDAE AQRPDQTPPG STSAASR
|
| |