Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1353 |
Symbol | |
ID | 7084474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1499463 |
End bp | 1502552 |
Gene Length | 3090 bp |
Protein Length | 1029 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643698370 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_002355008 |
Protein GI | 217969774 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.283316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGTCC TGCGCTGGTC GCCCGAGGCC GAGCGGGTTT TCGGCTGGTC CGCTGCGGAG GTGCTCGGCA GGCGCCCCAA CGAATGGAGC TTCACCCATC CGGACGACGC GGCGGAGGTC AAGCGCGCGA TCGGCCAGGC CATCACCGCG GACGGTCCCA CACCGCCGGT GATCGGCCGC AACTTCACGC GCGACGGCCG CCTGCTGCAT TGCGAATGGC ACAACCGGGC GAACCGCGAC CCGCAAGGCC GGCTCGTCTC GCTGCTCTCT TTCGCGAAGG ACCTCACCCG ACAGCTCGAG GCCGAGCGCG CGCGCGACCT CAGCGAAGCC CGCTACGCGC ACATCTTCAA CAACAGCCAC GCGGTGATGC TGATCCTCGA CCCGGAGAGC GGTCGCATCC TCGACGCCAA CCCCGCCGCG GAGGAGTTCT ACGGCTGGTC CAGGAAAACG CTGCAGACGA TGCAGATCGG GGACATCAAC ACCCTGTCCC CGCAGGCGCT GCTCGTCGAA CTGAAGGCGG CCCATGCGGA AGAGCGCAAG CACTTCGAGT TCCGCCATCG CCGCGCCGAC GGCTCCGTCC GCGACGTCGA GGTCCACAGC GGACCGACCG AGGACGGCAA CCGCTCCATG GTCTTCTCGA TCGTGCACGA CATCACCGAG CGCAAGCAGG CCGAAGCGTT GTCGCGACGC TGGGAGCGCT TCTTCCGCCT GTCGAACCTC GGCCTCGCGA TGCACGACGT GTCCGACAAC ACGATCATCG ACGTCAATGC CACCTATGCG AGCCAGCACG GCTACTCGAT CGAGGAGCTG CGCGGCATGC GGATCGACGA ACTTTACCCC GAGGACGAGC GCGAGCAACT CCATGCTCAC CTCGCCGAGG CCGATCGCAT CGGCAACGCC AGTTTCGAGA CCGTTCACCT GCGCAAGGAC GGCAGCCGGC TGCCGCTCGT GATCGGAGTG ACCGCCCTGC TCGACGACCG CGGCCGCGCC GTCGCGCGCT TCGCATTCGG ACTCGACATC AGCGCGCGCA AGGCGGCCGA GGATGAGCTG CGCAAGCTGT CGCGCGCCGT CGAGGAGAGC CCGGAGAGCA TCGTCATCAC CAACACCCGG GCCGAGATCG AGTACGTCAA CCAGGCCTTC ATCGACAAGA CCGGCTATTC CCGTGCCGAG GCCATCGGCC AGAACCCGCG CCTGCTGCAG TCCGGACGCA CCACCCCGGC CACCTATGCG GACCTGTGGA ACACGCTCAC CCACGGCCGC TCCTGGCAGG GCGAGTTCTT CAACCGCCGC AAGGACGGCA GCGAGTACCT CGAGCGCGTG ACGATCACGC CGATCCACGA CGAAAGCGGG CACATCACCC ACTACGTCGC GGTCAAGCAG GACATCACCG CGCAACGGCG CATGGAAGAG GAACTGCTGC GCTACCACGA ACACCTCGAG GGCCTCGTCG AGAGCCGGAC GGCCGAACTG CAGCACGCAC TCGATGCGGC CAACATCGCC AGCCGCGCAA AGAGCGAGTT CCTCACCACG ATGAGCCACG AGATCCGCAC GCCGATGAAC GGGGTGATCG GCCTGCTCGA CGTGCTGAGC CATTCGCAGC TCTCCACGGA ACAGGTCGAG ATGGTCGGCA TCATGCGCGA ATCCGCCGAG ACCCTGCTGC GGCTGATCGA CGACATCCTC GACTTTTCCA GGATCGAGTC GGGCAATCTC GAGCTCGACG TCGGCCCTGC ATCGATCCCC GACCTGATCG CGCGTGTCGT CGGCATCCTG CAGACGGTCG CGAACCGCAA GTCGGTCCGG CTGAGTACCC GCATCGACCC CGACGTGCCC GCGGTCGTGC GCACCGACGC GCTGCGCCTG CAGCAGATCC TGGGCAACCT CGTCGGCAAC GCGGTGAAGT TCTCCTCCGG CCTCGACCGC CCCGGCCGCG TCGAGATTCG CGTCGAGACC GCCGGCGCAG GCCGGATCCG CTTCATGGTC ACCGACAACG GCATCGGCAT CGCCCCCGAG GCGATCGAGA AGATCTTCGA CCCCTTCTCG CAAGCCGAAT CCAGCACCAC CCGCCGCTTC GGCGGCAGCG GACTCGGACT GTCGATCTGC ACGCGCCTCG TGCGCATGAT GAAGGGCCGG ATGGAGGTGA ACAGCCTTCC CGGGCGTGGC AGCCGCTTCG TGGTCACACT CCCGCTTGCC GCGACCAACG GCTCGGCCAC GCGCAGCGAG CCAGCCCGCA GCACGGCGCT CGCTCGCCCC TCGCCCACGC CGCCGCTTGC GACACCCGGA GCCGGCGAGT CCGGTCGCCG CATCCTCGTC GCCGAGGACA ACGACATCAA TCGTCGCGTC ATCGCGCGAC AGCTCGCACT CCTCGGGCTG CAATGCGACA CTGCCGAGGA TGGCTTCGAG GCACTCGAGC GCTGGCGGCA GGGTCAATAC AGCCTGCTGC TGACCGATCT CCACATGCCC GGGATGGACG GCTACGAACT CACGGCGAGG ATCCGCAGCG AGGAAGCGCC GGGCCGGCGT ACGCCGATCG TGGCGGTCAC CGCAAACGCC CTGCGCGGAG AGAAGGAGCG CTGCATCGAC GCCGGCATGG ACGACTTCAT CCTCAAGCCG GTACAGGTTG CGGCGCTGCA GGAGGTGCTC GCGCGCTGGC TGCAACCGGA CACCGAGCAG CGGACTCCCG CCGGCACCGC GCCGGCAGCC CCGGCGGCGC CTCCTGCGGT GTTCGATCCG CAGGCCTTGC CGGGACTGAT CGGCAACGAT GCGGCGCTGA TCGCGGAGTT TCTCGGCGAA TACCGGCTCT CCGCATGCGA CACCGTGCGC AGCATCCGCA AGGCGTGCGA GGACGGCGAC TGGCGCCGGG CCGGCGAGCT CGCCCACCGC CTGAAGTCCT CCTCGCGCTC GGTCGGCGCC ATGCAGCTCG GCGAAATCTG CGCCGCGCTC GAACAGGCCG GTCGCGACGA CGATGGCGAC CAAGTGCAGC GCCAGGGCGG GCTGCTCGAG GCGGCCCTGG CAGCCACGCT CACCGCGATG CAAGGCACAC AGGCCGCAGG AGTTCCGCCT GCGCGTGGTG ATACCCCCCG GCCTGGGTAG
|
Protein sequence | MRVLRWSPEA ERVFGWSAAE VLGRRPNEWS FTHPDDAAEV KRAIGQAITA DGPTPPVIGR NFTRDGRLLH CEWHNRANRD PQGRLVSLLS FAKDLTRQLE AERARDLSEA RYAHIFNNSH AVMLILDPES GRILDANPAA EEFYGWSRKT LQTMQIGDIN TLSPQALLVE LKAAHAEERK HFEFRHRRAD GSVRDVEVHS GPTEDGNRSM VFSIVHDITE RKQAEALSRR WERFFRLSNL GLAMHDVSDN TIIDVNATYA SQHGYSIEEL RGMRIDELYP EDEREQLHAH LAEADRIGNA SFETVHLRKD GSRLPLVIGV TALLDDRGRA VARFAFGLDI SARKAAEDEL RKLSRAVEES PESIVITNTR AEIEYVNQAF IDKTGYSRAE AIGQNPRLLQ SGRTTPATYA DLWNTLTHGR SWQGEFFNRR KDGSEYLERV TITPIHDESG HITHYVAVKQ DITAQRRMEE ELLRYHEHLE GLVESRTAEL QHALDAANIA SRAKSEFLTT MSHEIRTPMN GVIGLLDVLS HSQLSTEQVE MVGIMRESAE TLLRLIDDIL DFSRIESGNL ELDVGPASIP DLIARVVGIL QTVANRKSVR LSTRIDPDVP AVVRTDALRL QQILGNLVGN AVKFSSGLDR PGRVEIRVET AGAGRIRFMV TDNGIGIAPE AIEKIFDPFS QAESSTTRRF GGSGLGLSIC TRLVRMMKGR MEVNSLPGRG SRFVVTLPLA ATNGSATRSE PARSTALARP SPTPPLATPG AGESGRRILV AEDNDINRRV IARQLALLGL QCDTAEDGFE ALERWRQGQY SLLLTDLHMP GMDGYELTAR IRSEEAPGRR TPIVAVTANA LRGEKERCID AGMDDFILKP VQVAALQEVL ARWLQPDTEQ RTPAGTAPAA PAAPPAVFDP QALPGLIGND AALIAEFLGE YRLSACDTVR SIRKACEDGD WRRAGELAHR LKSSSRSVGA MQLGEICAAL EQAGRDDDGD QVQRQGGLLE AALAATLTAM QGTQAAGVPP ARGDTPRPG
|
| |