Gene Information |
Locus tag | Tmz1t_2831 |
Symbol | |
ID | 7873239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3064182 |
End bp | 3068405 |
Gene Length | 4224 bp |
Protein Length | 1407 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643699752 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_002889807 |
Protein GI | 237653493 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
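
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon. The record does not state either assumption, but both are consistent with the listed gene and protein lengths. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3064182
end_bp = 3068405
listed_gene_length_bp = 4224
listed_protein_length_aa = 1407

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS of N bases holds N // 3 codons. Assumption: the final codon is a stop
# codon and does not contribute an amino acid.
codon_count, leftover_bases = divmod(gene_length_bp, 3)
expected_protein_length_aa = codon_count - 1

print("gene length (bp):", gene_length_bp)
print("leftover bases after splitting into codons:", leftover_bases)
print("expected protein length (aa):", expected_protein_length_aa)
```

If the computed values differ from the listed ones for another record, the usual suspects are a 0-based coordinate convention, a spliced gene, or a pseudogene. None of these applies to this entry according to the table.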
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.160519 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCC CGACCTCGAG CCCGACTCCC CCCGCAAGCC CGATGGGCCG CTCTTTCGGC GCGCGGCTGC TGCTCGTGCT GGCGCTGTCG GTGGCGCTGA TCGGCGCCGG CGGCCGCTGG CATTACCACA ACCAGCAGGC CGAGGCCCGC CACTTTGCCG CGGAAACGCT CATCCAGATC GCAGACACGA AGGTCGAGCA GATCGCCCAA TGGATGACGG AACGGTACGC CGATGCCGAG GCCATGCAGG ATGTGCCGCA GGCGGCCCGC TACCTGCAGA GGCCGGACGA CGCCCAGGCC CTCGAGAGTG TGCAGGCATG GATGAGCGGA ATCCAGCAGC GTCACGGTTA CCGCGCGGTG GCCTTGTTCG ATGCGGCCGG CGAGCGGTGC CTGGCGGTTC CGGAAACGTA TTGCGGCGAT GCCGCGAACC TTGCGCATGG CCGCGAGCAT GTTCGGCGCA ATCGGTTGGC GCGCGAGGTC TCCTTCAGCG ACCTGCATCG CCTGCCCGAC CAGTCCATCC AGATGATCTT CGCCGCGCAG GTCGGCTTGT CATCCCAGGC GGGGGAGCCG GCAATCGGCA TGCTGTTGGT CGTCATCGAC CCGCGCCGCT TTCTGTACCC CCTGATCCAG CGCTGGCCCA CCCCCAGCGC CAGCGCGGAA ACCATTCTCA TCCGGCGCGA AGGCGACGAG CTGGTTTCTC TGAACGACCT GCGCCACCGC GCCGATTCGG CGCTCAGGCT GCGGCTGCCG ATCGCGACAA ACCCGCTCCT TCCGGCGGCG ATGGCGGTGC AAGGGGTGCA GGGCGTGGTC GAAGGCGTGG ACTATCGCGG CACCCCCGTG CTCGCCGTCG TTCGCAGGGT CATGAACACG CCCTGGTTCA TGGTCGCCAA GGTGGATGAG GACGAGATCC ATGCGCCGGT GCGTCGGCAG GCCTGGACCA CCGGGCTGCT GAGCGTTCTG CTGATGCTGC TGGCCAGCCT CGGGGTCGCT CTGCTGTGGC GCCAGCAGCG GCTGGTGAGC GCACAGCAGG CGGCCGCCGT GCTGCGCGAA AGCGAAAACC GGTTGAGGAA GGCTCTGGAT GTCCAGAACG TGGGCGTGAT GTTCTGGGAC GTGACCTCCG GCACCCTGAT CGATGCCAAC GACACCTTTC TGAAGATCAT GGGTTACAGC CGGCAGGAGG TGGACGCGCA CCAGTTGACC TGGCAAACGT TCACGCCGCC GGAATACATC GAGACGAGCA TCGCGGAGAT GGAGTCGTTC CGGGCGAACG GCCGCGTCGG CCCGTACGAG AAGGAATACT TGCGCAAGGA TGGCTCGCGG CAATGGTTCG TGTTTGCCGG CAGCGCGCTC GACGAGAACA GGTGCGTGGA ATACTGCATC GACGTTTCCG GCCAGAAGCG GGCGGAATCG GCACTCGCCG AAAGCGAAGA AAGACTGCGG CTCGCCTTGC AGGCCACGAA CCTCGGCCTG TATGACCTGA ACGTCCAGTC CGGCGAGGCA CTCGTCAACG CGGAGTACGC CGCGATGCTG GGGTATGCCC CCGAAACCTT TGTCGAAACC AATGCAGCCT GGATCGAGCG CATTCATCCG GACGACAGGG AGATCACGGC ACGCGCCTAC GCGGACTACG TGGCGGGAAG AACTGCGCAG TATCGGGTGG AATTCCGTCA GAAAACGCGC GACGGGGACT GGAAGTGGAT TCTTTCGCTC GGCAAGATCG TCGCATTCGA CGATGCCGGA AAGCCCCTCA GGATGCTCGG CACCCATACC GACATCACCG AGCGCAAGCG GGCCGAGGCC 
TCCGTGGCGC AACACGCGCG GGAACTGGAA CGTGGACGCC AGGCACTGCT GAGCGTGCTG CAGGACCAGC GCCGCGCGGA AACTTCGCTG CGCCAGCTGG CGCTGGCGGT GGAGCAGAGC CCGGAGAGCA TTGTCATCAC GAACCTCTCG GCCGAGATCG AATACGTCAA TGCCGCGTTT CTGGCCACCA CAGGCTACAC GCGCGAGCAG GTCATCGGCA GGAACCCGCG CATTCTGCAA TCCGGCAGGA CGCCGCGCGC GACCTACGAC GCGATCTGGG CGGCGCTCAC GGCCGAGCAG ACGTGGAAGG GCGAACTGGT CAACCGCAAG GCCAGCGGCG AGGACTATGT CGAGTTCGCC CATATCGCAC CGCTACGGCA GCCGGATGGC CGTATCAGCC ATTACGTCGC GGTGAAGGAG GACATCACCG AGAAGAAGCG CCTCGGCGAG GAACTGGATC GCCACCGCCA CCACCTCGAA GACCTGGTGA CGCAGCGCAC CGCCGAACTG GCGCAAGCGC GACAGCAGGC CGAGGCCGCC AGTCAGGCCA AGAGCGCCTT CCTGGCCAAC ATGAGCCACG AGATCCGCAC GCCGATGAAC GCCATCATCG GACTGACTCA TCTGCTGCGA AAGGATGGTG TCACTCCGCA GCAGGACGGG CGGCTGGAGC AGATCGAGGC ATCGGGCCGG CATCTGCTCG GCCTGATCAA CGACATCCTC GACCTCTCCA AGATCGAGGC CGGCAAACTC GATCTGGCGC TGGAAGATTT TCACCTCTCG GCGGTGCTCG ACCACGTCGC CTCGTTGATT CGCCCGTCCG CCCAATCCAA GGGCTTGCAC ATCGAGCTCG ACGGCGACGC CGTGCCGACG TGGTTGCGTG GCGACCCGAT GCGCCTGCGC CAATGTCTGT TCAACCTGGC TGGCAATGCG GTCAAGTTTA CCGAAAGGGG CAGCATCGTC CTGCGTGCCA AGCTGCTGAA GGATGGCGCG GAGGGCTTGC AGGTGCGCTT TGAAGTCGAG GACACGGGGA TTGGCGTGAC GCCGGCGCAG GGGCAGCGCC TGTTCCACGT GTTCCAGCAG GCCGAAGTCG GCACCACGCG CAAATACGGC GGCACCGGTC TCGGTCTGGC GCTGACGCGC AACCTGGCGC AGATGATGGG CGGCGAGGCG GGGATGGATA GCATCCCGGG CGAGGGCAGC ACCTTCTGGT TTACGGTGCT GTTGCAGCGC GGCCACGGCA TCATGCCCGC GCCAACGGCG CGGCGCGCCG ATGCCGAGCG GGCGCTGCGC CATGAGCAGG CGGGCACGCG GGTGTTGCTG GTGGAAGACA ACCCCATCAA CCGGCTGGTG GCATTGGAGC TGCTGCATGC CGTCGGTTTG ACGGTGGAGA CGGCCGAGGA CGGCGCCGAG GCCCTGGAGC GGGTCAAGGC CGCCGACTAC GCGCTGATCC TGATGGACAT GCAGATGCCG GTGATGGATG GCCTGGACGC GACCCGCGCC ATCCGCGCCC TGCCGGGCTG GCGCGACAAG CCGATTCTGG CAATGACCGC CAATGCCTTC GATGACGATC GCCGCACCTG CGTGGAGGCC GGCATGAACG ATTTCATCGC CAAGCCCGTG GAGCCGGAGC GGCTCTACGC CACCCTGCAC AAGTGGCTGC CGGGGCGAAC CGGGAAGTTG CCTCCGTCGG CGTCCGACGA GTCGGACTTC GCACTTGATG AAGGCGACCA CCCGTGCAGC GTTGCAGGTG CTGGCGTGAT GGCGCCGCAG GCCGACCTGG CTCCGCTGGC CGCAGCGCCT GCGGAGGCCG 
CCGATGCCGA GCTGCGCACG CGGCTGTCGG CGATTGGCGA TCTGGATCTG GAAAGTGGCC TGCGCACCTT GGGCGGTTCG TGGCTGGACT ACCCGCCTAT CCTGGGGCTC TTCGCCGAAT ACCATGGCGA CGATGCACGC AAACTCGCCG AACAGATCCA GCAAGACGAT CTTGCCGGCG CGAGGCGCCT GGCTCACGCG CTGAAGGGTG CGGCCGGCAC GGTCGGCGCC ACGATCGTCC ATCGACTCGC CGGCGACCTC GAGGACGCGC TCAAGCGCGG CGACCGTGCG GAGGTGCAGG CGGCGCTGGT GCCGCTGACC GAACGTCTGC CCAGGCTGAT TGCCGCGCTA CAGGATGCTC TGGTGCGTGC GAATCGGGTA ATGGTGCAGG AGATGCCTGC AAGACGGGTG GACACCAGCG TGCTGGTGCG ACTCGAGGCC TTGCTGACCG GGGACGACAC GACGGCGATC AGCTTCCTTG CGGAGAATCG GCAGGCCCTG CGCGAGACCT TGCTGATGGA CTTTGAAGAG GTTCGACGCC AGATCGAGGT GTTCGACTTT CCGGGGGCAT TGGAAAGCGT GCGCGCGGCG CTTGCGGCCT TGGCGAGCAA GTAG
|
Protein sequence | MSTPTSSPTP PASPMGRSFG ARLLLVLALS VALIGAGGRW HYHNQQAEAR HFAAETLIQI ADTKVEQIAQ WMTERYADAE AMQDVPQAAR YLQRPDDAQA LESVQAWMSG IQQRHGYRAV ALFDAAGERC LAVPETYCGD AANLAHGREH VRRNRLAREV SFSDLHRLPD QSIQMIFAAQ VGLSSQAGEP AIGMLLVVID PRRFLYPLIQ RWPTPSASAE TILIRREGDE LVSLNDLRHR ADSALRLRLP IATNPLLPAA MAVQGVQGVV EGVDYRGTPV LAVVRRVMNT PWFMVAKVDE DEIHAPVRRQ AWTTGLLSVL LMLLASLGVA LLWRQQRLVS AQQAAAVLRE SENRLRKALD VQNVGVMFWD VTSGTLIDAN DTFLKIMGYS RQEVDAHQLT WQTFTPPEYI ETSIAEMESF RANGRVGPYE KEYLRKDGSR QWFVFAGSAL DENRCVEYCI DVSGQKRAES ALAESEERLR LALQATNLGL YDLNVQSGEA LVNAEYAAML GYAPETFVET NAAWIERIHP DDREITARAY ADYVAGRTAQ YRVEFRQKTR DGDWKWILSL GKIVAFDDAG KPLRMLGTHT DITERKRAEA SVAQHARELE RGRQALLSVL QDQRRAETSL RQLALAVEQS PESIVITNLS AEIEYVNAAF LATTGYTREQ VIGRNPRILQ SGRTPRATYD AIWAALTAEQ TWKGELVNRK ASGEDYVEFA HIAPLRQPDG RISHYVAVKE DITEKKRLGE ELDRHRHHLE DLVTQRTAEL AQARQQAEAA SQAKSAFLAN MSHEIRTPMN AIIGLTHLLR KDGVTPQQDG RLEQIEASGR HLLGLINDIL DLSKIEAGKL DLALEDFHLS AVLDHVASLI RPSAQSKGLH IELDGDAVPT WLRGDPMRLR QCLFNLAGNA VKFTERGSIV LRAKLLKDGA EGLQVRFEVE DTGIGVTPAQ GQRLFHVFQQ AEVGTTRKYG GTGLGLALTR NLAQMMGGEA GMDSIPGEGS TFWFTVLLQR GHGIMPAPTA RRADAERALR HEQAGTRVLL VEDNPINRLV ALELLHAVGL TVETAEDGAE ALERVKAADY ALILMDMQMP VMDGLDATRA IRALPGWRDK PILAMTANAF DDDRRTCVEA GMNDFIAKPV EPERLYATLH KWLPGRTGKL PPSASDESDF ALDEGDHPCS VAGAGVMAPQ ADLAPLAAAP AEAADAELRT RLSAIGDLDL ESGLRTLGGS WLDYPPILGL FAEYHGDDAR KLAEQIQQDD LAGARRLAHA LKGAAGTVGA TIVHRLAGDL EDALKRGDRA EVQAALVPLT ERLPRLIAAL QDALVRANRV MVQEMPARRV DTSVLVRLEA LLTGDDTTAI SFLAENRQAL RETLLMDFEE VRRQIEVFDF PGALESVRAA LAALASK
|
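
The sequences above are displayed in space-separated blocks of ten characters, so they need light cleaning before any analysis. The sketch below shows one way to strip the display spacing, compute a GC fraction (for comparison with the 66% listed in the table), and split a sequence into codons. It runs only on the first 30 bases and last 12 bases of the gene sequence as a demonstration. The function names are hypothetical, and `PARTIAL_CODON_MAP` deliberately contains only the codons that occur in the demo prefix; it is not the full genetic code for translation table 11.

```python
# Stop codons under NCBI translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}

# Partial codon map: ONLY the codons present in the demo prefix below.
# This is an illustrative subset, not a complete translation table.
PARTIAL_CODON_MAP = {
    "ATG": "M", "AGC": "S", "ACC": "T", "CCG": "P",
    "TCG": "S", "ACT": "T", "CCC": "P",
}


def clean_sequence(raw):
    """Remove display whitespace (spaces, line breaks) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def split_codons(seq):
    """Split into complete codons from the first base; trailing bases are dropped."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def translate_with_partial_map(seq):
    """Translate using the partial map; codons missing from the map become 'X'."""
    return "".join(PARTIAL_CODON_MAP.get(c, "X") for c in split_codons(seq))


# First three display blocks of the gene sequence in this record.
demo_prefix = clean_sequence("ATGAGCACCC CGACCTCGAG CCCGACTCCC")
# Last twelve bases of the gene sequence in this record.
demo_suffix = clean_sequence("GCGAGCAA GTAG")

prefix_protein = translate_with_partial_map(demo_prefix)
last_codon = split_codons(demo_suffix)[-1]

print("prefix length:", len(demo_prefix))
print("prefix GC fraction:", gc_fraction(demo_prefix))
print("prefix translation:", prefix_protein)
print("last codon is a stop codon:", last_codon in STOP_CODONS)
```

To check the whole record, pass the full gene sequence text to `clean_sequence` and feed the result to `gc_fraction`. A full translation would need a complete codon table, for example from a library such as Biopython, rather than the partial map used here.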