Gene Tmz1t_2051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2051 
Symbol 
ID7083811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2321259 
End bp2324282 
Gene Length3024 bp 
Protein Length1007 aa 
Translation table11 
GC content73% 
IMG OID643699078 
ProductHpt sensor hybrid histidine kinase 
Protein accessionYP_002355695 
Protein GI217970461 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR02956] TMAO reductase sytem sensor TorS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.442026 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCGCCC CATCTTACGT GAAACCCCGC CACGAACGCC GCTTCGGCCT GCGCGAACGC 
CTGCTGCTGG GGCTGCTCGT CGCCGCGCTC GGCACCGTAC TGGTGGCGGT GGTGGGCTGG
GTCAGCTTCC AGCGCGTGGT GAGCAGCCAG CAGGCGATCG TGCGCGACAC CCTGCCGGCC
GCCGACGCGC TGCACGAGGC GGTGCGCGCC AACGCCCGCC TCGCGGCGCT TGCGCCGCGG
CTGCTGCGGG CGGATTCGGC GGCCGAGCTC GATCAGTTGC GCGCCGCGCT CGGTGTCGAC
GTGCCGTTGG TGCGCGACCG CCTGGCGGCG CTGCGCTCGC CGCACGTCGA GACGGAGCTG
CGCGAGCGCC TGCTGGCGAC CGGCGACCGC CTCGCCGCCG GCCTCGAAGG CATGAGCGCG
GCGATCGACC AACGCCTGCG CCTGCGTGCA GCCCGCCTCG AGCGCATCGA CACGCTGCGC
GCGGCGATCG ACGGACTCGA CGAGCTTGCC CGCATCCACG CGGATAACGC CACCGCTCAG
CTCGTGTCCA CCCTCACGAC CCTGCTGCAG CCCGACCCCG GGCAGGGCGC GCCGAGCTTG
GCCGAGCGCG AGGCTGCGCG CGAGCGTGTG CTCGATCTCG ACATCGATAG CCTCGAACGC
ATGCACGAAC TCAACCTCAC CGTGCATGCG CTCGGCTTTC TCATCGGCCG CCTCGACGAG
CTCGACTCGG GCGCACGGCT GCAGGCGGCA CGCGTCGAGT TCGCCGCGCA CCTCGCACTG
CTGGCGCGCC GGCTCGCGGA TATCGCCGAC CCGGGTGGGC GGGAGCAGGG CGAGCGGGTG
CTCCGATTGC TCGCCTCGGC GCTGGACGAG GAGGGTACCT TCGCACTGCG CGCACGCGAG
ATCGCCCTGC GCGCGCAGGT GGAGACCTTG CAGGGCGTGG TCGGCGAACT CACCACCGAG
CTCGACGCCC TCGCCGGCGA GCTGATCCAC CGCGGCGGCC GCATCCTCGC CGCCGCCGGA
TCGGCCGCCG AACGCAGCGC GACCTCGGGT CTGATCGCCT TCGGCGTGAT CGCCGCGGCG
CTGCTGCTGG TGACCCTGGG CGTCACCGTC CATGCGCTGC GGCGTCACAC GCTCGGGCGC
CTGCGCGCGC TGGAAGAGGC CACCCTCGCA CTCGCGTCCG GCCGACGCGA GGTGGCGATC
GACACCGCGG GCGACGACGA GCTGGCCTCG CTCGCGGTCG CGCTCGAACG CTTCCGCGCC
AACGCGATCG AGCGCGACCG CTTCGCAGAA GAGCTTCGGC AGCAGCAGCA GGAGCTCGAG
AATCAGGTGC TGGCGCGCAC CGCGCAGCTG CGCGAGGCCA ACGCCGCGCT CGCGCGCGAG
ACCGCCGACC ACGCTCGCGC CCGTCACGCC GCCGAGCAGG CCGACCGCGC CAAGACCGCC
TTCCTCGGCA CCGTCAGCCA CGAGCTGCGC ACCCCGCTCG CGGGTATCCT CGGTCTGCTC
GAACTCGTCG AGGACGCCAG CTCGACCACC GAGCGCAAGC AGTACCAGGC GCAGATGCGC
GCCGCGGCCG TGCTGCTGCT CGAGTTGCTC GAGGACATGC TCGACTTCGC CCGCATCGAA
GCGGGCGGCG TGCAGGTCGA CAAGGCGAGC TTCAGCCTGC GCGACACGGT GAACGATGTC
TTCGCGGTGC AGGGCACGCG CGCCGCGGTA CGCGGGCTCG CGCTGATCGC CGAGGTCGAT
CCCGCGCTGC CCGACGCGGT GCAGGGCGAC CGCCGCAAGC TCAGCCAGAT CCTGCTCAAC
CTGGTCGGCA ACGCGATCAA ATTCAGCGAC ACGGGCGCGG TGACGGTGCG CGTGCGCGCC
GGTACGCAAC CCGACCACCT GCACTTCGCC GTCGAGGACC ACGGCATCGG CATCGAGCCC
GCGCGCCAGC AGGAGGTCTT CGAGCCCTTC GTGCAGGTGC GCGACAGCGG CCGCCACCAC
GCCGGCACCG GCCTCGGGCT GGCGGTGTGC CGCCGCCTGG TCGAGGCCAT GGGCGGCGCG
ATCGAGCTGA AGAGCGCGCC GGGGCAGGGC ACCACGGTGA GTTTCGAGAT CCCCCTCCCG
GCGACCGCAG CGGCGGCCGT CGAGGCCTTG CCTGCCGCGA CGGAGCCGAA GCGGACCGCG
CTGCCCCCCG GCCACCGCGT GCTGGTGGTC GAGGACGACG AGGTCAATCG CATGGTGGTC
GAGCGCTTCC TCGACGCCCT CGGTCAGCAG GCGGTGTGTG CGCCCGACAT CCAGAACGCA
CTGCGCCTGC TGCAGGCGCG CCCGATCGAC CTCGCCCTGA TCGACATGAA CCTGCCCGAC
GGCGACGGCC GCGAGCTGCT CGCCCGCCTG CGCGCGCTGC CGGCGCACGC GCACACTCCG
GCCGTGCTGA TGTCGGCCCA CATCCCGCGC CGAGAGGTCG ATGCGCTGCT CGAAGCGGGC
TTCGCCGCCT TCCTGTCCAA GCCCTTCGCG CGCGAGCGCC TGCGCGCGCT GCTCGCCGAT
CTCCTTGCCG AGCCAGCGGC CGCAGTGCCC ACGCGAGGCA CGGCGGGTGT CGCGGAGGTA
CTGCAGGCGA CAGACTGGAT CGACGCCGAT TTCCTGCGTG CGGAGCGGGA GGCGCTGGGC
GAGGAGACCG TGGCCGACAT CGTCGGCGTC TTTCGCACGC AGGGCCAGCC GCTGATCGAC
GCGCTGCTCG CCGCCGCCCG CGCCGGCGAG CACGAAGCCT GCGCACGCCT CGCGCACAAG
CTGCGCGGCG CCGCGGGCAA CGTGGGGGCA GGGCGGCTGG CGGAATGCGC GGGCGCGCTG
GAGGAGGCGC TGAAACCGGA CCCGACGGGG GACGCGCCGC GCGTGGGCGC GATCGGCGAC
CTGGCCGTGC GCGCGGGCGA GCTGGGCGAG GCCTGGTCGT ACACGCTGCA GGCGCTCGAC
GCCTTGCGTT CTGCGGCGGA CGACGCCGAA GCGCAAAGGC CGGATCAGAC CCCGCCAGGC
TCGACGTCGG CAGCCAGCCG ATAG
 
Protein sequence
MTAPSYVKPR HERRFGLRER LLLGLLVAAL GTVLVAVVGW VSFQRVVSSQ QAIVRDTLPA 
ADALHEAVRA NARLAALAPR LLRADSAAEL DQLRAALGVD VPLVRDRLAA LRSPHVETEL
RERLLATGDR LAAGLEGMSA AIDQRLRLRA ARLERIDTLR AAIDGLDELA RIHADNATAQ
LVSTLTTLLQ PDPGQGAPSL AEREAARERV LDLDIDSLER MHELNLTVHA LGFLIGRLDE
LDSGARLQAA RVEFAAHLAL LARRLADIAD PGGREQGERV LRLLASALDE EGTFALRARE
IALRAQVETL QGVVGELTTE LDALAGELIH RGGRILAAAG SAAERSATSG LIAFGVIAAA
LLLVTLGVTV HALRRHTLGR LRALEEATLA LASGRREVAI DTAGDDELAS LAVALERFRA
NAIERDRFAE ELRQQQQELE NQVLARTAQL REANAALARE TADHARARHA AEQADRAKTA
FLGTVSHELR TPLAGILGLL ELVEDASSTT ERKQYQAQMR AAAVLLLELL EDMLDFARIE
AGGVQVDKAS FSLRDTVNDV FAVQGTRAAV RGLALIAEVD PALPDAVQGD RRKLSQILLN
LVGNAIKFSD TGAVTVRVRA GTQPDHLHFA VEDHGIGIEP ARQQEVFEPF VQVRDSGRHH
AGTGLGLAVC RRLVEAMGGA IELKSAPGQG TTVSFEIPLP ATAAAAVEAL PAATEPKRTA
LPPGHRVLVV EDDEVNRMVV ERFLDALGQQ AVCAPDIQNA LRLLQARPID LALIDMNLPD
GDGRELLARL RALPAHAHTP AVLMSAHIPR REVDALLEAG FAAFLSKPFA RERLRALLAD
LLAEPAAAVP TRGTAGVAEV LQATDWIDAD FLRAEREALG EETVADIVGV FRTQGQPLID
ALLAAARAGE HEACARLAHK LRGAAGNVGA GRLAECAGAL EEALKPDPTG DAPRVGAIGD
LAVRAGELGE AWSYTLQALD ALRSAADDAE AQRPDQTPPG STSAASR