Gene Tmz1t_3563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3563 
Symbol 
ID7873069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3905964 
End bp3909035 
Gene Length3072 bp 
Protein Length1023 aa 
Translation table11 
GC content74% 
IMG OID643700504 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_002890534 
Protein GI237654220 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCACG CTCCCGCCGC CGATGCCGCC ACCCGCGGCC GCCATGCCGG CCTGCGCCTC 
GCCTTTCCGC TCATCGCCCT CACCGTGCTG GTGGGCGGCG TGGGTGCCTT CGGCTACCGC
TCGCTGAGCG AGGAGATCCG CCGCGAGACC CAGCGCACGC TGGCGGTCAT CGCCGAGCAG
AAGCGCCAGC ACATCGAAGG CTGGCTCGCC GAGGCGCGCG TGGATGTGCG GATGGTCTTC
ACCGGCCATT CGCAGCTCGA GGCCTTGTTC GAGGCCTGGC TGGACGGCGG CCGCCGCGAC
GATGCGGCGT TCGCGCGCAT GCGGGCGCTC GTCGAGGAGC TCGCCCGCCT GCGCGGCTGG
CAGGGCGTCG CCCTGCTCGA CGCCGACGGC GCGCCTACGC TCGCGGTCGG CAGCCCCGAC
CTCTCGGCCC ATGCCGAACT GATCGCCGAC GTGCTGCGGC AGCCGCGCGT CGAGCTCGTC
GACCTGCACC AGGACGCCCG GGGGCGCACG CATTACGGCG TGCTCGCGCC GGTGGGCGCG
CCGTCGCGCG GTGTGGCCTA CCTGACCTGG GAGGCCGAGA CCGCGCTGTA TCCGATGGTC
GAGGCCTGGC CGGTGCCCAC GCGCAGCGCC GAGACTTACC TGGTGCGGCG CGAGGGCGAG
GGCGTGCGCT TCCTGACCCC GCTGCGCCAT CAGGCCGATG CGGCGCTCGC GCTCGAGCGC
CCGCTCGCCA CTCCCGATCT GCCGGCCGCG CGCGCGGCGC TCGGCGAGCG CGGCATCCTC
TCCGGCGGGC GCGACTACCG CGGCGTGCCG GTACTGGCCT ACGCCGCCGC GGTCGAGGGC
ACGCCCTGGC TGATGCTCGC CGAGATCGAC GAGCGCGAGG CCTACTCGGG CATCCGCACG
CTGACCTGGG GCATGGGCGT GGTGATGGCG CTCGGCCTCA TGCTGGTGTA CTCGGGCGGC
TACCTGGTGT GGCGGCGCGA CCGCGAGCAG CGCGAGCGCG CCACCCTGCA GGCCCGCCAG
ATCGCCGAGG CGCGCTTTCG GGTGATCTTC GAGCAGGCGC CGCTGGGCGT GGTGCTGCTC
GACCCGCGCA CACGGCGGAT CACCGAGGCC AACCCGCGCT TCGCCGCGAT CGTCGGCCGC
AGCGTCGGCG AGCTGGTCGG CGTCGACCCG ATCGCGCTCA CTCATCCGGA TGACGTCGCC
GAGAGCCTGC GCCAGCTCGG CCGCCTCGAC GCCGGGCGCA TCGCCGGCTA CCGCCTCAAC
AAGCGCTACC TGCGCCCGGA CGGCACGCCG GTGTGGGTGA GCCTGGCCTT CGCGCCGGTG
CAGGTGGCCT CGGAGGACGC ACCGCGCTAC CTCGGCATCG TCGAGGACAT CTCCGCGCGC
ATCGAGATGG AGGAGCGCCT GCGCGAGGCC TCGGCCGCGG CTGCCGCGGC CAACGCCGCC
AAGAGCGAGT TCCTCGCCCA CATGAGCCAC GAGATCCGCA CGCCGATGAA CGCGGTGCTC
GGCCTCGCCC AGGTCCTCGA GCGCGAGCCG CTCGCGCCTG CGCAGCGCGA CATGGTCGGG
CGCATCCGCG GCGCCGGCGC TTCGCTGCTG GCGATCCTCG ACGACGTGCT CGACCTCTCC
AAGATCGAGG CCGGCCAGCT GCGCATCGAG CCGCGGCCCT TCGACCTGCG CGCGCTGCTC
GCCAATCTCG ACAGCCTGAT GGGCCAGGCC GCGCGTGCGA AGGGGCTGGC GCTGCGCATC
GAGCCGCCGG CGCTGCCGCC CGGGCAGCTG CGCGGCGATG GGCTGCGCAT CGAGCAGATC
CTCATCAACC TGGTCAGCAA CGCGATCAAG TTCACCGAGC GTGGCGAGGT TTCCCTGCGC
GTGCGCGCCG ACGAGGTGGG GGATGTGCGA CTGCGCCTGC GTGCGGAGGT GCGCGACACC
GGCATCGGCA TCGCGCCCGA GGCGCAGGCG CGCCTGTTCG CCCCCTTCAC CCAGGCCGAT
GCCGGCATCG CGCGGCGCTT CGGCGGCACC GGGCTGGGCC TGTCGATCTG CAAGCGCCTG
GTCGAGCTGA TGGGGGGCGC GATCGGTGTG CATAGCCAGC CCGGGCTGGG CAGCACCTTC
TGGTTCGAGC TGCCGCTGGA GCGGGTTGCC GGCGGCGAGC CGGCGAGCGT CGGAGTGGTC
GCGGCGCCGG AAGGCGATCG CGCGGCCGGG CCGCGGCTGG CCGGCATGCA GGTGCTGGTG
GTGGACGACA GCGCGATGAA CCGCGATCTG GTGCAAGGCG CGCTGGCGCT GGAGGGCGCG
TGCGCCACCC TCGCCGCCGA CGGCCAGCAG GCCATCGAGT TGCTGCGTGG CCGGCCGCAG
GCCTTCGACG CGGTGCTGAT GGACGTGCAG ATGCCGGTGC TCGACGGCCT CTCCGCCACC
CGCCGCATCC GCGACGAGCT CGGCCTCGCC GCGCTGCCCG TGATCGCCTT CACCGCCGGC
GTGGGCGAGG ATCAGCAGGC CGCCGCGCGC GCCGCCGGCG CCGACGACGT GCTGCCCAAG
CCGATGGACC TGGAGCAGAT GACGCAGCTG CTGATGCGCT GGGTAATGCC GCAGTCGGCT
GCGGGCCTGG CCGAAGCGAC GCCCGCGGCC GGCGCGGGTG ATCGGCCTGC GCACGCGGTC
GCTGCCGCAC CCATGCCGGC CGCCGCGCCC GTGCCCCCGC CCGCGTCGCC GGCGCAGAGC
GCAGCAGCGC CGCCTGCCGC GGCTGGCGAC GATTTCCCCG CGCTGCCCGG CATCGACCGC
GAGCGTGCGA TGCAGCGCCT CGGCAAGGAT CGCGACATGT TCATCGGCCT GCTCGGGCTC
TTCATCGAGG ACAACGCCGG GGTGGTGGCG GCGACCCGCG CCGATCTCGC ACGCGGCGAG
CGCGAATCGG CGGCGCGCCG CATGCACACG CTGCGCAGCA ACGCCGGCTT CATCTGCGCG
CTCGCGATCA TGCAGGCGGC CGCAGCGCTG GAGAAGGCAA TCGCGCAGGA CGAGCCCGAC
GTGGCGGCGC GCCTGGACGA ACTCGCGGCG GACATCGCCG GGCTGGTGGA GGCCGGCCGT
GCGTTCTTGT GA
 
Protein sequence
MNHAPAADAA TRGRHAGLRL AFPLIALTVL VGGVGAFGYR SLSEEIRRET QRTLAVIAEQ 
KRQHIEGWLA EARVDVRMVF TGHSQLEALF EAWLDGGRRD DAAFARMRAL VEELARLRGW
QGVALLDADG APTLAVGSPD LSAHAELIAD VLRQPRVELV DLHQDARGRT HYGVLAPVGA
PSRGVAYLTW EAETALYPMV EAWPVPTRSA ETYLVRREGE GVRFLTPLRH QADAALALER
PLATPDLPAA RAALGERGIL SGGRDYRGVP VLAYAAAVEG TPWLMLAEID EREAYSGIRT
LTWGMGVVMA LGLMLVYSGG YLVWRRDREQ RERATLQARQ IAEARFRVIF EQAPLGVVLL
DPRTRRITEA NPRFAAIVGR SVGELVGVDP IALTHPDDVA ESLRQLGRLD AGRIAGYRLN
KRYLRPDGTP VWVSLAFAPV QVASEDAPRY LGIVEDISAR IEMEERLREA SAAAAAANAA
KSEFLAHMSH EIRTPMNAVL GLAQVLEREP LAPAQRDMVG RIRGAGASLL AILDDVLDLS
KIEAGQLRIE PRPFDLRALL ANLDSLMGQA ARAKGLALRI EPPALPPGQL RGDGLRIEQI
LINLVSNAIK FTERGEVSLR VRADEVGDVR LRLRAEVRDT GIGIAPEAQA RLFAPFTQAD
AGIARRFGGT GLGLSICKRL VELMGGAIGV HSQPGLGSTF WFELPLERVA GGEPASVGVV
AAPEGDRAAG PRLAGMQVLV VDDSAMNRDL VQGALALEGA CATLAADGQQ AIELLRGRPQ
AFDAVLMDVQ MPVLDGLSAT RRIRDELGLA ALPVIAFTAG VGEDQQAAAR AAGADDVLPK
PMDLEQMTQL LMRWVMPQSA AGLAEATPAA GAGDRPAHAV AAAPMPAAAP VPPPASPAQS
AAAPPAAAGD DFPALPGIDR ERAMQRLGKD RDMFIGLLGL FIEDNAGVVA ATRADLARGE
RESAARRMHT LRSNAGFICA LAIMQAAAAL EKAIAQDEPD VAARLDELAA DIAGLVEAGR
AFL