Gene Tmz1t_1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1941 
Symbol 
ID7084409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2182851 
End bp2184395 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content73% 
IMG OID643698966 
Producthistidine kinase 
Protein accessionYP_002355588 
Protein GI217970354 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.531292 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCTCGA TCCGCCGTCG CACCTTACTG CTCGTGCTCG GCCTGCTCGG CGTGGCGCTG 
GGTGCGATCT CCTACCTCGG TTATCGCGAT GCCCGCCACG AGGTGCGCGA ACTCTTCGAT
GCCCGTCTCG CCCAGCAGGC GCGCCTGCTC GCGGGCATGA TCCCGGGCGG GATGGCACCG
GATGCACGTG CGGCGCTGCA GGACGCGCTC GATGCGGGGG GGCTCGGTGC GGCCGTGGGG
GAGGGGACGG ACGCGGAACG CGAGGACGGT GACGAGGCGC TGCCGCTGGG CCACGAGTAC
GAGGGCAAGC TCGGCTTCGT GGTGGTCGAC GACCAGGGGC TGGCCTTGCT GCAGTCCGCC
GCGGCGCCGA TGGGTGCGCT CGACCTGCTG CTCGGCGCAC GCACCAGGGG CGCCGAGCAG
CAGGGGCTCG GCGAGGTGGG AGAGGATCTT GCGGGCTATC ACACGGTGAG TCTGCACGAT
GGCGCCTGGC GGCTCTTCCT GCTCCGGGAT GCGCGCGACC GCCAGTGGAT CCTGGTCGGC
GAGCGCGAGG ACGTGCGGGG AGAACTCCTG GGCAGGATCA CCTTGCGCAG CGTGCTGCCC
GATCTCGTCG GGTTGCCGTT GGTCGCCGTG CTGGTGTGGC TGGCGATCGG CTGGGGCCTG
CGCCCGCTCG CGCGCATCGT CGAATCCTTG CAGGCGCGCG GGCCGGACGA CCTCTCCGCC
CTCGCGCTGC AGGACGTGCC GCAGGAGCTC GAGCCGATGG TGGCCGCGCT CGACCGCCTG
CTGCATCAGG TCAACGAGCT GCTCGAACGC GAGCGCCGCT TCCTCGCCTA TGCCGCGCAC
GAGCTGCGCA CGCCGCTGGC CGTGCTGCGC ATCCATGCCC AGAATGCGCT GCAGGCGCCT
GATCCGGCCG ATCGCGAGGA GGCGCTCCGG CTGCTGGGCT CGGGCATCGA GCGTGCCACC
CGGGTGGTGG CGCAGTTGCT GACGCTGGCC CGCCTCGAAC CCGACGCGAG CCGGCCCAAG
AGGCTGCCGA TCGAGCTGCT CGCGCTTGTC CGCGAGCAGC TCGCCGAGCT GACCCCGCTC
GCCGACGAAC ATGGTCAGGA CCTCGCCCTC GAGGCGGACG AGGGGGCCGA CTTCCACCTG
CTCGGCGATG CCGGCAGCCT GGGCATCCTG ATGCAGAATC TGGTGGGCAA CGCGGTGCGG
CACACGCCGC CCGACGGCTG CATCCGCGTG CTGCTTGAGG CCACGCCCGC AGCCATCGTG
CTGCGGGTGC AGGACAGCGG CCATGGCGTG CCGCCGGAGC TGCGCGAGAA GGTGTTCGAG
CGCTTCTTCC GCGCCGGTGG CGGGCAGGGG GCGGGTCTCG GGCTGGCGAT CGTCGCGCGC
ATCGTCGAAC TGCATGGCGG CACGATCGCG CTCGACGGCT GTGCGCTCGG CGGGCTGGAG
GTGCGGGTGG TGCTGCCGCG GGATGCCGCC GCGCCGCGCC GGGTGCAGGG CGATGAGGGG
AAGGTGCCGC CGTGCTCCGC GTCCGGCGGA GCGGCCCGCC CTTAA
 
Protein sequence
MGSIRRRTLL LVLGLLGVAL GAISYLGYRD ARHEVRELFD ARLAQQARLL AGMIPGGMAP 
DARAALQDAL DAGGLGAAVG EGTDAEREDG DEALPLGHEY EGKLGFVVVD DQGLALLQSA
AAPMGALDLL LGARTRGAEQ QGLGEVGEDL AGYHTVSLHD GAWRLFLLRD ARDRQWILVG
EREDVRGELL GRITLRSVLP DLVGLPLVAV LVWLAIGWGL RPLARIVESL QARGPDDLSA
LALQDVPQEL EPMVAALDRL LHQVNELLER ERRFLAYAAH ELRTPLAVLR IHAQNALQAP
DPADREEALR LLGSGIERAT RVVAQLLTLA RLEPDASRPK RLPIELLALV REQLAELTPL
ADEHGQDLAL EADEGADFHL LGDAGSLGIL MQNLVGNAVR HTPPDGCIRV LLEATPAAIV
LRVQDSGHGV PPELREKVFE RFFRAGGGQG AGLGLAIVAR IVELHGGTIA LDGCALGGLE
VRVVLPRDAA APRRVQGDEG KVPPCSASGG AARP