Gene Tmz1t_4013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_4013 
Symbol 
ID7873659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4410186 
End bp4412099 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content72% 
IMG OID643700950 
Producthistidine kinase 
Protein accessionYP_002890973 
Protein GI237654659 
COG category[T] Signal transduction mechanisms 
COG ID[COG3850] Signal transduction histidine kinase, nitrate/nitrite-specific 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATCTGG CGGTCCCCCG CCTGCTCGTG AGGCGCTCGA TCATGCTGCT GGTGTCGGTC 
GCCCTCGTGG CGATGACGCT GATCGGGCTG GCGGGCATGA GCGCATCCTT CGTCGTGGTG
GATCGCAGCC GCGACAGCGT GCAGGCGATC GCCGTGGCCA GCACCCTGCG CGGCCATACC
CAGCGGATCG CCAACCTGAT CGCGATCGAT GCGCTCAAGG GCCGCATCGG TGTGTCCGAG
CGCACGCGCG CGGCGATGGC CGACATCGAG CGCGAGCTGC AGCAGGCCGC GCTGCGCCGC
TTCGTGGACG AGGCCCCGGG AGAGCTCTTC GCGGCGACCT ACCGCGGCGT GCAGGCGGGT
TGGGAGCGCT CGGTGCGGCC GCGGATCGAG GCGCTCGCCG GGCCTGTGCC GCCGGATGCG
CGCGAGGTCG AGACGCTGCT CGCCGGGGTC GACGACTTCG TCGGTCAGGT CGATAGCCTG
GTCGCGGTGC TCGGCCAGGA GAACGAGCGC CGTATCGACG AGCTGCGCCG CATCCTCGCG
CTCGCCGCGG CGGTGACGCT GGGGGTGGTG CTGGTGGTGA TCGTCTTGCT GCAGCGGGCG
CTGCTGCGCC CGCTCGGTGG CCTGCTCGAC GCCGCACGCC GCATCGCCGG CGGTGATTTC
GGCGTGCGCG TGCGCTACAC CGGGGAGGAC GAACTCGGCC AGGTCGGCAG CGCCTTCAAC
CTGATGGCCG ACGAACTCGC CCGCCACTAT CACCTGCTCG AGCTGCGCGT CACCGAGAAG
ACCGCCGAGC TGCAGCGCAG CAACCGTTCG CTCGAGCTGC TCTACCACGC CATCGCCCGG
CTCTACCAGG CGCCGAGCGC GCCCGACGCC TACGAGGCGA CCCTGCGCGA CATCGACCGC
GTCGCCGGGC TGGAGGGCTC CTTCGTCTGC ATCGAGCCGC GCCCCGGCGC CCCCGCGGCG
GTGATCGCCT CGTCCATGGG GCCCTGTCCG GATCGCGCCG AGCGCGGCGA GGACGCCTGC
GCGGCCTGCC GGGCCGGCTC GGAGGGGCAG GCGCTGGCGC TGCTGTCGCC GACCCTGCTG
CGCTTTCCGC TGCGCGACCG CGAGCACCAT CACGGCATGC TGCGCCTGTC GCTGTCCGAC
GGCGCGCGCC TCGAGGACTG GCAGCGCCAG CTCGTCGAGG CGCTGTCGCG CCACATCGGC
ATGGCGCTCG GCGCGGCACG GCGCACCGAG CAGGAGCGCC TGCTCGCACT GCAGGAGGAG
CGCTCGGTGA TCGCGCGCGA GCTGCACGAT TCGCTCGCGC AGGCGCTGTC CTACATGAAG
ATCCAGGTCA GCCTGTTGCA GCGCGCACTC GCCGATCCGG CGCGCACGGC GGAGGCGGAG
CCGATCCTGG CCGACCTGCG CGAGGGCATC AGCGCCGCCT ACCGTCAGCT GCGCGAGCTG
CTGGTGTCCT TCCGCCTCGG CCTGTCGGCC GACCTCGCCA CCCTGATGGA AGATGCGGCG
CGCGAATACG GCACGCGCGG CGGTCTGGAG GTCGAGCTCG CCGTGGAGCT CGGCGCCTGC
CAGCTCAGCC CGAACCAGGA GGTGCATGTG CTGCAGATCG TGCGCGAGGC CTTGTCGAAC
ATGGTGCGCC ACGCTTCGGC ACGCCATGCC TGGGTGGCGC TGCGTGGCGG CGCGGATGGC
GAGGTGCTGC TGGAGGTGCG CGACGACGGC TGCGGCATCG GCGCGCCGCC GGCGGATGCC
CGCAACCACC ACGGGCTGGC GATCATGCGC GAGCGCGCGC GCAGCATGGG CGGGGAAATC
GACATCGGGC CGGCGCTGCC GAGCGGTACC CGGGTGTGCG TGCGCTTCCG ATCCGCCAAC
GCGAGCATGG CGGCGATGGC GCAGGGTGAT GCGAAGAAGG AAACGGAGAC ATGA
 
Protein sequence
MYLAVPRLLV RRSIMLLVSV ALVAMTLIGL AGMSASFVVV DRSRDSVQAI AVASTLRGHT 
QRIANLIAID ALKGRIGVSE RTRAAMADIE RELQQAALRR FVDEAPGELF AATYRGVQAG
WERSVRPRIE ALAGPVPPDA REVETLLAGV DDFVGQVDSL VAVLGQENER RIDELRRILA
LAAAVTLGVV LVVIVLLQRA LLRPLGGLLD AARRIAGGDF GVRVRYTGED ELGQVGSAFN
LMADELARHY HLLELRVTEK TAELQRSNRS LELLYHAIAR LYQAPSAPDA YEATLRDIDR
VAGLEGSFVC IEPRPGAPAA VIASSMGPCP DRAERGEDAC AACRAGSEGQ ALALLSPTLL
RFPLRDREHH HGMLRLSLSD GARLEDWQRQ LVEALSRHIG MALGAARRTE QERLLALQEE
RSVIARELHD SLAQALSYMK IQVSLLQRAL ADPARTAEAE PILADLREGI SAAYRQLREL
LVSFRLGLSA DLATLMEDAA REYGTRGGLE VELAVELGAC QLSPNQEVHV LQIVREALSN
MVRHASARHA WVALRGGADG EVLLEVRDDG CGIGAPPADA RNHHGLAIMR ERARSMGGEI
DIGPALPSGT RVCVRFRSAN ASMAAMAQGD AKKETET