Gene Tmz1t_3650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3650 
Symbol 
ID7873155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4007015 
End bp4008091 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content69% 
IMG OID643700591 
Productsignal transduction histidine kinase, nitrogen specific, NtrB 
Protein accessionYP_002890620 
Protein GI237654306 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.051036 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATCGC ACCCCACCCA CGCCGCCAGC AGCCCCAGCA GCCCGTTCGC CGGACTCGAC 
CTGCTGTCTT CGGCGGTCGT GCTCGTCGAC GCCGGGCTGG TGATCCGCTA CCTCAACGCG
GGCGCGGAGA ACCTGTTCGC GATCAGCCGG CGCAAGCTGC TCGGCCAACC GCTCGAACGC
CTGCTCGGCA GCCCGCCCGG CCTGGCCGCG GCGCTCGACA ACGCGCTGCG CACCAACTGG
AGCTACACCG GCCAGGACAT CAGCGTGCAG CGCGGCGACG CCGAGCCGCT GCGGCTCGAC
TGCACGGTGA CGCCGGTCGA CACCGCAAGC GTGCGCCTGC TGCTCGAATT CCGCCCGATC
GACGCGCAGC TGCGCGTCGC GCGCGAGGAG CAGCTGCTGC ACCAGCAGCA GGCCAACCGC
GAGCTCATCC GCAACCTCGC GCACGAGATC AAGAACCCGC TCGGCGGCAT CCGCGGCTCG
GCCCAGCTGC TGCAGCACGA GCTCGACGAC CCGCAGCTGC GCGAATTCAC CGACGTCATC
ATCGCCGAGG CCGACCGCCT GCAGGACCTG ATGAACCGCC TGCTGAGTTC GCATTGCATG
ATGCGCCCGG CCTCGATCAA CCTCCACGAC GTGCTCGAAC GCGTGCGCCG CCTGATCCTC
GCCGAGTTTC CTTCGATCGG CATCGTCCCC GACTACGACC TCAGCCTGCC CGAGCTCACC
GCCGATCGCG AGCAGCTCAT CCAGGCCGTG CTCAACATCG TGCGCAACGC CGCACAGGCG
CTGGGCGGCC ACGGCGAGAT CCTGCTGCGC ACCCGCATCG CGCGCCAGGT CACGCTCGCC
AAGCGCCGTC ACAAACTGGC ACTCAAATTG CAAGTAATCG ACGACGGCCC CGGCATCCCC
GAGGAGATCC GCGATCGCAT CTTCTATCCG CTGGTTTCGG GGCGGGAGGG CGGCAGTGGT
CTGGGCCTGT CGCTCGCACA GAGCTTCATC GAGCAACACC AGGGCATGAT CGAGGTGGAT
AGCCGTCCCG GGCGCACCTG CTTCACGATC CTGCTGCCGA TTACCGAGCG TGCCTGA
 
Protein sequence
MPSHPTHAAS SPSSPFAGLD LLSSAVVLVD AGLVIRYLNA GAENLFAISR RKLLGQPLER 
LLGSPPGLAA ALDNALRTNW SYTGQDISVQ RGDAEPLRLD CTVTPVDTAS VRLLLEFRPI
DAQLRVAREE QLLHQQQANR ELIRNLAHEI KNPLGGIRGS AQLLQHELDD PQLREFTDVI
IAEADRLQDL MNRLLSSHCM MRPASINLHD VLERVRRLIL AEFPSIGIVP DYDLSLPELT
ADREQLIQAV LNIVRNAAQA LGGHGEILLR TRIARQVTLA KRRHKLALKL QVIDDGPGIP
EEIRDRIFYP LVSGREGGSG LGLSLAQSFI EQHQGMIEVD SRPGRTCFTI LLPITERA