Gene Tmz1t_1955 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1955 
Symbol 
ID7084423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2198529 
End bp2199653 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content72% 
IMG OID643698980 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_002355602 
Protein GI217970368 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.630058 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGTCGC CCCGGCTCGA AGCCTTGCGC AGCGCGCTCG AGAAGGCGGA GTTCCGCCTC 
GCCGCCACCC AGGACCTCGC CCGTGTCGGC GACTGGGAGC TCGACCGCCA CACCGGGCGG
ATGTACTGGT CGCGCGAACT CTTCCGCCTC TTCGAGCGCC CGGAAGCGCT CGGCGTGCCC
GACCTCAACG AGGCGCTCGG CTACTTCAGC CTGGAGTCGA CGAATCGCAC CCGCGACCTG
TTCTGGGAGG CGATCGACAG CGGCCGACGC TGCGCCCTGG AGCAGGAGGT CCTGCTGCCC
TCGGGCGAGG AGCGTCGCCA TTTCACCGTG ATCGTGCCGG TCGCCGACGA GACCGGGCGC
GTGTTCCGCC TGTACGGTAC GGTGCAGGAC ATCACCGAAC GCCGGCGCCT GGAGGCCGAG
CGCCTGGAGC ATCTGGAACG CCTCGAAGAG CTCTCCCGCC ACCTGGTCGA GATCGAGGAG
CGCGAGCGCC GCGAACTCGC CAGCGCGCTG CACGACCGCG CCAGCCCCAA CCTAGCCGCG
CTGCAGATCC TGTTCTCCAG CCTGGCCGAC GCCCTCCCCG AATCCGCCCG CGATGAGCTC
GCCCCGCTGC TGGAGGACGC CTCGGCCCTG CTCGCCGACA CCACCGCCGG CATCCGCGAG
ATCTGCACCA ACCTGCGCCC GGCCACGCTC GACTACGCCG GCCTGGTACC CGCACTGCGC
GAATACGTCG CCCAGTTCCG CGCCCGCACC GGGCTGGACG TGCGCGTCGA CGCTGCGTCC
GGCAGCCCCC CGTGCGCCCT CTCCCGCGCG ACGCAGACGC TCTGCTTCCG CCTAGTGCAG
GAGGCGCTCA CCAACTGCGC CAAGCACGCT CGCGCCGGCA GCGTGCGCAT CGGGCTCGGC
GGCTGCGCCG GCGGGGTCCT GCTGCAGATC GGCGACGACG GCGTCGGCTT CGACCTCTCC
CGTCTCGGCG AAGCGGGCAG CACCCCGGGG CTGGGCCTGA TCACGATGCG CGAGCGCGTC
GAGCTCGCCG GCGGGGACTT CCGACTGTAT ACCCGCCCCG GCGACGGCAC CGTCATCGAG
GTACGGCTGC CCGCCGAGCT CCACCCCGCG GAAACGAACC GATGA
 
Protein sequence
MRSPRLEALR SALEKAEFRL AATQDLARVG DWELDRHTGR MYWSRELFRL FERPEALGVP 
DLNEALGYFS LESTNRTRDL FWEAIDSGRR CALEQEVLLP SGEERRHFTV IVPVADETGR
VFRLYGTVQD ITERRRLEAE RLEHLERLEE LSRHLVEIEE RERRELASAL HDRASPNLAA
LQILFSSLAD ALPESARDEL APLLEDASAL LADTTAGIRE ICTNLRPATL DYAGLVPALR
EYVAQFRART GLDVRVDAAS GSPPCALSRA TQTLCFRLVQ EALTNCAKHA RAGSVRIGLG
GCAGGVLLQI GDDGVGFDLS RLGEAGSTPG LGLITMRERV ELAGGDFRLY TRPGDGTVIE
VRLPAELHPA ETNR