Gene Tmz1t_3720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3720 
Symbol 
ID7873719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4086339 
End bp4088420 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content72% 
IMG OID643700666 
Producthistidine kinase 
Protein accessionYP_002890690 
Protein GI237654376 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCTCG CCGAACGCCT GCGCGGCTCG GTACGCGCCA AGCTGCTCGC CCTCGTGCTC 
GCGCCGCTGA TGCTGGGCTT CCCGCTGATC ATGGGCGTGC TGTGGTACTG GGGCGAGGTC
TACTACCACC GCCTGATGGT GTCGCGGGTC GCCAGCGACT TGGCGACTGC GCACGGTTAC
TTCGAGCGCG TGATCGACGG CGTCGGCAAC GGCGTGCAGG GGCTCGCCGG CTCGCATGCG
CTCGCGCAGA CCCTGGCGCA CGAGAGTCCG GAAGCGGTGG CCGCGCTGCT CGGTGCGCGC
AAGGGCGGAC TCGCGCTCGA CTTCCTCAAC CTGCTCGACG TCAATGGCCG CGTGCTGCAT
GCGTCGACCG CGCTGGCGGC GGGGAGCGCG CGTGGTGGAT GGCCCGTCGT GCAGCAGGCG
ATCCGCGCCC GCGGCGGCAG CGTCGTCGAG CGCTTCGCAC CCGCCGAGCT CGCGGCGATC
GACGCCGGCA TGCGTGAGCG TGCCCGCCTT GCCCTGGTGG CCACGGCGCG CGCGCGGCCG
GATTCGCGCG GCGAGGAGGA GCGCGGCCTG GTGGTGCACG CGGCGGCCCC GGTGCTCGAC
GCCGCGGGCA ACCTGGTCGC GGTGCTCGAA GGCGGCGTGC TGCTCAATGG CAACCTCGAC
TTCGTCGATA CCCTCAACGC CATTGTCTAC CGCGAGGGCA GCCTGCCCGA GGGCAGCGAG
GGCACCGCCA CGCTCTTCGT CGACGATGTG CGCATCGCCA CCAACGTGCG CCTGTTCGAG
GGCGCGCGCG CGCTCGGCAC GCGCGCCTCG GACGAGGTCC GCACGCATGT GCTCGACCAC
GGGCGTACCT GGCTGGAGAC CGCCTTCGTG GTCAACGACT GGTACGTGTC GGCTTACGAG
CCGGTGGTGG ACAGCCGTGG CGAACGCGTC GGCATGCTCT ACGTCGGCTT CCTCGAGGCG
CCCTTCCGCG CCGCCAAGCG CATGGCCCTG CTGGTGGTCG GCGCGCTCGC GCTGCTGGTC
AGCGCCGCCG GCGCGCTGCT GACGCTGTAT TGGGCGCGCG GCATCTTCCG GCCGATCGAG
CGCATGCACG CCACCATCGG CCGCATCGAC GCCGGCGACG AGCAGGCGCG CGTGGGTGAC
GTCGCCAGCC GCGACGAGCT CGGCCGCCTG GCTACCGCCT TCGACCACCT GCTCGACGAC
CAGGCCGTGC GCCGCGCCGA GCTGCAGGCG CTCAACGCCT CGCTCGACCG CAAGGTCGCC
GAGCGCACCG CCGACCTCGC CGACGCCAAC GCCGAGCTGC GCGCCGCACA GCACCGCCTG
GTGATGAGCG AGAAGCTCGC CGCGATCGGC GAGCTCACCG CCGGCGTCGC GCACGAGATC
AGCAATCCCA CCGCGGTGAT CCAGGGCAAT CTCGACCTGC TGCGCGAGGA GCTCGGCCCC
GCGGCGCAGC CGGTGGCCAA CGAGATCCGC CTGATCCACG AGCAGGTCGG GCGCATCCGC
CTGATCGTCA CCAAGCTGCT GCAGTTCGCC CGTCCGGGCG AGTTCGCCGG CTACGTCGAG
GACGTCGACG TGAACGCCGC GCTCGCCGAC TGCCTGGTGC TGACGCGCCA GCATCTGGCG
CGCGCCGAGG TGAAGGTGGT GCAGCGCCTG GCCGCCACCG CGCGGGTGCA GATCAACCTG
CAGGAGCTGC AGCAGGTGCT GATCAACCTC ATCGTCAACG CCGTGCAGGC CATGCCGGCC
GGCGGTACGC TGACGCTGGA GACGGTCGAC CGCGACCCGC GGCAGGATGC GCGCGGCGTG
CGCATCACGG TGCGCGACAC CGGCGGCGGC ATCCGTGCCG AGGACCTGGC GCGCATCTTC
GATCCCTTCT TTACCACCAA GAAGCGCCAA GGCACCGGGC TGGGGCTGTC GATCAGCCAC
ACCCTGGTCG AGCGCTATGG CGGCCGCATC GAGGTGGACA GCGCGCCCGG GCGGGGCGCG
GCCTTCACGG TGAGCCTGCT CGCCGAGCCG GTGTATCGCA AGGACCCCGC GCCGCCGCCC
GGCGGACGGG AAAACGGAAC AGGACAAACG GAAGCGACAT GA
 
Protein sequence
MRLAERLRGS VRAKLLALVL APLMLGFPLI MGVLWYWGEV YYHRLMVSRV ASDLATAHGY 
FERVIDGVGN GVQGLAGSHA LAQTLAHESP EAVAALLGAR KGGLALDFLN LLDVNGRVLH
ASTALAAGSA RGGWPVVQQA IRARGGSVVE RFAPAELAAI DAGMRERARL ALVATARARP
DSRGEEERGL VVHAAAPVLD AAGNLVAVLE GGVLLNGNLD FVDTLNAIVY REGSLPEGSE
GTATLFVDDV RIATNVRLFE GARALGTRAS DEVRTHVLDH GRTWLETAFV VNDWYVSAYE
PVVDSRGERV GMLYVGFLEA PFRAAKRMAL LVVGALALLV SAAGALLTLY WARGIFRPIE
RMHATIGRID AGDEQARVGD VASRDELGRL ATAFDHLLDD QAVRRAELQA LNASLDRKVA
ERTADLADAN AELRAAQHRL VMSEKLAAIG ELTAGVAHEI SNPTAVIQGN LDLLREELGP
AAQPVANEIR LIHEQVGRIR LIVTKLLQFA RPGEFAGYVE DVDVNAALAD CLVLTRQHLA
RAEVKVVQRL AATARVQINL QELQQVLINL IVNAVQAMPA GGTLTLETVD RDPRQDARGV
RITVRDTGGG IRAEDLARIF DPFFTTKKRQ GTGLGLSISH TLVERYGGRI EVDSAPGRGA
AFTVSLLAEP VYRKDPAPPP GGRENGTGQT EAT