Gene Tmz1t_3399 details


Gene Information

Locus tag: Tmz1t_3399
Symbol:
ID: 7873890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3717949
End bp: 3719991
Gene Length: 2043 bp
Protein Length: 680 aa
Translation table: 11
GC content: 70%
IMG OID: 643700338
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_002890370
Protein GI: 237654056
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
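
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 3717949
end_bp = 3719991

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 2043 0 680
```

Under these assumptions the result agrees with the listed Gene Length (2043 bp) and Protein Length (680 aa).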


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTGACA ACCGCAAGCC GCCGGATCCC GACCCGCAGC CTTTTCCGAT GAGCCTCTCC 
GACCCTCTCC CGCCCCCGGC GCGCCGGCGC TGGCTGCAGC AGCTGCCCTA CCTCAGCCTG
GCGCTCTTCC TCGCCGCGAT CGCCGCGCTG GTGTGGCTCA CCCGCGAGTA CGACGACGAG
GCCCAGCGCG CGACGCTGAT CAGCGACGTG CTGTGGATGG AGCAGAACCT GCGCTTCAAC
CTGGACCGCA ACGAGAACCA CCTGCAGGAG ATCGGCCCCG AGCTGCTCGC CACACCCACG
CTGTCGGCGC AGACCGAGGC TCGGCTCGCC AGCCTGCTGA CATCCGAGCG CGGCCTCGCG
CGCATCCTGT GGCTGTCGCC GGAAGGCGAA GTGCGCGGCG CGATGCCGCC GCTGGCTCCC
GCCCAGGCGG TGCACGACGC CTCCGCGGAT GCTGCCGCGG TGCCGGTCGA CGAATCCAGC
GTGCGGCTGG CACGCGCCAT CGGCCGCCCG GTGTATGGCC CCGCGCATCC GGCGCCCGGT
GGCACCCACC ACTTCGAGGT GCACGTGCCG GTGTGGTCGG ACGAGGCGCA GGTCGGCATG
GTGGTCGGCG TGTATGCGCT CTCGGACGTG GTGATGCGCG AGCTGCCGTG GTGGTTCTCC
GAGCGCTACC ACGTCGCCGT GCTCGACAGC GACGGACGCA CGATCGCCGC CAAGTCGAAA
GTCGCGCCGG TGTCGCCGCG GCTGTCCTAT TCGATGCCCT TCGATCCGCC CGGACACGGC
CTCACGCTGC AGATCTCCGC CTACCGCTCC GACCCGCGCT GGATCCCGGT GCTGCTCGGC
GCCTCGATCG TGCTGCTGGC CGGCATCATC GTGTGGAGCG TGATGCAACT GCGCCGCCAG
CTCGCACGCC GTCAGCAGGC CGAGGTGGCG CTGCGCGCCG AGTCGGTCTT CCGCAAGGCG
ATGGAGGACT CGATGCTGAC CGGCATGCGT GCGCGCGACC TCGAAGGCCG CCTCACCTAC
GTCAACTCCG CCTTTTGCCG CATGACCGGC TTCAGCGCGG ACGAGCTGCT CGGTCGCAAG
CCGCCGATGC CCTACTGGGA CCCGGAAAGG CTGGAGCAGA CCTTCGAGCT GCACCGCCAG
ATCATGACCA GCGGCAGCGC CAGCGAAGGC ACCGAGGTGC GCCTGCGGCG CAAGAACGGC
GAAACCCTCG ACGTCCTCGT CTTCGAGGCG CCGCTGATCG ACGCTTCCGG CCGCCACGCC
GGCTGGATGG GTTCGGTGCT CGACATCACC GAACAGAAGC GCGCGCGCGA ACAGGCCCGC
CAGCAGGAGG AGCGCCTGCA GCAGAGCTCG CGCCTGATCA CCATGGGCGA GATGGCGTCC
ACGCTCGCGC ACGAGCTCAA CCAGCCGCTC GCGGCGATCG CAAGCTACAC CACGGGCTGC
ATCAACCGCC TGCAGGACGA GGCGCCGCTC GACCGCAGCG AGCTGCTCGA CGTGCACCAG
CGCATCGCCC GTCAGGCCCA GCGCGCCGGC GAGATCATCC GCCGGGTGCA CGATTTCGTG
CGCCGATCCG AACCCAAGCG CGAGGCGCTC GACCTCGCCG CGGTGATCCG CGACGCCATC
GGCCTGATCG AGGCCGACGC GCGCAAGCGC GGCATGCGGA TCCGCAGCGA GCTCGCCGAC
GGCCTGCCGC AGGTGCCGGC GGACGCGGTG ATGATCGAGC AGATCGTCGT CAACCTGGTG
CGCAACGCGA TGGACTCGAT GCGCGACACC CCGCCAGCCG AGCGCGTCGT CGCCGTGCAC
ACCGCGCGCG AGGGCCGCTT CGTCACCGTC ACCGTCGCCG ACCGCGGCGC AGGCATTCCC
GCCGAAACCG CCGCGCGGCT ATTCGAGCCC TTCTTCACCA CCAAGCAGGA GGGCATGGGC
ATGGGACTCA ACATCTGCCG CTCGATCGCC GAGCTGCACC GCGGCCGCCT GGCCTTCGAA
TCGCGGCCGG GCGGCGGTAC CATCTTCACC CTTTCCCTCC CGGTGGACGC CGAGTTCGAA
TGA
 
Protein sequence
MRDNRKPPDP DPQPFPMSLS DPLPPPARRR WLQQLPYLSL ALFLAAIAAL VWLTREYDDE 
AQRATLISDV LWMEQNLRFN LDRNENHLQE IGPELLATPT LSAQTEARLA SLLTSERGLA
RILWLSPEGE VRGAMPPLAP AQAVHDASAD AAAVPVDESS VRLARAIGRP VYGPAHPAPG
GTHHFEVHVP VWSDEAQVGM VVGVYALSDV VMRELPWWFS ERYHVAVLDS DGRTIAAKSK
VAPVSPRLSY SMPFDPPGHG LTLQISAYRS DPRWIPVLLG ASIVLLAGII VWSVMQLRRQ
LARRQQAEVA LRAESVFRKA MEDSMLTGMR ARDLEGRLTY VNSAFCRMTG FSADELLGRK
PPMPYWDPER LEQTFELHRQ IMTSGSASEG TEVRLRRKNG ETLDVLVFEA PLIDASGRHA
GWMGSVLDIT EQKRAREQAR QQEERLQQSS RLITMGEMAS TLAHELNQPL AAIASYTTGC
INRLQDEAPL DRSELLDVHQ RIARQAQRAG EIIRRVHDFV RRSEPKREAL DLAAVIRDAI
GLIEADARKR GMRIRSELAD GLPQVPADAV MIEQIVVNLV RNAMDSMRDT PPAERVVAVH
TAREGRFVTV TVADRGAGIP AETAARLFEP FFTTKQEGMG MGLNICRSIA ELHRGRLAFE
SRPGGGTIFT LSLPVDAEFE
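
As a rough consistency check between the two sequences, the first row of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch, not the tool used to produce the record. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First row of the gene sequence and the first 20 residues of the protein.
gene_head = clean("ATGCGTGACA ACCGCAAGCC GCCGGATCCC GACCCGCAGC CTTTTCCGAT GAGCCTCTCC")
protein_head = clean("MRDNRKPPDP DPQPFPMSLS")

print(translate(gene_head) == protein_head)  # prints: True
print(gc_fraction(gene_head))
```

Only the first 60 bases are used here, so the GC fraction printed for this fragment is not expected to match the 70% reported for the whole gene. Passing the full cleaned gene sequence to the same functions would give the whole-gene figures.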