Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3193 |
Symbol | |
ID | 7874333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3474084 |
End bp | 3476792 |
Gene Length | 2709 bp |
Protein Length | 902 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643700122 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_002890165 |
Protein GI | 237653851 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0583373 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGCCT CCGCCGCCGC CCGGGCGACC CTGTCGCGCT GGTACTGGGC CATGCCCTAT CTCGCGGTGA CCGTGCTCGC GCTGTCGATG CTCGCGGTCG TCTGGCTGCT GCAGGCGCGC GAGACCGCGG TCGAGCGCGA CGCGCTCGCG CGCGACCTGC AGTGGACCGA GCAGTCGATC CGGCGCGCCA TGCTCACCAC CGAGGAGTTT CTCGTCCAGC TCGCGCGCGA CCTGTCGGCG GGCACGCTCG ACCACGACGA CTTCCAGCTG CGCGCCAACG AGCATCTGGC GATCAACCCC GCGCTCGCCA ACCTCGCCTG GGTGGACCGC GAGCACGTGA TCGCCTGGTC GGCGCCCTTC GACACCCGTG ACCTGCTCGC CGGCGAGATC CTCGTGGATG ACCAGGTGGA GGCCTTCCAG AACAGCGCCC TGTCCGCCCG CCTGACCTAC GGCCGGCCCT ACATCGACGC GCGCGGCAGC GCGGTGATCG AGGTGTACGC GCCCGTGCTG CGGCGCGGCG ATTCGCTCGG TGCGATCGTC GCCGTGTATT CGATCGAGCG CATGCTGAGC CGGCTGGTGC CGGCGCGCTT TGCCGAAAAA TACCGGCTTG CGGTGCTCGA CCGCGATGGC GGCGTGCTGA CCCTCAGCTC GGCCCTGCCG CCCGCGCAGG AGGCGCTCTC GCTTACCCTC GCGCTCGATC CGCCGGGCAA CGGCCTCGCG CTGCAGGCGA TGGCCTTCCG GCCCGGTGGG GCGGTGCTGC GCTACCTGCC GGCGGTGCTC ATCGTGGGGC TGTCGCTGCT GGTGCTGTGG AGCCTGTGGA CGCTGCGCCG CCACGTGCAG CGCCGGGTGC GGGTGGAGAA GGAGCGCGAC CAGCTCTTCG ACCTTTCGCT CGACCTCTTG TGCGTCATCG ATCTCGATGG TCGTTTCGGG CGCTGCAACC CGGCCTTCGA GCGCGTGCTC GGGCACGACC CCCGGAGCCT GCCCGGGTGC TCGCTGATCG ACTTCGTGCA TCCCGAGGAC GTTTCCGAGA CGCTGGCGAT GTTGCGCCGC CTCGCCGGCG GCGAACCGGT GCGCTTCGAG ACGCGCTGCC GCTGTGCCGA CGGCAGCTAC AAATGGCTGA TGTGGAGCAT CAACCCGGTA CGCGAGGAGC GTCTGGTCTA CGCCGTGGCG CACGACGTCA CCGGACGCAA GGCCACCGAG GACGCGCTGC GCGCCGAGTC CGCCTTCCGC AAGGCGATGG AGGACTCGGT CGTCACCGGC CTGCGCGCGA TCGACCTGTC GGGACGCATC ATCTATGTGA ACACCGCCTT CTGCCGCATG ATCGGCTTCG ACGAGGACGA GCTCGTCGGC GCCGGGCCGC CCTTTCCCTA TTGGCCACCC GAGCAGCTCG CCGAATGCGA GCGCAACCTG TCGATGACCC TGTCGGGGCG CGCGCCGCGC AGCGGCTTCG AGATGCGCAT CCTGCGCAAG AACGGCGAGC GCTTCGATGC GCGCTTCTAC CTTTCCCCGC TGATCGACCT CACCGGCCGG CAGACCGGCT GGATGGCCTC GATCACCGAC ATCACCGAGC CCAAGCGCGT GCGCGCCGCG CTCGAGTCGG CGCACGAGCG CTTCGAGGCG GTGCTCGACG GCCTCGACGC CGCGGTGTTC GTGGCCGACG CGCGCACCGA CGAGATCCTC TTCGCCAACC TCGCCTTCAA GCGCATCCAT GGTTTCGACG CGGTCGGGCG CACCGTGCGC GGGGTGGCGG TGCCGCAGCC CGAGCGCGGC GACTACCGCG TCGATCCGCG CAACCTGTCG CCCGCCGACG TGCCGCGCGA GCTCTTCGAC GGCGAATTGC TGCACCCGCA GTCGGGGCGC TGGTACCACG TGCGCGAGCA GGCCACACGC TGGGTGGACG GGCGCGTGGT GCGCATGGGG ATCGCCACCG ACATCACCGA CCGCAAGCAG ACCGCCGCGG TGGCGCGCGA GCAGGAGGAG CGCCTGCAGC GCACTTCGCG CCTGATCACC ATGGGCGAGA TGGCCTCGAC GCTGGCGCAC GAGCTCAACC AGCCGCTGGC GGCGATCGCG AACTACTGCG CCGGTTCGGT GACGCGGCTG CAGTCGGGCA AGGCGAGGAC CGAGGACGTG CTCGCGGCGA TGCAGAAGGC GAGCTTCCAG GCCGAGCGCG CCGGCAAGAT CATCCGCCGC GTGCGCGAGT TCGTGAAGAA GAGCGAGCCG CAGCGCTGCG CGGTCGACCT CGTCGAGGTG CTGGAGGACG CGATCGGCTT CGCCGACATC GACGCCCACC GCACCCGCAT CCGCATCCAC ACGGAGCTCG AGCCCGGGCT GCCGCCGGTG TACGCCGACC GCATCATGAT CGAGCAGGTG CTGCTCAACC TGATCCGCAA CGGCCTCGAC GCCATGGCCG ACGCCTTGCC CGAGGCGCGC GTGCTGACGG TGCGGGTGCG CACCGTCGGC GCGGACGCGG TCGAGGTCGC GGTGATCGAC CGCGGCCACG GCATCAGCGA GGAGGGCAGG GCGCAGCTGT TCACGCCCTT CTACACCACC AAGGCCGAGG GCATGGGCAT GGGGCTCAAC ATCTGCCGCT CGATCGTCGA GTTCCACAAC GGCAGATTGC TGGTCGACGC CAACCCCGAA GGCGGTACCA TATTCACGTT TACCCTGCCG ACGGAGTCCG CAATTGAGCG CAGTGCCCGC AGCGCCTGA
|
Protein sequence | MQASAAARAT LSRWYWAMPY LAVTVLALSM LAVVWLLQAR ETAVERDALA RDLQWTEQSI RRAMLTTEEF LVQLARDLSA GTLDHDDFQL RANEHLAINP ALANLAWVDR EHVIAWSAPF DTRDLLAGEI LVDDQVEAFQ NSALSARLTY GRPYIDARGS AVIEVYAPVL RRGDSLGAIV AVYSIERMLS RLVPARFAEK YRLAVLDRDG GVLTLSSALP PAQEALSLTL ALDPPGNGLA LQAMAFRPGG AVLRYLPAVL IVGLSLLVLW SLWTLRRHVQ RRVRVEKERD QLFDLSLDLL CVIDLDGRFG RCNPAFERVL GHDPRSLPGC SLIDFVHPED VSETLAMLRR LAGGEPVRFE TRCRCADGSY KWLMWSINPV REERLVYAVA HDVTGRKATE DALRAESAFR KAMEDSVVTG LRAIDLSGRI IYVNTAFCRM IGFDEDELVG AGPPFPYWPP EQLAECERNL SMTLSGRAPR SGFEMRILRK NGERFDARFY LSPLIDLTGR QTGWMASITD ITEPKRVRAA LESAHERFEA VLDGLDAAVF VADARTDEIL FANLAFKRIH GFDAVGRTVR GVAVPQPERG DYRVDPRNLS PADVPRELFD GELLHPQSGR WYHVREQATR WVDGRVVRMG IATDITDRKQ TAAVAREQEE RLQRTSRLIT MGEMASTLAH ELNQPLAAIA NYCAGSVTRL QSGKARTEDV LAAMQKASFQ AERAGKIIRR VREFVKKSEP QRCAVDLVEV LEDAIGFADI DAHRTRIRIH TELEPGLPPV YADRIMIEQV LLNLIRNGLD AMADALPEAR VLTVRVRTVG ADAVEVAVID RGHGISEEGR AQLFTPFYTT KAEGMGMGLN ICRSIVEFHN GRLLVDANPE GGTIFTFTLP TESAIERSAR SA
|
| |