Gene Information |
Locus tag | Tmz1t_1844 |
Symbol | |
ID | 7084267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2066427 |
End bp | 2068571 |
Gene Length | 2145 bp |
Protein Length | 714 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643698867 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_002355492 |
Protein GI | 217970258 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| |
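The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 2066427
END_BP = 2068571
GENE_LENGTH_BP = 2145
PROTEIN_LENGTH_AA = 714


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2068571 - 2066427 + 1 gives 2145 bp, and 2145 / 3 gives 715 codons: 714 residues plus the stop codon.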
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.257508 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTCGG ACGAGCTCGA CTTCCACGCG CTGATCGACT TCATGCCCGA CAGCGGCGCG ATCCGCTTTT GCGGGCGGCG GGCGATCCTC GTCCATGCCG ACTCGATGGG CGCGCTGCGC AAGGAGCTGG TCGAGACCCT GGGCAGCGAG ATCGCCAAGG TGGTGCTCAC GCGCTACGGC TTCAGCTGCG GCTACGAGGA TGCCGAGCTG CTTGCCGCCT ACATGCATCC GGACAGCCTG GAGGAGTTCA TCCTCGGCGG ACCGCGCACC CACATGTTCT CGGGCATCGC CGAGGTCGCG CCCGATGCGC TCGAGCTCGA CCATGCCAGC GGGCGCTACC GCATGAGCGG GCGCTGGCGG CACTCCTACG AGGCCGAGCA GCACCTGCGC CTGTTCGGTC GCGCCGACGA GCCGGTGTGC TGGACGCTGA CCGGCTATGC CTCGGGCTTC GCCTCGCGGG TGCTGCAGGC CGACATGATC TGCGTGGAGA CGGCCTGCGA GGGGCGCGGC GATCCCCACT GCGCGTGGAC GCTGATGAAC GCCAGCGATT GCGCGCCCGA GCTCGCCGGC CTGCGCGAAT ACTTCAAGCC GCTCAACCTG CGCGACCGCC TCAACCTGCT CGAGAGCCGG GTGCATGAGC GCACCCGCGA GCTCGAGGCC TCCGAGCAAC GCTACCGCAA CCTCATCGAG GACCTCCCCG AGATGGCGTT CGCGCTGCAC GCCTCGGGGC GCCTGCTGCA CCTGAACAAG GCGGCGCGCC TGCGCCTCGG GGTGAGCGAG GCGCGGCTGC ACGCGCTGCG CCTGAAGGAC CTCGTGCTGC CCGACTACAC CGCGCGCGCC ACCGCCTTCC TCAAGGAGAT CGCGGCACGC CGCAGCACCA CCACGCTCGA CCTGGTGATG CGCGACGCCG CAGGCCTGCC CTTCCCGGTA CGGCTGCAGG TCGAGCCCGT GCTCAAGGAC GATCGCATCG TGGGCTACAG CGGGCTGGCG CTCGACATCA GCGCCCAGCT CGAACGCGAG CGCGAGCTCA CCGAGTACGC GCGCCGCATC GAGTCGCGCG AGCAGCAGAT CCAGGACGTG ATCAACGAGG CGGTCTATAT CCTCGACGGC CAGGGCCGGC TGAGCTTCGT CAACACCCGC ATGGCCGAGC TGCTCGGTGC CGCACCCGAG GCGGCGATCG GGCACGCCGT CGGCGACTTC ATGCCGGCCT CTTCGGCCGC ACGCATCGAG CGCGATTTTG CCCGCCGCAT CGCCGGCGAG GCCGGCCGGC CCTTCGAGAT CGCGCTCTCC GGACGCGAGG GCAGCAGCGT GGTGCTGGAG GTCAACTCCA CCCTCTTCGT CGGCCGCGAC GTCGCCGAGG GCGTGATCGG CGTGGCGCGC GACATCACCG CGCGCCGGCA GATGGAGCGC GAGCTCGCGC AGGCCAACCG CCTGAGCGCG CTCGGCCAGT TCGCCTCGGG CGTGGCGCAC GAGATCAACA ACCCGCTCGG CCTGGTGTCC GGCTATGCCG AGGAACTCCA GGCCCTGCTC GACCAATGCA CGCCGCTCGG CGACGACCGG CGGCTCGCGC AGCTGCGCCG CGGACTGGCC ACCATCCAGC AGCAGGCCCA CCGCTGCAAG GCGATCACCG CCAACCTGCT CGCCTTCTCG CGCCGCCAGA GCGCCAGCCT GGAGACGGTG GACGTGGGCC TCTTCGTCGC CGAACGCCTG GCCTTCTTCA ACGACGCCGG CCTCACCCGC GGCGTCGAGC TCGCGGTCGA CATCGCACCG GGTCTGCCTC AGGCCGGCAC CAGCCCGGCC 
CTGCTCGAAC AGGTGCTGCA AAACCTGATC AAGAACGCCT GCGACGCGAT GAACGGCAGC GGGCGCATCG ACATCGCCGC ACGCGCCGTC GGCGAAGGCA TCGAGATCGA GATCGGCGAC AGCGGGCCGG GTTTCGCGCC GGGGGTCGCC GAACGCGTGT TCGATCCCTT CTTCACCACC AAGCCACCGG GCAAGGGCAC CGGGCTGGGC CTGTCGATCT GCTACGCGAT CATCAACGAG CTGGGTGGCC GCATCGCCTG CGGCAACCGC CCCCAGGGCG GGGCCTGGTT CCGTCTGGTG CTGCCGCTTG CGGAAGACGG ACCGGGAGCC GCGAAGCATG ACTGA
|
Protein sequence | MRSDELDFHA LIDFMPDSGA IRFCGRRAIL VHADSMGALR KELVETLGSE IAKVVLTRYG FSCGYEDAEL LAAYMHPDSL EEFILGGPRT HMFSGIAEVA PDALELDHAS GRYRMSGRWR HSYEAEQHLR LFGRADEPVC WTLTGYASGF ASRVLQADMI CVETACEGRG DPHCAWTLMN ASDCAPELAG LREYFKPLNL RDRLNLLESR VHERTRELEA SEQRYRNLIE DLPEMAFALH ASGRLLHLNK AARLRLGVSE ARLHALRLKD LVLPDYTARA TAFLKEIAAR RSTTTLDLVM RDAAGLPFPV RLQVEPVLKD DRIVGYSGLA LDISAQLERE RELTEYARRI ESREQQIQDV INEAVYILDG QGRLSFVNTR MAELLGAAPE AAIGHAVGDF MPASSAARIE RDFARRIAGE AGRPFEIALS GREGSSVVLE VNSTLFVGRD VAEGVIGVAR DITARRQMER ELAQANRLSA LGQFASGVAH EINNPLGLVS GYAEELQALL DQCTPLGDDR RLAQLRRGLA TIQQQAHRCK AITANLLAFS RRQSASLETV DVGLFVAERL AFFNDAGLTR GVELAVDIAP GLPQAGTSPA LLEQVLQNLI KNACDAMNGS GRIDIAARAV GEGIEIEIGD SGPGFAPGVA ERVFDPFFTT KPPGKGTGLG LSICYAIINE LGGRIACGNR PQGGAWFRLV LPLAEDGPGA AKHD
|
| |
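As a sanity check on the two sequence fields, the first and last codons of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch. The `translate` helper is hypothetical, and it relies on translation table 11 assigning the same amino acids as the standard genetic code (the two tables differ in which start codons they permit). Only short fragments copied from the record are used.

```python
# Codon table built in the conventional TCAG order; the amino acid
# assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Whitespace (such as the 10-base grouping used in the record) is
    removed first. Any trailing partial codon is ignored.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 15 bases of the gene sequence, as shown above.
GENE_START = "ATGCGTTCGG ACGAGCTCGA CTTCCACGCG"
GENE_END = "GCGAAGCATGACTGA"

if __name__ == "__main__":
    print(translate(GENE_START))  # compare with the protein's first 10 residues
    print(translate(GENE_END))    # compare with the protein's last 4 residues
```

The same helper could be applied to the full gene sequence once the two lines it is split across are joined. For routine work, an established library such as Biopython is likely a better choice than a hand-built table.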