Gene Information |
Locus tag | Tmz1t_3153 |
Symbol | |
ID | 7874295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3412440 |
End bp | 3414275 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643700083 |
Product | putative transcriptional regulator |
Protein accession | YP_002890127 |
Protein GI | 237653813 |
COG category | [K] Transcription |
COG ID | [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen |
TIGRFAM ID | |
|
|
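The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; the record does not state either explicitly. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3412440
end_bp = 3414275
gene_length_bp = 1836
protein_length_aa = 611

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1
assert span == gene_length_bp

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)
assert remainder == 0

# Assumption: the last codon is a stop codon and is not counted in the protein.
assert codons - 1 == protein_length_aa

print(span, codons, codons - 1)  # 1836 612 611
```

Under these assumptions the three fields agree: 1836 bp is 612 codons, which is 611 residues plus one stop codon.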
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.880898 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTAGCG CGCAGGAACT GCTCGACGAG CTCAATGCCA GCGACGAGTC GCCGCGCATC GAGGCCAAGC GTGCGCGGGA GGTCGGCAAG TCGGTGCTGG AGACCGTCAT CGCCTTTGCC AACGAGCCCG GCATGGACGG CGGCCACTTG CTGCTCGGCG TCGATTGGTC GATCAACGAC AAGGGCGACA CCGTTTATCG CCCCGAGGGT GTGCCCGACC CCGACAAGCT GCAGCAGGAT CTGGCCTCGC AATGCGCGAG CATGCTCAGT TTTCCATTGC GTCCGGAAAT CAGCGTCGAG CGTATCGACG GCAAGACCCT GGTCGTGGTG TATGTGGCCG AGGTGGATAG CGGGCACAAG CCGGTCTATC TCAAGGCCAC CGGCCTGCCG CGCGGAGCCT TCCGCCGCAT CGGATCGACC GACCAGCGCT GCACCGACGA GGATCTGTGG GTGCTGCGCG GCGACACCCG CCCGCAGAAG GGGCCGGACC AGAGCATTCT CACCGATGCA CGCCTGGACG ATTTCGACCC CGCCGCGCTG GCCGAATACC GCCGCGTGCG CAGCCGCTTG AACGCCAGCG CCGAGGAGCT CGGTTACGGT GACGACGATC TGCTCGAGGC GCTCGGCGCG GTGCGCCGCG TCGATGGCGA GCCGCGCCCC ACGTTGGCCG GCATCCTGCT GTTCGGCAAA CCGATGGCGC TGCGCCGCAT GCTGCCGATG GTCAAGATCG ACTACATCCG CGTGCCGGGC ATCGAGTGGA TGGAGGACCC GCACGAGCGC TTCCAGTCCA TCGAGATCCG CAAGCCGCTG CTGCTGGCGC TGCCCCTGGC CGAAGCCAGC ATCATCGACG AGCTGCCCAA GGGTTTTCAT CTGCCCGAGG GCGAACTGCA CAGCGTCCAG GAGCCGATCG TCCCGCGCAA GGTGATCCGC GAGGCGCTGG CCAATGCGGT GATGCACCGC AGCTACACGC AGCACAGCCC GATCCAGATC ATCCGCTACA GCAACCGCAT CGAGATCCGC AACGTCGGCC ACTCGCTCAA GCCCGTGGCC GAGCTTGGCA TTCCGGGCTC ACGGCTGCGC AACCCCACCC TGTCCGCCGT GCTCCACGAC TTGAACCTGG CCGAAGCCAA AGGCACCGGC ATCCGCAGCA TGCGCAGGCT GGCCGCCGAG GCGGGACTGA CGCTGCCCGA GTTTCACTCC AGCCGCGAGT CGGACGAGTT CCGCGTCACG CTTTTCCTGC ACAACCTGCT CACCGAGGAC GACCACGCCT GGCTGCGCTC GCTGAGCAGC GAGCCGCTGG ATGCCGACGA AACCAAGGTG CTGATTTACG CTCGCGCCAC CGGTGCGGTG GACAACACGG CGTGCCGCGA CTTCAGCGGG CTGGACACGC TGACCGCCAG CCGCGTGCTG CGCCGCCTGC GAGACAAGGG TTTGCTGGAA AAACATGGAG GCGGCAGCCA CACGTATTAC GAACTGGCCA GCCCAACAAT CCCCACGCCG CTTGCAATCC ACCCAAGCTC AAGTGCGCCA GCAGGGGAGG CTTGCAACCC AAAGCATGCA ACCTTGCCTC TCGAGCTTGC AACCTTGCTT GCAACCCTAC AGGGCCGCGT CAGCACGGAA GCCTTGCGTG GAGGCATTGT GCGCCTGTGC GCATGGCAGG CTTTGGGCGT GGACCAGCTT GCAAGTTTTC TGAACAAAGA CCGGCACTAC TTGCGCAACA AGCACCTGAT TCCAATGGTG CGAGAGGGGC AGCTGCGCTT TCGCTACCCC GAAAGCGCTA AACACCCGCA CCAGGCCTAT 
GTCGCCGCCG GCGCGGAGGA CAGGAACAAT GGCTGA
|
Protein sequence | MRSAQELLDE LNASDESPRI EAKRAREVGK SVLETVIAFA NEPGMDGGHL LLGVDWSIND KGDTVYRPEG VPDPDKLQQD LASQCASMLS FPLRPEISVE RIDGKTLVVV YVAEVDSGHK PVYLKATGLP RGAFRRIGST DQRCTDEDLW VLRGDTRPQK GPDQSILTDA RLDDFDPAAL AEYRRVRSRL NASAEELGYG DDDLLEALGA VRRVDGEPRP TLAGILLFGK PMALRRMLPM VKIDYIRVPG IEWMEDPHER FQSIEIRKPL LLALPLAEAS IIDELPKGFH LPEGELHSVQ EPIVPRKVIR EALANAVMHR SYTQHSPIQI IRYSNRIEIR NVGHSLKPVA ELGIPGSRLR NPTLSAVLHD LNLAEAKGTG IRSMRRLAAE AGLTLPEFHS SRESDEFRVT LFLHNLLTED DHAWLRSLSS EPLDADETKV LIYARATGAV DNTACRDFSG LDTLTASRVL RRLRDKGLLE KHGGGSHTYY ELASPTIPTP LAIHPSSSAP AGEACNPKHA TLPLELATLL ATLQGRVSTE ALRGGIVRLC AWQALGVDQL ASFLNKDRHY LRNKHLIPMV REGQLRFRYP ESAKHPHQAY VAAGAEDRNN G
|
| |
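The gene and protein sequences can be spot-checked against each other by translating the ends of the gene sequence. The sketch below is a hand-rolled illustration, not the tool that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons, which this sketch does not model). `translate`, `head` and `tail` are hypothetical names; `head` and `tail` are the first 30 and last 36 nucleotides copied from the Gene sequence field.

```python
from itertools import product

# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 36 nt of the gene sequence shown above.
head = "ATGCGTAGCG CGCAGGAACT GCTCGACGAG"
tail = "GTCGCCGCCG GCGCGGAGGA CAGGAACAAT GGCTGA"

print(translate(head))  # MRSAQELLDE
print(translate(tail))  # VAAGAEDRNNG*
```

Both results match the start (`MRSAQELLDE`) and end (`VAAGAEDRNN G`) of the protein sequence above, with the terminal `TGA` read as the stop codon. Because the gene sequence begins with `ATG` although the Strand field is `-`, the sequence appears to be given in coding orientation, so no reverse complement is applied here.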