Gene Tmz1t_3153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3153 
Symbol 
ID7874295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3412440 
End bp3414275 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content66% 
IMG OID643700083 
Productputative transcriptional regulator 
Protein accessionYP_002890127 
Protein GI237653813 
COG category[K] Transcription 
COG ID[COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.880898 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTAGCG CGCAGGAACT GCTCGACGAG CTCAATGCCA GCGACGAGTC GCCGCGCATC 
GAGGCCAAGC GTGCGCGGGA GGTCGGCAAG TCGGTGCTGG AGACCGTCAT CGCCTTTGCC
AACGAGCCCG GCATGGACGG CGGCCACTTG CTGCTCGGCG TCGATTGGTC GATCAACGAC
AAGGGCGACA CCGTTTATCG CCCCGAGGGT GTGCCCGACC CCGACAAGCT GCAGCAGGAT
CTGGCCTCGC AATGCGCGAG CATGCTCAGT TTTCCATTGC GTCCGGAAAT CAGCGTCGAG
CGTATCGACG GCAAGACCCT GGTCGTGGTG TATGTGGCCG AGGTGGATAG CGGGCACAAG
CCGGTCTATC TCAAGGCCAC CGGCCTGCCG CGCGGAGCCT TCCGCCGCAT CGGATCGACC
GACCAGCGCT GCACCGACGA GGATCTGTGG GTGCTGCGCG GCGACACCCG CCCGCAGAAG
GGGCCGGACC AGAGCATTCT CACCGATGCA CGCCTGGACG ATTTCGACCC CGCCGCGCTG
GCCGAATACC GCCGCGTGCG CAGCCGCTTG AACGCCAGCG CCGAGGAGCT CGGTTACGGT
GACGACGATC TGCTCGAGGC GCTCGGCGCG GTGCGCCGCG TCGATGGCGA GCCGCGCCCC
ACGTTGGCCG GCATCCTGCT GTTCGGCAAA CCGATGGCGC TGCGCCGCAT GCTGCCGATG
GTCAAGATCG ACTACATCCG CGTGCCGGGC ATCGAGTGGA TGGAGGACCC GCACGAGCGC
TTCCAGTCCA TCGAGATCCG CAAGCCGCTG CTGCTGGCGC TGCCCCTGGC CGAAGCCAGC
ATCATCGACG AGCTGCCCAA GGGTTTTCAT CTGCCCGAGG GCGAACTGCA CAGCGTCCAG
GAGCCGATCG TCCCGCGCAA GGTGATCCGC GAGGCGCTGG CCAATGCGGT GATGCACCGC
AGCTACACGC AGCACAGCCC GATCCAGATC ATCCGCTACA GCAACCGCAT CGAGATCCGC
AACGTCGGCC ACTCGCTCAA GCCCGTGGCC GAGCTTGGCA TTCCGGGCTC ACGGCTGCGC
AACCCCACCC TGTCCGCCGT GCTCCACGAC TTGAACCTGG CCGAAGCCAA AGGCACCGGC
ATCCGCAGCA TGCGCAGGCT GGCCGCCGAG GCGGGACTGA CGCTGCCCGA GTTTCACTCC
AGCCGCGAGT CGGACGAGTT CCGCGTCACG CTTTTCCTGC ACAACCTGCT CACCGAGGAC
GACCACGCCT GGCTGCGCTC GCTGAGCAGC GAGCCGCTGG ATGCCGACGA AACCAAGGTG
CTGATTTACG CTCGCGCCAC CGGTGCGGTG GACAACACGG CGTGCCGCGA CTTCAGCGGG
CTGGACACGC TGACCGCCAG CCGCGTGCTG CGCCGCCTGC GAGACAAGGG TTTGCTGGAA
AAACATGGAG GCGGCAGCCA CACGTATTAC GAACTGGCCA GCCCAACAAT CCCCACGCCG
CTTGCAATCC ACCCAAGCTC AAGTGCGCCA GCAGGGGAGG CTTGCAACCC AAAGCATGCA
ACCTTGCCTC TCGAGCTTGC AACCTTGCTT GCAACCCTAC AGGGCCGCGT CAGCACGGAA
GCCTTGCGTG GAGGCATTGT GCGCCTGTGC GCATGGCAGG CTTTGGGCGT GGACCAGCTT
GCAAGTTTTC TGAACAAAGA CCGGCACTAC TTGCGCAACA AGCACCTGAT TCCAATGGTG
CGAGAGGGGC AGCTGCGCTT TCGCTACCCC GAAAGCGCTA AACACCCGCA CCAGGCCTAT
GTCGCCGCCG GCGCGGAGGA CAGGAACAAT GGCTGA
 
Protein sequence
MRSAQELLDE LNASDESPRI EAKRAREVGK SVLETVIAFA NEPGMDGGHL LLGVDWSIND 
KGDTVYRPEG VPDPDKLQQD LASQCASMLS FPLRPEISVE RIDGKTLVVV YVAEVDSGHK
PVYLKATGLP RGAFRRIGST DQRCTDEDLW VLRGDTRPQK GPDQSILTDA RLDDFDPAAL
AEYRRVRSRL NASAEELGYG DDDLLEALGA VRRVDGEPRP TLAGILLFGK PMALRRMLPM
VKIDYIRVPG IEWMEDPHER FQSIEIRKPL LLALPLAEAS IIDELPKGFH LPEGELHSVQ
EPIVPRKVIR EALANAVMHR SYTQHSPIQI IRYSNRIEIR NVGHSLKPVA ELGIPGSRLR
NPTLSAVLHD LNLAEAKGTG IRSMRRLAAE AGLTLPEFHS SRESDEFRVT LFLHNLLTED
DHAWLRSLSS EPLDADETKV LIYARATGAV DNTACRDFSG LDTLTASRVL RRLRDKGLLE
KHGGGSHTYY ELASPTIPTP LAIHPSSSAP AGEACNPKHA TLPLELATLL ATLQGRVSTE
ALRGGIVRLC AWQALGVDQL ASFLNKDRHY LRNKHLIPMV REGQLRFRYP ESAKHPHQAY
VAAGAEDRNN G