Gene Information |
Locus tag | Tmz1t_1910 |
Symbol | |
ID | 7085679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2154840 |
End bp | 2156324 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643698935 |
Product | transcriptional regulator, GntR family with aminotransferase domain |
Protein accession | YP_002355557 |
Protein GI | 217970323 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
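
The coordinate and length fields above are mutually consistent, and the relationship can be sketched in a few lines. This is a minimal illustration, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2154840, 2156324)
print(length, protein_length(length))  # prints "1485 494"
```

Under these assumptions the result matches the Gene Length (1485 bp) and Protein Length (494 aa) fields.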
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.279247 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGAGCTCG ACTGGCTGCT CGCCCCACCC TTGCCCGCGG CGGTGCCGCG CCAGCGCGTG CTCTATCTCC GCCTGCGCGA GGCCATCCTG TCGGGCCGGC TGGCGGCCGA CACGCGCCTG CCCGCCAGCC GCAGTCTCGC GGCGACGCTC GGCATCGCAC GCAACACGGT GCTGTTCGCC TATGAGCAGC TCGTCGCCGA AGGCTGCCTG GTGGCGGATC GCCACGGGAC GCGGGTGGCA CAACTGCCCG CCGCACCGGG CGCCTCGCAG CCCGTGTCCC GCACACAGGG GCCGCCGCCG GTGCTGTCCG CGCGCGCGGC TGCGGCGCTG CAACGCGAGC CGGCGCGCGA CGCCGAAGCG CTGCCGTTCT CGCCCGGCGT GCCGGACTTC GGTGCCTTCC CCTTCCGTGC CTGGCGCGCC TGCCTCGAGC GCGCGTGGCG CGACGCCGGC TGGCGTCAGC TCGGTTACGC CGCGCACGGT GGCGATCCGA GCCTGCGTGC GGCGCTCGCG GCCCACCTCG GCAGCGTACG CGGCCTCGCG GTGGATGCCG CGCAGCTCCT CATCACCAGC GGCACGCAGG CGGGGCTCGA CCTGTGCGCG CGCCTGCTCG CCGATCACGG CGACACGGTC TGGGTGGAGA ACCCGGGCTA CCTCGCCGCC CGCGTCGCCT TCGGGCTCGC CGGCCTGCAG GTGCACGACG TCATGGTCGA TGACGAGGGG CTGGCCCCCG GCGCGGAAGA CTGGCTGCGC CATCCGCCGC GGCTGATCAT GACCACCCCC TCGCACCAGT ACCCCAGCGG GCGGGTGATG TCGCTGCCGC GCCGGTTCGC GCTCATCGAG CGTGCCCGTG CGGCTGGCAC CTGGATCGTC GAGGACGACT ACGACAGCGA ATTCCGCCGT GCCGGTCCGG CACCGCCCGC GCTGTTCGGC CTGCAGCCCG GCGCCACGGT CGTGTACGCC GGCACCTTCA GCAAGACGCT GTATCCGGGC CTGCGCCTGG GCTATCTCGT CCTGCCGCGC ACCATCGCGG CGGACTTCAT CCACGCTGCG GCGCGCGCCA CGCGCGCCGG GCAGGGCATC GAGCAGCGCG CGCTCGCCGA CTTCATCGGG CGCGGGCATT ACATCACCCA TCTCCGTCGC ATGCGGGCGC GCTACAACGC CCGCCAGGCT GCGCTGCGTG CCGCCCTGCG GCAGGCCTTC GGTCCCGGGC TGCTGCTGTC GGGCGGAGAA GCCGGGCTGC ATCTGGTGAT GTGGCTGCCC GACGAGCTGC CCGATGTCGC GGTGGCGCAG CGCGCCGCGC AAGTGGGGCT GGGCGTGCGC GCGCTGTCCG CCTACGCGCG CCCGCCGGTG CGCTGCAACG GGCTGGTGCT CGGCTATGGG AACCTGGACG AGGGCGCGGT CGAAGGGGCG GTGGCGCGGC TGAAGCGGGC GGTGGAGGAT GCGTCTGCAG GGCCGAGGTC GCCGATCGGG GCTGCCCGAT GCTGA
Protein sequence | MELDWLLAPP LPAAVPRQRV LYLRLREAIL SGRLAADTRL PASRSLAATL GIARNTVLFA YEQLVAEGCL VADRHGTRVA QLPAAPGASQ PVSRTQGPPP VLSARAAAAL QREPARDAEA LPFSPGVPDF GAFPFRAWRA CLERAWRDAG WRQLGYAAHG GDPSLRAALA AHLGSVRGLA VDAAQLLITS GTQAGLDLCA RLLADHGDTV WVENPGYLAA RVAFGLAGLQ VHDVMVDDEG LAPGAEDWLR HPPRLIMTTP SHQYPSGRVM SLPRRFALIE RARAAGTWIV EDDYDSEFRR AGPAPPALFG LQPGATVVYA GTFSKTLYPG LRLGYLVLPR TIAADFIHAA ARATRAGQGI EQRALADFIG RGHYITHLRR MRARYNARQA ALRAALRQAF GPGLLLSGGE AGLHLVMWLP DELPDVAVAQ RAAQVGLGVR ALSAYARPPV RCNGLVLGYG NLDEGAVEGA VARLKRAVED ASAGPRSPIG AARC
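
A GC content figure such as the one in the Gene Information table can in principle be recomputed from the gene sequence. The sketch below is one plausible way to do that, not the method the database itself uses. It assumes the spaces in the displayed sequence are only visual grouping and should be ignored. The function name `gc_fraction` is hypothetical.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for display grouping."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base groups of the gene sequence, used only to keep the example short.
prefix = "ATGGAGCTCG ACTGGCTGCT"
print(f"{gc_fraction(prefix):.0%}")
```

Passing the full gene sequence string instead of `prefix` gives the value for the whole gene.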