Gene Information |
Locus tag | Tmz1t_3092 |
Symbol | |
ID | 7874562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3345823 |
End bp | 3347283 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643700015 |
Product | 2-hydroxymuconic semialdehyde dehydrogenase |
Protein accession | YP_002890067 |
Protein GI | 237653753 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase |
|
|
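The coordinates and lengths listed in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both are assumptions that happen to agree with the listed values, and the variable names are illustrative rather than part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3345823
end_bp = 3347283
listed_gene_length = 1461    # bp
listed_protein_length = 486  # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length == listed_gene_length)        # length matches the record
print(remainder == 0)                           # whole number of codons
print(protein_length == listed_protein_length)  # protein length matches
```

Under these assumptions, 1461 bp corresponds to 487 codons, that is 486 amino acids plus one stop codon.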
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.169742 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGCTG ACAAGATCCT CAACTTCATC GACGGCGAAT ACGTCGCCAC CGACAAGTGG TACGAGAACC GCAACCCGAT CAACAACAAG GTGATCGGCA TGGTCGCCGA AGCCGGCGAG AAGGAAGTCG ACGCCGCCGT CAAGGCCGCC AAGGCCGCGC TGAAGGGCCC CTGGGGTTCG ATGTCGCTGC AGAAGCGCAT CGAGTTGCTC GAGGCCCTGG TGGTCGAGAT CAACAACCGC TTCGACGACT TCCTCGAGGC CGAGTGCGCC GACACCGGCA AGCCCAAGAG CATGGCCTCG CATGTGGACA TCCCGCGCGG TGCGGCCAAC TTCAAGGTCT TCGCGGACAT GGTCAAGAAC GTTCCGACCG AGTTCTTCGA GATGACGACG CCCGACGGCG GCAAGGCGAT CAACTACGGC TATCGCCGTC CGGTCGGCGT GGTCGGCGTG ATCTGCCCGT GGAACCTGCC GCTGCTGCTG ATGACCTGGA AGGTCGGCCC GGCGCTGGCC TGCGGCAACA CCGTGGTCGT CAAGCCCTCG GAAGACACCC CGCGCACCGC CGCGCTGCTC GGCGAGGTGA TGAACAAGGT CGGCATCCCC AAGGGGGTCT ACAACGTGGT CAACGGCTTC GGCGCCAACT CCGCCGGCGC CTTCCTGACC GCCCACCCGG ACGTCGACGC GCTCACCTTC ACCGGCGAGA CCCGCACCGG CGAGGTCATC ATGAAGGCGG CGGCCAACGG CTCGCGCCCG GTGTCGCTGG AAATGGGCGG CAAGAACGCC GCGATCGTGT TCGCCGACTG CGACTTCGAC AAGGCCATCG AAGGCACCCT GCGCTCCGTC TTCCTGAACT GCGGCCAGGT CTGCCTGGGC ACCGAGCGCG TCTATGTCGA GCGCCCGATC TTCGACAAGT TCGTCGCCGC CCTGAAGGCC GGCGCCGAAG GCATGAAGAT CGGCGTGCCG GACGATCCGG CCGCCAACTT CGGCCCGCTG GTCAGCAAGA AGCATCAGGA GAAGGTGCTG TCCTACTACA AGGTCGCCGT GGAAGAAGGT GCGACCGTGG TGACCGGTGG CGGCGTGCCC CAGATGCCGG GCGAACTCGC CGATGGCTGC TGGGTGCAGC CGACCATCTG GACCGGCCTG CCGGAAACCG CCCGCGTGAT CAAGGAAGAG ATCTTCGGGC CGTGCTGCCA CATCGCCCCC TTCGACACCG AGGAGGAAGT GCTGGAGAAG GCCAACGACA ACAAGTACGG CCTGGCCTGC GCGATCTGGA CGCAGGACGT CTCGCGCGCC CACCGCGTCG CGCAGAAGAT GGAAGTGGGC ATCTCGTGGG TGAACAGCTG GTTCCTGCGC GACCTGCGCA CCCCCTTCGG TGGCTCCAAG CAGTCGGGCA TCGGCCGTGA AGGCGGCGTG CACTCGCTCG AGTTCTACAC CGACCTCAAG AACGTCTGCA TCAAGCTGTA A
|
Protein sequence | MIADKILNFI DGEYVATDKW YENRNPINNK VIGMVAEAGE KEVDAAVKAA KAALKGPWGS MSLQKRIELL EALVVEINNR FDDFLEAECA DTGKPKSMAS HVDIPRGAAN FKVFADMVKN VPTEFFEMTT PDGGKAINYG YRRPVGVVGV ICPWNLPLLL MTWKVGPALA CGNTVVVKPS EDTPRTAALL GEVMNKVGIP KGVYNVVNGF GANSAGAFLT AHPDVDALTF TGETRTGEVI MKAAANGSRP VSLEMGGKNA AIVFADCDFD KAIEGTLRSV FLNCGQVCLG TERVYVERPI FDKFVAALKA GAEGMKIGVP DDPAANFGPL VSKKHQEKVL SYYKVAVEEG ATVVTGGGVP QMPGELADGC WVQPTIWTGL PETARVIKEE IFGPCCHIAP FDTEEEVLEK ANDNKYGLAC AIWTQDVSRA HRVAQKMEVG ISWVNSWFLR DLRTPFGGSK QSGIGREGGV HSLEFYTDLK NVCIKL
|
| |
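The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the gene sequence is already given in coding orientation (it starts with ATG and ends with TAA, so no reverse complement is applied despite the minus strand) and uses the amino acid assignments of translation table 11, which are the same as those of the standard code. The function names `translate` and `gc_fraction` are hypothetical helpers written for this example, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 12 bases of the gene sequence in the record.
head = "ATGATCGCTG ACAAGATCCT CAACTTCATC"
tail = "ATCAAGCTGTAA"

print(translate(head))  # MIADKILNFI, the first ten residues of the protein
print(translate(tail))  # IKL, the last three residues before the stop codon
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` on the full sequence can be compared with the listed GC content.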