Gene Tmz1t_3092 details

Gene Information

Locus tag: Tmz1t_3092
Symbol:
ID: 7874562
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3345823
End bp: 3347283
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 66%
IMG OID: 643700015
Product: 2-hydroxymuconic semialdehyde dehydrogenase
Protein accession: YP_002890067
Protein GI: 237653753
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase
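
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts a terminal stop codon that is not part of the protein; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(3345823, 3347283)
    print(length, "bp")                   # 1461 bp
    print(protein_length(length), "aa")   # 486 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.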


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.169742
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGCTG ACAAGATCCT CAACTTCATC GACGGCGAAT ACGTCGCCAC CGACAAGTGG 
TACGAGAACC GCAACCCGAT CAACAACAAG GTGATCGGCA TGGTCGCCGA AGCCGGCGAG
AAGGAAGTCG ACGCCGCCGT CAAGGCCGCC AAGGCCGCGC TGAAGGGCCC CTGGGGTTCG
ATGTCGCTGC AGAAGCGCAT CGAGTTGCTC GAGGCCCTGG TGGTCGAGAT CAACAACCGC
TTCGACGACT TCCTCGAGGC CGAGTGCGCC GACACCGGCA AGCCCAAGAG CATGGCCTCG
CATGTGGACA TCCCGCGCGG TGCGGCCAAC TTCAAGGTCT TCGCGGACAT GGTCAAGAAC
GTTCCGACCG AGTTCTTCGA GATGACGACG CCCGACGGCG GCAAGGCGAT CAACTACGGC
TATCGCCGTC CGGTCGGCGT GGTCGGCGTG ATCTGCCCGT GGAACCTGCC GCTGCTGCTG
ATGACCTGGA AGGTCGGCCC GGCGCTGGCC TGCGGCAACA CCGTGGTCGT CAAGCCCTCG
GAAGACACCC CGCGCACCGC CGCGCTGCTC GGCGAGGTGA TGAACAAGGT CGGCATCCCC
AAGGGGGTCT ACAACGTGGT CAACGGCTTC GGCGCCAACT CCGCCGGCGC CTTCCTGACC
GCCCACCCGG ACGTCGACGC GCTCACCTTC ACCGGCGAGA CCCGCACCGG CGAGGTCATC
ATGAAGGCGG CGGCCAACGG CTCGCGCCCG GTGTCGCTGG AAATGGGCGG CAAGAACGCC
GCGATCGTGT TCGCCGACTG CGACTTCGAC AAGGCCATCG AAGGCACCCT GCGCTCCGTC
TTCCTGAACT GCGGCCAGGT CTGCCTGGGC ACCGAGCGCG TCTATGTCGA GCGCCCGATC
TTCGACAAGT TCGTCGCCGC CCTGAAGGCC GGCGCCGAAG GCATGAAGAT CGGCGTGCCG
GACGATCCGG CCGCCAACTT CGGCCCGCTG GTCAGCAAGA AGCATCAGGA GAAGGTGCTG
TCCTACTACA AGGTCGCCGT GGAAGAAGGT GCGACCGTGG TGACCGGTGG CGGCGTGCCC
CAGATGCCGG GCGAACTCGC CGATGGCTGC TGGGTGCAGC CGACCATCTG GACCGGCCTG
CCGGAAACCG CCCGCGTGAT CAAGGAAGAG ATCTTCGGGC CGTGCTGCCA CATCGCCCCC
TTCGACACCG AGGAGGAAGT GCTGGAGAAG GCCAACGACA ACAAGTACGG CCTGGCCTGC
GCGATCTGGA CGCAGGACGT CTCGCGCGCC CACCGCGTCG CGCAGAAGAT GGAAGTGGGC
ATCTCGTGGG TGAACAGCTG GTTCCTGCGC GACCTGCGCA CCCCCTTCGG TGGCTCCAAG
CAGTCGGGCA TCGGCCGTGA AGGCGGCGTG CACTCGCTCG AGTTCTACAC CGACCTCAAG
AACGTCTGCA TCAAGCTGTA A
 
Protein sequence
MIADKILNFI DGEYVATDKW YENRNPINNK VIGMVAEAGE KEVDAAVKAA KAALKGPWGS 
MSLQKRIELL EALVVEINNR FDDFLEAECA DTGKPKSMAS HVDIPRGAAN FKVFADMVKN
VPTEFFEMTT PDGGKAINYG YRRPVGVVGV ICPWNLPLLL MTWKVGPALA CGNTVVVKPS
EDTPRTAALL GEVMNKVGIP KGVYNVVNGF GANSAGAFLT AHPDVDALTF TGETRTGEVI
MKAAANGSRP VSLEMGGKNA AIVFADCDFD KAIEGTLRSV FLNCGQVCLG TERVYVERPI
FDKFVAALKA GAEGMKIGVP DDPAANFGPL VSKKHQEKVL SYYKVAVEEG ATVVTGGGVP
QMPGELADGC WVQPTIWTGL PETARVIKEE IFGPCCHIAP FDTEEEVLEK ANDNKYGLAC
AIWTQDVSRA HRVAQKMEVG ISWVNSWFLR DLRTPFGGSK QSGIGREGGV HSLEFYTDLK
NVCIKL
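
The sequences above are laid out in space-separated blocks of ten characters. A minimal sketch of how such a block could be cleaned up, translated, and checked against the protein sequence is shown below. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons); the helper names are hypothetical.

```python
BASES = "TCAG"
# Standard codon table in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; trailing partial codons are ignored."""
    return "".join(
        CODON_TABLE[dna[pos:pos + 3]] for pos in range(0, len(dna) - 2, 3)
    )


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence above.
    head = clean_sequence("ATGATCGCTG ACAAGATCCT CAACTTCATC")
    print(translate(head))  # MIADKILNFI
```

Applied to the full gene sequence, the translation minus its trailing `*` can be compared with the protein sequence listed above, and `gc_percent` can be compared with the GC content field.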