Gene Tmz1t_4046 details


Gene Information

Locus tag: Tmz1t_4046
Symbol:
ID: 7873275
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 4442919
End bp: 4444322
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 60%
IMG OID: 643700979
Product: Aldehyde Dehydrogenase
Protein accession: YP_002891002
Protein GI: 237654688
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
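
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the recorded gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4442919
end_bp = 4444322

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop
# codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, "bp")     # 1404 bp
print(remainder)             # 0, i.e. the CDS is a whole number of codons
print(protein_length, "aa")  # 467 aa
```

Both results agree with the Gene Length and Protein Length fields listed above.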


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACAA AAATGCAGCC TTTCACCCCG CCTGCCATAG GCCATCTGTT TAGTGGAGCC 
ACTGCAGATC AGTACCGCGC GCACTATGAC CCGGGCCGCC TGGACTCGGT AGCGTCGCGA
ATTGCTATCG GTACGGCGGC AGATGTAGAC CTGGCAGTCC AACAGGCCTA TCGAGCCTTC
CCGGGCTGGC GAGACACGCC GGTGGCTCAG CGTGCGGCAT CACTTGCGAA GGCTGCTGCG
CTTGTCCAAG CCAACTCGGC CGAACTTGGC CCCCTCCTTG TTCGCGAACA CGGAGGCGTG
TCATGGGAGG CTCAGGCCGA TTTTGCGCTG GGATACGGAG TGCTGAGTCA CACCGCCGAT
CTCGCGGAGC GCTTCTTCAA CCCTGTCACG CACGACGAGG AACAGAGCTT CATCAGCATA
GAGAAGGATC CGCGTGGCGT AGTCTCAGCC ATCGTGCCTT GGAACATGCC GGTGGTGCTG
ACGATGATGA AGCTCGCTCC TGCGCTTGCG ACAGGAAACA CGCTGGTCCT CAAGCCATCC
CCCTTCGCAG CAGGAGCACT GACCCTGCTG ATCGAGCGAT TGAGCATGTT TTTCCCCGAA
GGCGTAATCA ACGTAGTCCA AGGGGATGTC GAAGTGGGCC AAGCGCTGAC AACCCACCCC
CTTGTCCGCA AGGTAGCCTT CACCGGCGGA ACAGCGACCG CACGTCACAT CATGACCGGC
GCAGCCAACA CGATAAAAAA CATCACCCTC GAACTCGGTG GTAACGATCC GGCAATCGTC
CTCGACGATG CAGACCTCGA CGCCACGCTG GACCGGATGT TGCCGGGGAT CTTCACACGC
AGCGGCCAGA TTTGTTTCGC GGTAAAGCGC ATCTATGTTC CACGTGCGTC CTACCGCAGG
TTTGTAGACG CTTTGTGCCA ACGCGTATCC GAGTACAAGG TGGGACATGG GTTGAACCCG
GAGGCAACGC TCGGCCCCTT GAACAACAAG GCTCAGTACG AGCGTGTGTG CAAGTTTATC
GATGGGGCTA AGTCCGGCCC CGCAAAGGTG GTGGAGCTCG GCCGCAGGCT CGAGCCTGAC
CAGTGGGACA ATGGCTACTA CGTCCTCCCC CACGTCGTTA GCGACGTGGC GCACGAAGCG
CAGACCACCT CACGTGAGCA ATTCGGCCCA ATCATCCCGG TCGTTGCTTA CGATTCCGAG
GAGCAGGTAC TCGCTTGGGC CAATGACAGC GAGTACGGCT TGGGTTCCTC CGTATGGACA
CGGGATAGTG CGCGTGGACT CGCTTTCGCC CGCAGGATCG AACCCGACTT TGACACTTCC
AAGGCACATT TCCGCCCCAC CTCCAGCAAC GGCGCGGGTT TCCGGGCGAT GATGCGCCAA
AATCCTAGAT ACATGGATTG TTGA
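
A GC content figure such as the one recorded for this gene can be computed directly from a sequence listed in 10-base blocks like the one above. The following is a minimal sketch; `gc_content` is a hypothetical helper name, and the example input is only the first row of the gene sequence, so its result is not expected to equal the whole-gene figure.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base block spacing and line breaks used in this
    record) is removed before counting.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGAACACAA AAATGCAGCC TTTCACCCCG CCTGCCATAG GCCATCTGTT TAGTGGAGCC"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```

Passing the full gene sequence to the same function gives the value for the whole gene, which can then be compared with the GC content field of the record.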
 
Protein sequence
MNTKMQPFTP PAIGHLFSGA TADQYRAHYD PGRLDSVASR IAIGTAADVD LAVQQAYRAF 
PGWRDTPVAQ RAASLAKAAA LVQANSAELG PLLVREHGGV SWEAQADFAL GYGVLSHTAD
LAERFFNPVT HDEEQSFISI EKDPRGVVSA IVPWNMPVVL TMMKLAPALA TGNTLVLKPS
PFAAGALTLL IERLSMFFPE GVINVVQGDV EVGQALTTHP LVRKVAFTGG TATARHIMTG
AANTIKNITL ELGGNDPAIV LDDADLDATL DRMLPGIFTR SGQICFAVKR IYVPRASYRR
FVDALCQRVS EYKVGHGLNP EATLGPLNNK AQYERVCKFI DGAKSGPAKV VELGRRLEPD
QWDNGYYVLP HVVSDVAHEA QTTSREQFGP IIPVVAYDSE EQVLAWANDS EYGLGSSVWT
RDSARGLAFA RRIEPDFDTS KAHFRPTSSN GAGFRAMMRQ NPRYMDC
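
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. Translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acids to codons as the standard code and differs mainly in which codons may act as start codons, so the standard codon assignments are used here. The names `CODON_TABLE` and `translate` are illustrative, and only the first and last rows of the gene sequence are translated rather than the full listing.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the listing above.
# Each full row holds 60 bases, so every row starts in frame.
first_row = "ATGAACACAA AAATGCAGCC TTTCACCCCG CCTGCCATAG GCCATCTGTT TAGTGGAGCC"
last_row = "AATCCTAGAT ACATGGATTG TTGA"

print(translate(first_row))  # MNTKMQPFTPPAIGHLFSGA
print(translate(last_row))   # NPRYMDC
```

The two outputs match the first 20 and the last 7 residues of the protein sequence listed above, and the final TGA codon is the stop codon that is excluded from the 467 aa protein length.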