Gene Tmz1t_1839 details


Gene Information

Locus tag: Tmz1t_1839
Symbol:
ID: 7084262
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 2060650
End bp: 2062101
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 74%
IMG OID: 643698862
Product: Aldehyde Dehydrogenase
Protein accession: YP_002355487
Protein GI: 217970253
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
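
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; neither convention is stated in the record. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2060650
END_BP = 2062101
GENE_LENGTH_BP = 1452
PROTEIN_LENGTH_AA = 483


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a whole number of codons")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions, 2062101 - 2060650 + 1 gives 1452 bp, and 1452 / 3 - 1 gives 483 aa, matching the listed values.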


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.613277
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGAGA TCCACCTGCT GATCGGCGGC GAACGTCGCC GCGCCACGGA CGGCGCCAGC 
TTCGAACGCC GCAACCCGCT CGACCACGGC GTCGCCACGC GCGCTCCCGC GGCCACCGCT
GCGGACGCGG TGGCCGCGGT CGAGGCCGCC GCCGCGGCCT TCCCCGCGTG GGCCGCCACC
GGCCCCGGCG AGCGCCGCGC GCTGCTGATG AAGGCGGCGC ACGCGCTCGA GGCGCGCGCC
GAGGCCTTTA CCGCGGCGAT GGCCGCCGAG ACCGGCGCCT CGGCGATCTG GGCCGGCTTC
AACGTGCACC TGGCGGCCGG CATGCTGCTC GAGGCGGCGG CGCTGACCAC CCGCATCGAG
GGCAGCATCC TGCCCTCGGA CGTGCCCGGC TCGGTGGCGA TGGCGGTGCG CCAGCCCGCT
GGCGTGGTGC TCGGCATCGC GCCCTGGAAC GCCCCGGTGA TCCTGGGCGT GCGCGCGATC
GCCACCCCGC TCGCCTGCGG CAACACCGTG GTGCTCAAGG GCTCGGAGCT GTGCCCGGCC
ACCCACGGCC TGATCATCGA GGCGCTGCAG GACGCCGGGC TGCCGGCGGG CGTGGTGAAC
TTCGTCACCA ACGCCCCGGC CGACGCCGGC GCGGTGGTCG AGGCCATGGT CGCGCACCCG
GCGCTGCGCC GGGTGAACTT CACCGGCTCG ACCCACGTCG GCCGCCTGAT CGCGCAGACC
TGCGCCCGGT ACCTCAAGCC GGCGGTGCTC GAGCTCGGCG GCAAGGCGCC TTTCGTCGTG
CTCGACGACG CCGACCTCGA CGCCGCGGTG GCGGCGGCCA CCTTCGGCGC CTTCGCCAAC
TCGGGCCAGA TCTGCATGTC CACCGAGCGC ATCGTCGTCG ATGCGGCGGT GGCCGACGAC
TTCGTCGCCC GCCTGGCGGC GCGCGCCCGC GCCCTGCCCC TGGGCGACCC GCGCAAGGGC
CCGGTGGTGC TCGGCTCGGT GGTCGACCAG CGCACGGTCG AGCGCTGCAA CGCGCTGATC
GACGATGCGC TCGCCAAGGG GGCGACGCTG GTGTGCGGCG GCAAGGCCGA CAGCACGCTG
ATGCCCGCCA CGCTGCTCGA CCACGTCAGC GCGCAGATGC GCATCTACCA CGAGGAGACC
TTCGCCCCGG TGAAGGCGAT CGTGCGCGTC CAGGGCACCG AGGCCGCCAT CGCCTGCGCC
AACGACAACC CCTTCGGCCT CGCCGCCGCA GTGTTCGGGC GCGACCTCGC GCGCGCCTGG
CAGGTGGCCG GGCGCATCCA GTCGGGCATC TGCCACATCA ACGGGCCCAC GGTGCACGAC
GAAGCGCAGA TGCCCTTCGG CGGCGTGAAG GATTCGGGCT GGGGACGCTT CGGCGGCCAG
GCCGGGATCG AAGCCTTCAC CGAGCTGCGC TGGATCACCC TGCAGACGAG CCCGCGCCAC
TACCCGTTCT GA
 
Protein sequence
MNEIHLLIGG ERRRATDGAS FERRNPLDHG VATRAPAATA ADAVAAVEAA AAAFPAWAAT 
GPGERRALLM KAAHALEARA EAFTAAMAAE TGASAIWAGF NVHLAAGMLL EAAALTTRIE
GSILPSDVPG SVAMAVRQPA GVVLGIAPWN APVILGVRAI ATPLACGNTV VLKGSELCPA
THGLIIEALQ DAGLPAGVVN FVTNAPADAG AVVEAMVAHP ALRRVNFTGS THVGRLIAQT
CARYLKPAVL ELGGKAPFVV LDDADLDAAV AAATFGAFAN SGQICMSTER IVVDAAVADD
FVARLAARAR ALPLGDPRKG PVVLGSVVDQ RTVERCNALI DDALAKGATL VCGGKADSTL
MPATLLDHVS AQMRIYHEET FAPVKAIVRV QGTEAAIACA NDNPFGLAAA VFGRDLARAW
QVAGRIQSGI CHINGPTVHD EAQMPFGGVK DSGWGRFGGQ AGIEAFTELR WITLQTSPRH
YPF
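
The gene and protein sequences above can be cross-checked with a short script. This is a minimal sketch, not the method used to produce the record: it strips the block spacing, translates codon by codon, and computes GC content. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to sense codons as the standard code (the two tables differ in their permitted start codons), and it does not special-case alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Sense-codon assignments
# are the same in the standard code and in translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, as printed in the record.
first_line = "ATGAACGAGA TCCACCTGCT GATCGGCGGC GAACGTCGCC GCGCCACGGA CGGCGCCAGC"
last_line = "TACCCGTTCT GA"

print(translate(first_line))  # MNEIHLLIGGERRRATDGAS
print(translate(last_line))   # YPF
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows the 483 aa protein and the listed GC content to be compared against the record.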