Gene Tmz1t_0766 details


Gene Information

Locus tag: Tmz1t_0766
Symbol:
ID: 7084157
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 850993
End bp: 852495
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 71%
IMG OID: 643697791
Product: Betaine-aldehyde dehydrogenase
Protein accession: YP_002354433
Protein GI: 217969199
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
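
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and do not belong to any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a complete CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(850993, 852495)
print(length)                           # 1503
print(expected_protein_length(length))  # 500
```

Both values agree with the Gene Length and Protein Length fields of this record.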


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGACCGAT ACAAGGGTGA ACTGCTGAGC GACGACACGC GCCGGTTCCT GGCGCAGCCC 
AAGCGCATGG CGATCGGCGG CGAATGGGTG GAGGCCCTCG GCGGTGGCAT GCTCGAGGTG
GTGGATCCCG CCAGCGGGCA GGTGTTCGAT CGCGTGCCGG CGGGCGAGGC GACCGACATC
GACCGCGCGG TGGCGGCGGC GCGGCGCGCC TTCGAGCAGG GCGACTGGCC GCGCATGCGA
CCGGTGGACC GCGAGCGCCT GCTGCTGCGC CTGGCCGAGC TGGTGGAGGC GCATGCGCAG
GAACTGGCCG AGATCGAGGC GCTCGACAAC GGCAAGCCGG TGACCATGGC GCGTGCGGTC
GACGTGGCGC TGGTGGTGGA CTTCCTGCGC TACATGGCCG GCTGGGCGAC CAAGATCGAA
GGCTCGACGA TGGAGGTGTC GGTCCCCCTG GTGCGCGACC GCGAGTTCTT CGGCTACACC
CGGCGCGAGC CGGTGGGCGT GGTGGGGGCG ATCATCCCGT GGAACTTCCC GCTGCTGATG
GTGGCGTGGA AGGCGGGGCC GGCACTCGCC TCCGGGTGCA CGATGGTGCT CAAGCCCGCC
GAGGAGACGC CGCTGTCGGC GCTGCGCTTT GCCGAGCTGG TGCAGCAGGC GGGCTACCCC
GCGGGGGTGT TCAACGTCGT CACCGGCCAC GGCCACTCCG CGGGTGCGGC GCTGGCCGCG
CACAAGGGCG TCGACAAGGT GGCCTTCACC GGCTCGACCG AGATCGGCAA GCTGGTCGGC
AAGGCCGCGC TCGACAACAT GACGCGGGTG TCGCTGGAGC TGGGCGGCAA GAGCCCGGTG
ATCGTGCTCG ACGACGCCGA CCCGGCGGTA GCGGCTGCGG GCGCGGCGCA GGCGATCTTC
TTCAACCAGG GGCAGGTGTG CTGCGCGGGT TCGCGCCTGT ATGTGCACAA GAGTCGTTTC
GAGCGCGTGG TGGAGGGGCT GTCGGGGATC GCCGCGGACA TGAAGCTCGG CGCCGGCATC
GAGCCCTCGA CGCAGATCGG CCCGCTGGTG TCGGCCGTCC AGCAGCAGCG CGTGCTGGGC
TACATCCGCA GCGGCTTCGA GGAGGGCGCG CGCGCGCTGG CGGGCGGCGC AGCGGGCGAG
GGCGAGGGCT ACTTCGTCAA GCCGACGGTG CTGGTCGATA CCCGCGACGA CATGCGCGTG
GTGCGCGAGG AGATCTTCGG CCCGGTGGTG GTGGCGATGC CCTACGACGA TCTCGACGAG
GTCGCCCGCC GCGCCAACGA CACCCCATAC GGCCTCGGCG CCAGCATCTG GTCCAACGAC
CTGTCGCGGG TGCACCGCCT GGTGCCGAAG ATCAAGGCCG GTACGGTGTG GGTCAACTGC
CACAACATCC TCGACGCCTC GATGCCCTTC GGCGGCTACA AGCAGTCCGG CATCGGCCGC
GAGATGGGGC GCGCGGTGCT CGACCTGTAC ACCGAGGGCA AGTCGGTGAT CATGGCCCTG
TAA
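
The GC content reported under Gene Information can be recomputed from the gene sequence above. The following is a minimal Python sketch, assuming the sequence is pasted in the space-separated 10-base blocks shown on this page; the names clean_sequence and gc_content are hypothetical helpers chosen for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Only the first line of the gene sequence is used here as sample input;
# paste all lines of the sequence to evaluate the whole gene.
first_line = "ATGGACCGAT ACAAGGGTGA ACTGCTGAGC GACGACACGC GCCGGTTCCT GGCGCAGCCC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_content(seq):.0%}")
```

Running the same functions on the complete sequence gives a value that can be compared with the GC content field of the record.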
 
Protein sequence
MDRYKGELLS DDTRRFLAQP KRMAIGGEWV EALGGGMLEV VDPASGQVFD RVPAGEATDI 
DRAVAAARRA FEQGDWPRMR PVDRERLLLR LAELVEAHAQ ELAEIEALDN GKPVTMARAV
DVALVVDFLR YMAGWATKIE GSTMEVSVPL VRDREFFGYT RREPVGVVGA IIPWNFPLLM
VAWKAGPALA SGCTMVLKPA EETPLSALRF AELVQQAGYP AGVFNVVTGH GHSAGAALAA
HKGVDKVAFT GSTEIGKLVG KAALDNMTRV SLELGGKSPV IVLDDADPAV AAAGAAQAIF
FNQGQVCCAG SRLYVHKSRF ERVVEGLSGI AADMKLGAGI EPSTQIGPLV SAVQQQRVLG
YIRSGFEEGA RALAGGAAGE GEGYFVKPTV LVDTRDDMRV VREEIFGPVV VAMPYDDLDE
VARRANDTPY GLGASIWSND LSRVHRLVPK IKAGTVWVNC HNILDASMPF GGYKQSGIGR
EMGRAVLDLY TEGKSVIMAL
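
The protein sequence can be checked against the gene sequence by translating the latter. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons, so a standard codon table is sufficient for this check. The following is a minimal Python sketch under that assumption; the names CODON_TABLE and translate are illustrative, and unknown codons are reported as X by choice.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 20 bases of the gene sequence: matches the start of the protein sequence.
print(translate("ATGGACCGAT ACAAGGGTGA"))  # MDRYKG
# Last 12 bases of the gene sequence: matches the end of the protein sequence.
print(translate("ATGGCCCTG TAA"))          # MAL
```

Passing the complete gene sequence to translate should reproduce the 500-residue protein listed above if the record is internally consistent.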