Gene Tmz1t_2872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2872 
Symbol 
ID7873774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3108790 
End bp3110310 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content66% 
IMG OID643699793 
ProductAldehyde dehydrogenase (NAD(+)) 
Protein accessionYP_002889848 
Protein GI237653534 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGTACG CACTGCCCGG CCAGGCCGAA GCCAAGGTTC AGTTCAAGAC CCGCTACGAC 
AACTTCATTG GTGGCAAGTG GGTGGCCCCG GTCAAGGGCC AGTATTTCGA TGTGGTGACC
CCGATCACCG GCCAGAAGTA CACCCAGGCC GCGCGTTCCA CCGCCGAGGA CATCGAGCTC
GCCCTCGACG CCGCGCACGC AGCCTTCCCC AAGTGGGGCA AGGCCGACGC CACCACCCGT
TCGAACATCC TGCTCAAGAT CGCCGACCGC CTCGAGGCCA ACCTCGAGCT GCTCGCCTAC
GCCGAGACCG TCGATAACGG CAAGCCGATC CGCGAGACGC TCAACGCCGA CATCCCGCTC
GCCGTCGACC ACTTCCGCTA CTTCGCCGGC TGCCTGCGCT CGCAGGAAGG CGGCATCTCG
GAGATCGACG AGAACACCAT GGCCTATCAC ATCCACGAGC CCCTGGGCGT GGTCGGCCAG
ATCATCCCGT GGAACTTCCC CATCCTGATG GCGGCGTGGA AGCTGGCGCC GGCGCTGGGT
GCGGGCAACT GCGTGGTGCT CAAGCCCGCC GAGTCGACCC CGATCTCGAT CCTGGTGCTG
ATGGAGCTGA TCGCCGACCT GCTGCCGCCG GGCGTGCTCA ACATCGTCAA CGGCTACGGC
CGCGAGGCCG GCATGCCGCT CGCCACCAGC AAGCGCATCG CCAAGATCGC CTTCACCGGC
TCCACCTCCA CCGGCCGCGT GATCGCCCAG GCCGCCGCCA ACAACCTGAT CCCGGCCACG
CTCGAACTGG GCGGCAAGTC GCCCAACATC TTCTTCGCCG ACGTCATGGA CAAGGACGAC
GCCTTCCTCG ACAAGGCGAT CGAGGGCCTG GTGCTGTTCG CCTTCAACCA GGGCGAGGTG
TGCACCTGCC CGAGCCGCGC GCTGATCCAG GAATCGATCT ACGACCGCTT CATGGAGCGT
GCTCTGAAGC GCGTCGCCGC GATCAAGCAG GGCAGCCCGC TCGACACCGA CACCATGATG
GGCGCGCAGG CCTCCGCCGA GCAGATGAGC AAGATCCAGT CGTATCTGCA GCTCGGCAAG
GAAGAGGGCG CCCAGGTGCT GATCGGCGGT GCGCGCGCCC AACTCGGCGG CGACCTGGCC
GACGGCTTCT ATATCCAGCC GACGCTGTTC AAGGGCCACA ACAAGATGCG CATCTTCCAG
GAGGAGATCT TCGGGCCGGT GCTCGCGGTG ACCACCTTCA AGGACGAGGC CGAGGCCCTG
GCGATCGCAA ATGACACCCT GTATGGTCTG GGCGCCGGCG TGTGGAGCCG CAACGGCAAC
GTCGCCTACC GCATGGGCCG CGCCATCCAG GCCGGGCGCG TGTGGACCAA CTGCTACCAC
GCCTACCCGG CGCACGCGGC CTTCGGCGGC TACAAGGAGT CGGGCATCGG CCGCGAGACG
CACAAGGTCA TGCTCGACCA CTACCAGCAG ACCAAGAACC TGCTGGTGAG CTACAGCGAG
AACAAGCTCG GCTTCTTCTG A
 
Protein sequence
MLYALPGQAE AKVQFKTRYD NFIGGKWVAP VKGQYFDVVT PITGQKYTQA ARSTAEDIEL 
ALDAAHAAFP KWGKADATTR SNILLKIADR LEANLELLAY AETVDNGKPI RETLNADIPL
AVDHFRYFAG CLRSQEGGIS EIDENTMAYH IHEPLGVVGQ IIPWNFPILM AAWKLAPALG
AGNCVVLKPA ESTPISILVL MELIADLLPP GVLNIVNGYG REAGMPLATS KRIAKIAFTG
STSTGRVIAQ AAANNLIPAT LELGGKSPNI FFADVMDKDD AFLDKAIEGL VLFAFNQGEV
CTCPSRALIQ ESIYDRFMER ALKRVAAIKQ GSPLDTDTMM GAQASAEQMS KIQSYLQLGK
EEGAQVLIGG ARAQLGGDLA DGFYIQPTLF KGHNKMRIFQ EEIFGPVLAV TTFKDEAEAL
AIANDTLYGL GAGVWSRNGN VAYRMGRAIQ AGRVWTNCYH AYPAHAAFGG YKESGIGRET
HKVMLDHYQQ TKNLLVSYSE NKLGFF