Gene Tmz1t_2985 details

Gene Information

Locus tag: Tmz1t_2985
Symbol:
ID: 7874375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3238334
End bp: 3239755
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 72%
IMG OID: 643699906
Product: Aldehyde Dehydrogenase
Protein accession: YP_002889961
Protein GI: 237653647
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
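
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 3238334, 3239755

gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints: 1422 473
```

Under these assumptions the computed values agree with the Gene Length (1422 bp) and Protein Length (473 aa) fields.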


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.837677
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAGA CGCCCTCCCT CGTCATCGGC GGCCGCAAGG TCGCCGGCGA CCGCGGCACC 
CTGCCGGTCA TCGACCCCGC GCTCGGCGAG GCCTTCGCCG AGTGCCCGAA CGCCTCCCCC
GCGCAGCTCG ACGAGGCGGT GGCGGCGGCC GCGCGCGCGT TCGAGCAGTG GCAGCACAGC
TCCTGCGCGG AACGCCGCGC CCTGCTCGAA GCGATCGCCG CGCGCATCGA ACAGAACGCG
CCCGAGCTCG CCGAGATCAT CGTCCGCGAG CAGGGCAAGC CGCTCGCGCT CGCGCACATG
GAGGTCGGCG GCGCGGTCGC ATGGACGCGC GCCACCGCCG CGCTCGAGCT GCCGGTCGAG
GTGATCGAGG ACCGCCCGGG CAAGCGCATC GAGCTGCACC GCAGGCCGCT CGGCGTGGTG
GGATCGATCA CGCCGTGGAA CTGGCCGCTG ATGATCGCGG TGTGGCACAT CATGCCCGCG
CTGCGCGCCG GCAACGCGGT GGTGATCAAG CCCTCGGAGC TGACCCCGCT CAACACCCTG
CGCCTGGTCG AGCTGATCGA CGAGGTGGCG CCGCCCGGGC TGGTGAATGC GGTGGCCGGC
GGCGCAGCGC TCGGGCGCGG GATCTCGGGC CACCCCGGCA TCCACAAGAT CGTGTTCACC
GGCTCGACGC GCACCGGCCA GGACATCATG CGCAACGCCG CCGACACGCT GAAGCGGCTC
ACACTGGAAC TGGGCGGCAA TGACGCCGGC ATCGTGCTGC CGGGCACCGA CATCGGCGCG
ATCGCCGAGG GGGTGTTCGG CAGCGCCTTC CTCAACATGG GACAGACCTG CGCCGCGCTC
AAGCGCCTCT ATGTGCACGA GTCCCAGTAC GAGGACATGT GCCGGCACCT GGTCGCCATC
GCCGCGCGGC AGAAGCTCGG CAGCGGCCTG GACGAGGGCA CGAGCTTCGG GCCGATCCAG
AACCGCGACC AGTTCGAGCG CGTATGCGAG CTCGTCGAGG ATGCACGCGC CGCCGGCGCC
CGCATCCTGT GCGGCGGCGA GCCGCTGCCC GGCAAGGGCT ACTTCTACCC GCCGACCATC
GTCGCCGACA TCGCCGACGG CACCCGGCTG GTCGACGAGG AGCAGTTCGG CCCGGTGCTG
CCGGTGATCC GCTACCGCGA CGTGGACGAG GCGCTGCGGC TCGCCAACGC CAGCACCAAC
GGTCTCGGCG GCTCGGTGTG GTCGGGCGAC CTGGAGGCCG CGCGCGCGCT GGCGAACCGC
CTCGAGTGCG GCACGGTGTG GATCAACGGC CATGCCGAGG TATTGCCGCA CTGCCCCTTT
GGCGGCTGCA AGATGTCGGG CTTCGGCGTC GAGTTCGGCC TCGAGGGGCT GCTCGAATAC
ACCCGCCCGC AGCTCTTCAA CATCAACCTC CCCGCCGCCT GA
 
Protein sequence
MNKTPSLVIG GRKVAGDRGT LPVIDPALGE AFAECPNASP AQLDEAVAAA ARAFEQWQHS 
SCAERRALLE AIAARIEQNA PELAEIIVRE QGKPLALAHM EVGGAVAWTR ATAALELPVE
VIEDRPGKRI ELHRRPLGVV GSITPWNWPL MIAVWHIMPA LRAGNAVVIK PSELTPLNTL
RLVELIDEVA PPGLVNAVAG GAALGRGISG HPGIHKIVFT GSTRTGQDIM RNAADTLKRL
TLELGGNDAG IVLPGTDIGA IAEGVFGSAF LNMGQTCAAL KRLYVHESQY EDMCRHLVAI
AARQKLGSGL DEGTSFGPIQ NRDQFERVCE LVEDARAAGA RILCGGEPLP GKGYFYPPTI
VADIADGTRL VDEEQFGPVL PVIRYRDVDE ALRLANASTN GLGGSVWSGD LEAARALANR
LECGTVWING HAEVLPHCPF GGCKMSGFGV EFGLEGLLEY TRPQLFNINL PAA
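
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method used by the database: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which start codons it permits, so the standard assignments are used here; the sketch does not handle alternative start codons, which is not an issue for a gene starting with ATG.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_30 = "ATGAACAAGA CGCCCTCCCT CGTCATCGGC"
print(translate(first_30))   # prints: MNKTPSLVIG
print(gc_content(first_30))  # prints: 0.6
```

The ten residues produced from the first 30 bases match the start of the protein sequence listed above. The same functions could be applied to the full gene sequence to compare against the Protein sequence and GC content fields.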