Gene Tmz1t_3946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3946 
Symbol 
ID7873592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4341710 
End bp4343215 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content74% 
IMG OID643700883 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002890906 
Protein GI237654592 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAGCA ACGCCCGCTT CCTTCCCCTC GGCCTCGACG CCCCCCGCCT GCGCGAGGGC 
GCGCTGGCGG TGCATAGCCC GATCGACGGC AGCCTGCTCG CGCGCCTCGC GCCGCAGGAT
GGCGCCACCA CCGATGCCGC GATCGCGCGC AGCGTGGCGG CCTTCGAGGC CTGGCGCCGC
GTGCCGGCGC CGCGCCGTGG CGAGCTGGTG CGCCGCTTCG CGCAGGCGCT GCGCGAGCAC
AAGGCCCTCC TCGCCGAACT GGTCACCCTC GAGTGCGGCA AGATCCGCAG CGAGGGCGAA
GGCGAGGTGC AGGAGATGAT CGACATCTGC GACTTCGCGG TCGGCCTGTC GCGCCAGCTG
CACGGCCTGA CCATCGCCTC CGAGCGGCCC GGCCATGCCT TGCGCGAGAG CTGGCACCCG
CTCGGCCCGG TGGCGATCGT CACCGCCTTC AACTTCCCGG TCGCGGTGTG GGCGTGGAAC
GCAGCGATCG CGCTGGTGTG CGGCGACAGC CTGCTGTGGA AGCCCTCCGA GCGCACCCCG
CTGTGCGCGC TCGCCTGCCA GCGCCTGCTC GAACAGGCGG CGGCGGGCAT GGAGGAGGTG
CCGCGCGGGC TGTCGGCGGT GATCGTGGGC GGCGCCGAGC GCGCCGTGCA GCTCGCCGAC
GACCGCCGCG TGGCGCTGCT CTCGGCCACC GGCAGCTGCG CGATGGGGCG GGCGCTCGCG
CCGCGGGTGG CGCAGCGGCT CGGGCGCAGC CTGCTCGAGC TCGGCGGCAA CAACGCGGTG
ATCGTGGCGC CGAGCGCCGA CCTCGAGCTC GCGCTGCGCG CCATCGTGTT CGGCGCGGTC
GGCACCGCCG GCCAGCGCTG CACCGGCACC CGCCGGCTGT TCGTGCACGC GGCCGTGCGC
GAGCAACTGC TCGAACGCCT GCGCGCGGTG TTTGCCGGCT TGGTGGTGGG CGATCCGCGT
GCGGCCGACA CCCTGGTCGG GCCGCTGATC GGGGGCGAGG CCTTCACCCG CATGCAGGCA
GCGCTCGCCG CGGCGCGCGC GGCGGGGGCG CGCATCGATG GTGGCGAGCG CGTGCTCGCC
GAGCGTTTCC CCGCGGCCTG GTACGTGCGT CCGGCCCTGG TCGAGCATCC GCCCTCCGTG
ACGAACAGCA TGGAGGAGGT CTTCGCGCCG CTGCTGAACT GCTTCGAGTA CGCGGAGCTG
GAGGATGCGA TCGCGCGCCA GAACGCGGTG CCGCAGGGCC TGTCGTCGGC GATCTTCACC
ACCGACCTGC GCGAGGCCGA GCGCTTCCTC TCCACCACCG GCAGCGACTG CGGCATCGCC
AACGTCAATG CGGGCACCAG TGGCGCCGAG ATCGGCGGCG CCTTCGGCGG CGAGAAGGAC
AGCGGCGGCG GGCGCGAGGC GGGTGCCGAT GCCTGGCGCG CCTACATGCG CAGGATGACC
GCGACGATCA ACTACTCCGA CGCGCTGCCG CTGGCGCAGG GGGTGCGTTT CGAGCCGGCC
GGCTGA
 
Protein sequence
MLSNARFLPL GLDAPRLREG ALAVHSPIDG SLLARLAPQD GATTDAAIAR SVAAFEAWRR 
VPAPRRGELV RRFAQALREH KALLAELVTL ECGKIRSEGE GEVQEMIDIC DFAVGLSRQL
HGLTIASERP GHALRESWHP LGPVAIVTAF NFPVAVWAWN AAIALVCGDS LLWKPSERTP
LCALACQRLL EQAAAGMEEV PRGLSAVIVG GAERAVQLAD DRRVALLSAT GSCAMGRALA
PRVAQRLGRS LLELGGNNAV IVAPSADLEL ALRAIVFGAV GTAGQRCTGT RRLFVHAAVR
EQLLERLRAV FAGLVVGDPR AADTLVGPLI GGEAFTRMQA ALAAARAAGA RIDGGERVLA
ERFPAAWYVR PALVEHPPSV TNSMEEVFAP LLNCFEYAEL EDAIARQNAV PQGLSSAIFT
TDLREAERFL STTGSDCGIA NVNAGTSGAE IGGAFGGEKD SGGGREAGAD AWRAYMRRMT
ATINYSDALP LAQGVRFEPA G