Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3946 |
Symbol | |
ID | 7873592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4341710 |
End bp | 4343215 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643700883 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_002890906 |
Protein GI | 237654592 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGAGCA ACGCCCGCTT CCTTCCCCTC GGCCTCGACG CCCCCCGCCT GCGCGAGGGC GCGCTGGCGG TGCATAGCCC GATCGACGGC AGCCTGCTCG CGCGCCTCGC GCCGCAGGAT GGCGCCACCA CCGATGCCGC GATCGCGCGC AGCGTGGCGG CCTTCGAGGC CTGGCGCCGC GTGCCGGCGC CGCGCCGTGG CGAGCTGGTG CGCCGCTTCG CGCAGGCGCT GCGCGAGCAC AAGGCCCTCC TCGCCGAACT GGTCACCCTC GAGTGCGGCA AGATCCGCAG CGAGGGCGAA GGCGAGGTGC AGGAGATGAT CGACATCTGC GACTTCGCGG TCGGCCTGTC GCGCCAGCTG CACGGCCTGA CCATCGCCTC CGAGCGGCCC GGCCATGCCT TGCGCGAGAG CTGGCACCCG CTCGGCCCGG TGGCGATCGT CACCGCCTTC AACTTCCCGG TCGCGGTGTG GGCGTGGAAC GCAGCGATCG CGCTGGTGTG CGGCGACAGC CTGCTGTGGA AGCCCTCCGA GCGCACCCCG CTGTGCGCGC TCGCCTGCCA GCGCCTGCTC GAACAGGCGG CGGCGGGCAT GGAGGAGGTG CCGCGCGGGC TGTCGGCGGT GATCGTGGGC GGCGCCGAGC GCGCCGTGCA GCTCGCCGAC GACCGCCGCG TGGCGCTGCT CTCGGCCACC GGCAGCTGCG CGATGGGGCG GGCGCTCGCG CCGCGGGTGG CGCAGCGGCT CGGGCGCAGC CTGCTCGAGC TCGGCGGCAA CAACGCGGTG ATCGTGGCGC CGAGCGCCGA CCTCGAGCTC GCGCTGCGCG CCATCGTGTT CGGCGCGGTC GGCACCGCCG GCCAGCGCTG CACCGGCACC CGCCGGCTGT TCGTGCACGC GGCCGTGCGC GAGCAACTGC TCGAACGCCT GCGCGCGGTG TTTGCCGGCT TGGTGGTGGG CGATCCGCGT GCGGCCGACA CCCTGGTCGG GCCGCTGATC GGGGGCGAGG CCTTCACCCG CATGCAGGCA GCGCTCGCCG CGGCGCGCGC GGCGGGGGCG CGCATCGATG GTGGCGAGCG CGTGCTCGCC GAGCGTTTCC CCGCGGCCTG GTACGTGCGT CCGGCCCTGG TCGAGCATCC GCCCTCCGTG ACGAACAGCA TGGAGGAGGT CTTCGCGCCG CTGCTGAACT GCTTCGAGTA CGCGGAGCTG GAGGATGCGA TCGCGCGCCA GAACGCGGTG CCGCAGGGCC TGTCGTCGGC GATCTTCACC ACCGACCTGC GCGAGGCCGA GCGCTTCCTC TCCACCACCG GCAGCGACTG CGGCATCGCC AACGTCAATG CGGGCACCAG TGGCGCCGAG ATCGGCGGCG CCTTCGGCGG CGAGAAGGAC AGCGGCGGCG GGCGCGAGGC GGGTGCCGAT GCCTGGCGCG CCTACATGCG CAGGATGACC GCGACGATCA ACTACTCCGA CGCGCTGCCG CTGGCGCAGG GGGTGCGTTT CGAGCCGGCC GGCTGA
|
Protein sequence | MLSNARFLPL GLDAPRLREG ALAVHSPIDG SLLARLAPQD GATTDAAIAR SVAAFEAWRR VPAPRRGELV RRFAQALREH KALLAELVTL ECGKIRSEGE GEVQEMIDIC DFAVGLSRQL HGLTIASERP GHALRESWHP LGPVAIVTAF NFPVAVWAWN AAIALVCGDS LLWKPSERTP LCALACQRLL EQAAAGMEEV PRGLSAVIVG GAERAVQLAD DRRVALLSAT GSCAMGRALA PRVAQRLGRS LLELGGNNAV IVAPSADLEL ALRAIVFGAV GTAGQRCTGT RRLFVHAAVR EQLLERLRAV FAGLVVGDPR AADTLVGPLI GGEAFTRMQA ALAAARAAGA RIDGGERVLA ERFPAAWYVR PALVEHPPSV TNSMEEVFAP LLNCFEYAEL EDAIARQNAV PQGLSSAIFT TDLREAERFL STTGSDCGIA NVNAGTSGAE IGGAFGGEKD SGGGREAGAD AWRAYMRRMT ATINYSDALP LAQGVRFEPA G
|
| |