Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2985 |
Symbol | |
ID | 7874375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3238334 |
End bp | 3239755 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643699906 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_002889961 |
Protein GI | 237653647 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.837677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA CGCCCTCCCT CGTCATCGGC GGCCGCAAGG TCGCCGGCGA CCGCGGCACC CTGCCGGTCA TCGACCCCGC GCTCGGCGAG GCCTTCGCCG AGTGCCCGAA CGCCTCCCCC GCGCAGCTCG ACGAGGCGGT GGCGGCGGCC GCGCGCGCGT TCGAGCAGTG GCAGCACAGC TCCTGCGCGG AACGCCGCGC CCTGCTCGAA GCGATCGCCG CGCGCATCGA ACAGAACGCG CCCGAGCTCG CCGAGATCAT CGTCCGCGAG CAGGGCAAGC CGCTCGCGCT CGCGCACATG GAGGTCGGCG GCGCGGTCGC ATGGACGCGC GCCACCGCCG CGCTCGAGCT GCCGGTCGAG GTGATCGAGG ACCGCCCGGG CAAGCGCATC GAGCTGCACC GCAGGCCGCT CGGCGTGGTG GGATCGATCA CGCCGTGGAA CTGGCCGCTG ATGATCGCGG TGTGGCACAT CATGCCCGCG CTGCGCGCCG GCAACGCGGT GGTGATCAAG CCCTCGGAGC TGACCCCGCT CAACACCCTG CGCCTGGTCG AGCTGATCGA CGAGGTGGCG CCGCCCGGGC TGGTGAATGC GGTGGCCGGC GGCGCAGCGC TCGGGCGCGG GATCTCGGGC CACCCCGGCA TCCACAAGAT CGTGTTCACC GGCTCGACGC GCACCGGCCA GGACATCATG CGCAACGCCG CCGACACGCT GAAGCGGCTC ACACTGGAAC TGGGCGGCAA TGACGCCGGC ATCGTGCTGC CGGGCACCGA CATCGGCGCG ATCGCCGAGG GGGTGTTCGG CAGCGCCTTC CTCAACATGG GACAGACCTG CGCCGCGCTC AAGCGCCTCT ATGTGCACGA GTCCCAGTAC GAGGACATGT GCCGGCACCT GGTCGCCATC GCCGCGCGGC AGAAGCTCGG CAGCGGCCTG GACGAGGGCA CGAGCTTCGG GCCGATCCAG AACCGCGACC AGTTCGAGCG CGTATGCGAG CTCGTCGAGG ATGCACGCGC CGCCGGCGCC CGCATCCTGT GCGGCGGCGA GCCGCTGCCC GGCAAGGGCT ACTTCTACCC GCCGACCATC GTCGCCGACA TCGCCGACGG CACCCGGCTG GTCGACGAGG AGCAGTTCGG CCCGGTGCTG CCGGTGATCC GCTACCGCGA CGTGGACGAG GCGCTGCGGC TCGCCAACGC CAGCACCAAC GGTCTCGGCG GCTCGGTGTG GTCGGGCGAC CTGGAGGCCG CGCGCGCGCT GGCGAACCGC CTCGAGTGCG GCACGGTGTG GATCAACGGC CATGCCGAGG TATTGCCGCA CTGCCCCTTT GGCGGCTGCA AGATGTCGGG CTTCGGCGTC GAGTTCGGCC TCGAGGGGCT GCTCGAATAC ACCCGCCCGC AGCTCTTCAA CATCAACCTC CCCGCCGCCT GA
|
Protein sequence | MNKTPSLVIG GRKVAGDRGT LPVIDPALGE AFAECPNASP AQLDEAVAAA ARAFEQWQHS SCAERRALLE AIAARIEQNA PELAEIIVRE QGKPLALAHM EVGGAVAWTR ATAALELPVE VIEDRPGKRI ELHRRPLGVV GSITPWNWPL MIAVWHIMPA LRAGNAVVIK PSELTPLNTL RLVELIDEVA PPGLVNAVAG GAALGRGISG HPGIHKIVFT GSTRTGQDIM RNAADTLKRL TLELGGNDAG IVLPGTDIGA IAEGVFGSAF LNMGQTCAAL KRLYVHESQY EDMCRHLVAI AARQKLGSGL DEGTSFGPIQ NRDQFERVCE LVEDARAAGA RILCGGEPLP GKGYFYPPTI VADIADGTRL VDEEQFGPVL PVIRYRDVDE ALRLANASTN GLGGSVWSGD LEAARALANR LECGTVWING HAEVLPHCPF GGCKMSGFGV EFGLEGLLEY TRPQLFNINL PAA
|
| |