Gene Information |
Locus tag | Tmz1t_4046 |
Symbol | |
ID | 7873275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4442919 |
End bp | 4444322 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643700979 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_002891002 |
Protein GI | 237654688 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
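
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only illustrative: the function names `cds_span` and `protein_length` are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon not counted in the protein length.

```python
def cds_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming a trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
span = cds_span(4442919, 4444322)
print(span, protein_length(span))
```

Under these assumptions the computed values agree with the listed Gene Length (1404 bp) and Protein Length (467 aa).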
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACAA AAATGCAGCC TTTCACCCCG CCTGCCATAG GCCATCTGTT TAGTGGAGCC ACTGCAGATC AGTACCGCGC GCACTATGAC CCGGGCCGCC TGGACTCGGT AGCGTCGCGA ATTGCTATCG GTACGGCGGC AGATGTAGAC CTGGCAGTCC AACAGGCCTA TCGAGCCTTC CCGGGCTGGC GAGACACGCC GGTGGCTCAG CGTGCGGCAT CACTTGCGAA GGCTGCTGCG CTTGTCCAAG CCAACTCGGC CGAACTTGGC CCCCTCCTTG TTCGCGAACA CGGAGGCGTG TCATGGGAGG CTCAGGCCGA TTTTGCGCTG GGATACGGAG TGCTGAGTCA CACCGCCGAT CTCGCGGAGC GCTTCTTCAA CCCTGTCACG CACGACGAGG AACAGAGCTT CATCAGCATA GAGAAGGATC CGCGTGGCGT AGTCTCAGCC ATCGTGCCTT GGAACATGCC GGTGGTGCTG ACGATGATGA AGCTCGCTCC TGCGCTTGCG ACAGGAAACA CGCTGGTCCT CAAGCCATCC CCCTTCGCAG CAGGAGCACT GACCCTGCTG ATCGAGCGAT TGAGCATGTT TTTCCCCGAA GGCGTAATCA ACGTAGTCCA AGGGGATGTC GAAGTGGGCC AAGCGCTGAC AACCCACCCC CTTGTCCGCA AGGTAGCCTT CACCGGCGGA ACAGCGACCG CACGTCACAT CATGACCGGC GCAGCCAACA CGATAAAAAA CATCACCCTC GAACTCGGTG GTAACGATCC GGCAATCGTC CTCGACGATG CAGACCTCGA CGCCACGCTG GACCGGATGT TGCCGGGGAT CTTCACACGC AGCGGCCAGA TTTGTTTCGC GGTAAAGCGC ATCTATGTTC CACGTGCGTC CTACCGCAGG TTTGTAGACG CTTTGTGCCA ACGCGTATCC GAGTACAAGG TGGGACATGG GTTGAACCCG GAGGCAACGC TCGGCCCCTT GAACAACAAG GCTCAGTACG AGCGTGTGTG CAAGTTTATC GATGGGGCTA AGTCCGGCCC CGCAAAGGTG GTGGAGCTCG GCCGCAGGCT CGAGCCTGAC CAGTGGGACA ATGGCTACTA CGTCCTCCCC CACGTCGTTA GCGACGTGGC GCACGAAGCG CAGACCACCT CACGTGAGCA ATTCGGCCCA ATCATCCCGG TCGTTGCTTA CGATTCCGAG GAGCAGGTAC TCGCTTGGGC CAATGACAGC GAGTACGGCT TGGGTTCCTC CGTATGGACA CGGGATAGTG CGCGTGGACT CGCTTTCGCC CGCAGGATCG AACCCGACTT TGACACTTCC AAGGCACATT TCCGCCCCAC CTCCAGCAAC GGCGCGGGTT TCCGGGCGAT GATGCGCCAA AATCCTAGAT ACATGGATTG TTGA
|
Protein sequence | MNTKMQPFTP PAIGHLFSGA TADQYRAHYD PGRLDSVASR IAIGTAADVD LAVQQAYRAF PGWRDTPVAQ RAASLAKAAA LVQANSAELG PLLVREHGGV SWEAQADFAL GYGVLSHTAD LAERFFNPVT HDEEQSFISI EKDPRGVVSA IVPWNMPVVL TMMKLAPALA TGNTLVLKPS PFAAGALTLL IERLSMFFPE GVINVVQGDV EVGQALTTHP LVRKVAFTGG TATARHIMTG AANTIKNITL ELGGNDPAIV LDDADLDATL DRMLPGIFTR SGQICFAVKR IYVPRASYRR FVDALCQRVS EYKVGHGLNP EATLGPLNNK AQYERVCKFI DGAKSGPAKV VELGRRLEPD QWDNGYYVLP HVVSDVAHEA QTTSREQFGP IIPVVAYDSE EQVLAWANDS EYGLGSSVWT RDSARGLAFA RRIEPDFDTS KAHFRPTSSN GAGFRAMMRQ NPRYMDC
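
The relationship between the gene sequence and the protein sequence above can be sketched as a codon-by-codon translation. This is a minimal illustration, not the method used to produce the record: `translate` and `gc_percent` are hypothetical helper names, the codon table is built from the amino-acid string that NCBI lists for translation table 11, and the sketch does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that case does not arise here). Whitespace inside the sequence, such as the 10-base grouping used above, is ignored.

```python
from itertools import product

# Codon order follows the NCBI layout: bases T, C, A, G, third position fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    seq = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base groups of the gene sequence above.
print(translate("ATGAACACAA AAATGCAGCC TTTCACCCCG"))
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the listed protein sequence and GC content to be compared against the nucleotide data.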