Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3544 |
Symbol | |
ID | 7873050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3885069 |
End bp | 3885809 |
Gene Length | 741 bp |
Protein Length | 246 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643700485 |
Product | short chain dehydrogenase |
Protein accession | YP_002890515 |
Protein GI | 237654201 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.16365 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCGA TGCGCGCCGT ACTCACCGGA CACAACCGCG GCCTCGGCGA GGCGGTCGCC GCGGAGCTGC TCGATCGCGG CATCGCGGTG CTCGCGCTCG CACGCCGCAG CAATCCCGCG CTCGCCGCCC GCCACGGCGA GCGCCTGCGC GAGGTCGCGC TCGATCTCGC CGACGCCGCG GCGCTCGCCG CCTGGCTCGG CTCGGGCATG CTGCAAGGCT GGCTCGCCGA CGCCGACGAG GCGCTGCTGA TCAACAACGC CGGCACCCTG CAGCCGATGG GACCGCTCGC GCTGCAGGAC CCGGCGGCGG TGGCGCGTGC GGTGGCGAGC AACGTCGGCG CGCCGCTGGC GCTCGCCGCC GCCTTCGCCG CCGCCGAAGG CCCGCAGGCG CGCCGCGTGC TGCATGTCTC CAGCGGTGCG GCGCGCAAGC CCTACCCGGG CTGGAGCGTG TATTGCGCCA CCAAGGCCGC GCTCGACCAC CACGCCCGCG CGGTGCAGCT CGACGCCGTG CCGGGGCTGC GCATCTGCGC GCTCGCCCCG GGAGTGATCG ATACCGGCAT GCAGGCCGAG ATCCGCGCCA GCACGTCGGA ACGCTTCCCG CTGCGCGACC GCTTCGCCGA GATGCAGGCC AGCGGCGGTC TGGTCGCCCC GGCCGAGTGC GCGATGCACC TGGTCGACTT CCTGCTCGAC GAGGACTTCG GCCGCGAGGC GGTGGCCGAC CTGCGCGACC TCGGACGATG A
|
Protein sequence | MNAMRAVLTG HNRGLGEAVA AELLDRGIAV LALARRSNPA LAARHGERLR EVALDLADAA ALAAWLGSGM LQGWLADADE ALLINNAGTL QPMGPLALQD PAAVARAVAS NVGAPLALAA AFAAAEGPQA RRVLHVSSGA ARKPYPGWSV YCATKAALDH HARAVQLDAV PGLRICALAP GVIDTGMQAE IRASTSERFP LRDRFAEMQA SGGLVAPAEC AMHLVDFLLD EDFGREAVAD LRDLGR
|
| |