Gene Information |
Locus tag | Tmz1t_3099 |
Symbol | benD |
ID | 7874569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3355578 |
End bp | 3356354 |
Gene Length | 777 bp |
Protein Length | 258 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643700022 |
Product | 1,6-dihydroxycyclohexa-2,4-diene-1-carboxylate dehydrogenase |
Protein accession | YP_002890074 |
Protein GI | 237653760 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
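The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3355578
END_BP = 3356354


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp)                  # 777, matching "Gene Length"
    print(protein_length(length_bp))  # 258, matching "Protein Length"
```

Under these assumptions the computed values agree with the reported gene length of 777 bp and protein length of 258 aa.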
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.763033 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAACC GTTTCAAGAA GAAAGTGGCC GTCATCACCG GCGCCGCGCA GGGCATCGGC AAGCGCGTCG CCGAGAAGAT GGCCAAGGAG AAGGGCACCC TGGTGCTGGT CGACCGCTCC GAGCTCGTGC ATGAGGTGGC CAAGGAGCTG TCCGGCAAGG TCGAAGTCCT CAGCCTCACC GCCGACCTGG AGAAGTTCGC CGACTGCAAG CGGGTCATGG AGGCCGCGGT CGAGAAGTTC GGCCGCATCG ACATCCTGAT CAACAACGTC GGCGGCACGA TCTGGACCAA GCCCTTCGAG CACTACGAGG AGGACGAGAT CGAGGCCGAG GTGCGGCGCT CGCTCTTCCC CACGCTGTGG TGCTGCCGCG CCGCCCTGCC CTACATGCAG GAACAGGGCA AGGGTGCGAT CGTGAACGTG TCCTCGATCG CCACCCGCGG CGTCAACCGC GTGCCCTACG GCGCGGCCAA GGGCGGCGTC AATGCGCTCA CCGCCTGCCT GGCGTTCGAG AACGCCGAGC GCGGCATCCG CGTCAATGCC ACCGCGCCCG GCGCCACCGA GGCCCCGCCG CGTCTGATCC CGCGCAACAC CAAGGAACAG AGCGAGCAGG AGAAGCTCTG GTACAAGCAG ATCTACACCC AGTCGATCGA CAGCAGCCTG ATGAAGCGCT ACGGCACGCT CGACGAGCAG GCCGACCCGA TCCTGTTCCT CGCCTCCGAC GACGCCGCCT ACATCACCGG CACCATCGTG CCGGTGGGCG GTGGCGACCT GGGCTGA
|
Protein sequence | MANRFKKKVA VITGAAQGIG KRVAEKMAKE KGTLVLVDRS ELVHEVAKEL SGKVEVLSLT ADLEKFADCK RVMEAAVEKF GRIDILINNV GGTIWTKPFE HYEEDEIEAE VRRSLFPTLW CCRAALPYMQ EQGKGAIVNV SSIATRGVNR VPYGAAKGGV NALTACLAFE NAERGIRVNA TAPGATEAPP RLIPRNTKEQ SEQEKLWYKQ IYTQSIDSSL MKRYGTLDEQ ADPILFLASD DAAYITGTIV PVGGGDLG
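The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to rejoin such a blocked string, compute its GC fraction, and translate it. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs mainly in which codons may act as start codons, which is not relevant here because the gene begins with ATG). The helper names `unblock`, `gc_fraction`, and `translate` are hypothetical and introduced for illustration only.

```python
# Codon table built from the NCBI-style ordering of bases T, C, A, G.
# Assumption: table 11 uses the same amino-acid assignments as the
# standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def unblock(blocked: str) -> str:
    """Remove the whitespace separating ten-character blocks; uppercase."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    first_blocks = unblock("ATGGCAAACC GTTTCAAGAA GAAAGTGGCC")
    print(translate(first_blocks))  # MANRFKKKVA
```

Translating the first three blocks reproduces the first block of the listed protein sequence (MANRFKKKVA). Passing the full gene sequence through `unblock` and `gc_fraction` is one way to compare against the reported GC content.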