Gene Information |
Locus tag | Tmz1t_1648 |
Symbol | |
ID | 7084067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1849121 |
End bp | 1850173 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643698668 |
Product | aldo/keto reductase |
Protein accession | YP_002355299 |
Protein GI | 217970065 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0658571 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGTAC GTGAAATGGA ATACCGCCGG CTGGGCGGAG CGGGGCCGCA GGTGTCGGCG GTGTGCCTGG GCACCATGAC CTTTGGCCAG CAGAACAGCG AGGCCGAGGC GCACAGCCAG CTCGACCTCG CCGTGGAGCG CGGGATCGAC TTCATCGACA CCGCCGAGAT GTATCCGGTC CCGGCGCGTG CCGAGACCAG CGGCGAGACC GAACGCATCG TCGGCAACTG GCTGGCGCCC CAGGTGCGCG AGAAGCTGGT GATCGCGACC AAGGTGGCGG GGCCGGCGCG CAGCCTGGAG TGGATCCGCG GCGGTCCGCT CGCGCTCGAC CGCGCCAACA TCCGCACCGC GGTCGAGGGC AGCCTGCGCC GGCTGCGCAC CGACTACATC GACCTGTATC AGCTGCACTG GCCGGCGCGC AACCAGCCGC TGTTCGGCCA GCTGCCCTTC GATCCGGCGC GCGAGCGCGA ATCCACGCCG ATCCGCGCCC AGCTCGAGGC GCTCGCCGAA CTGGTGGACG AGGGCAAGAT CCGCCACGTC GGCCTGTCGA ACGAGCAGCC CTGGGGGCTG ATGGAGTTCG TGCGCATGGC GCAGGAGAGC GGCCTGCCAC GCGTGGTGTC GGTGCAGAAC GCCTACAACC TGCTCAACAG GGTGTACGAG TACGGCATGA GCGAGATCAC GCTGCGCGAG GACGTGGCGC TGCTGGCGTA TTCGCCGCTC GCCTTCGGCC ACCTGTCGGG CAAGTATCTC GCCGACCCGG CTGCGCGCGG GCGCCTCACC GAGTTCGAGA ATTTCGGTGT GCGCTACGCC AAGCCCGGTG TGCGTGCGGC GGTGGAGCGC TACGCGGAGA TCGCGCGGCG GCGCGGGATG AGCCTCACCG CGCTGGCGCT CGCCTTCGTG TATTCGCGCT GGTTCGTGGG CAGCACCATC GTGGGCGCGA CCACGGTGGC GCAACTCGCC GAGAACCTCG ACGCCTGGCA TCGGCGTCTC GACGAGGATG CGCTCGCCGA GATCGAGCGG GTTCATCAGC TGCACGGCAA CCCGGCGCCC TGA
|
Protein sequence | MAVREMEYRR LGGAGPQVSA VCLGTMTFGQ QNSEAEAHSQ LDLAVERGID FIDTAEMYPV PARAETSGET ERIVGNWLAP QVREKLVIAT KVAGPARSLE WIRGGPLALD RANIRTAVEG SLRRLRTDYI DLYQLHWPAR NQPLFGQLPF DPARERESTP IRAQLEALAE LVDEGKIRHV GLSNEQPWGL MEFVRMAQES GLPRVVSVQN AYNLLNRVYE YGMSEITLRE DVALLAYSPL AFGHLSGKYL ADPAARGRLT EFENFGVRYA KPGVRAAVER YAEIARRRGM SLTALALAFV YSRWFVGSTI VGATTVAQLA ENLDAWHRRL DEDALAEIER VHQLHGNPAP
|
| |
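Note: the coordinate, length, and GC fields in this record are related by simple arithmetic. The sketch below is one way to cross-check them. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length), and that the CDS is unspliced and ends in a single stop codon, as the record and the trailing TGA suggest. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp):
    """Amino-acid count for an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


def gc_fraction(seq):
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(1849121, 1850173)
    print(length)                           # 1053
    print(expected_protein_length(length))  # 350
```

Passing the Gene sequence field above to gc_fraction should give a value comparable to the listed GC content; the spaces between the 10-base blocks do not need to be removed first.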