Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2453 |
Symbol | |
ID | 7874137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2646404 |
End bp | 2647402 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643699376 |
Product | putative dehydrogenase |
Protein accession | YP_002889433 |
Protein GI | 237653119 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAGCA CCAACCTCGC CTTCTGGACC GTCCGCCCGG GCTACGGCGA ATTGCGCCCG GCGCCGCTGC GCCCGCCCGC AGACGGCGAG CTGCGGGTGC GCAACCTCTT CGGCGCAGTC AGCCGCGGCA GCGAGAGCCT GGTGTTCCGC GGCGAGGTGC CCGAAAGCGA ATACGAACGC ATGCGCGCCC CCTTCCAGGA GGGCGACTTC CCCGGGCCGC TCAAGTACGG CTACATCGGC GTCGGCGTGG TGGAGGACGG CGTCGGCACC GCGGCCACCG CCTTGCGCGG CCGCACGGTG TTCTGCCTGC ACCCGCATCA GCAGCGCTAT GTGGTACCCG CCGGCGCCGT CGTCCCCCTC CCCGCCGGCG TGCCGGCCGC GCGCGCGGTG CTGGCCGCCA ACCTCGAGAC CGCGATCAAC GCCTGCTGGG ACGGCGTCCC CGCGCTGGGC GACCGCATCG CGGTGGTCGG CGCCGGCGTG GTCGGCAGCC TGGTGGCCTG GCTGTGCGCG CGCCTCCCCG GCGTCGAGCT CGAGCTGATC GACACCGACC CCGGCCGCGC CGGCCTCGCC GCCGCGCTCG GCCTCGTTCA CCGCCTCCCG GAGCAGGCGC GTGGCAACTG CGACCTCGTC TTCCACGCCA GCGGCAACCC CGCCGGCCTG GTGCGCGCGC TCGAACTCGC CGGACAGGAC GCCACCGTCG TGGAGATGAG CTGGTACGGC CGCCGCAGCG CGGAGCTGCC GCTCGGCGCC GCCTTCCACG CCCGCCGCCT GCGCCTGCAG TCCAGCCAGG TCGGCCGCCT GCCGCCGCCA CGCAGCCCGC GCTGGGACTA CCGTCGCCGC ATGGAACTCG CGCTCGCGCT GCTCGTCGAT CCACGTCTGG ACGCACTGAT CAGCGGCGAG ACCGACTTCA CCGACCTGCC CGCGCTGATG CAGCGCCTCG CCGAAGCCCC CGCCGGGGCG CTGTGCGAGC GCATCCGCTA TGCCAGTCCG AGCACCTGA
|
Protein sequence | MSSTNLAFWT VRPGYGELRP APLRPPADGE LRVRNLFGAV SRGSESLVFR GEVPESEYER MRAPFQEGDF PGPLKYGYIG VGVVEDGVGT AATALRGRTV FCLHPHQQRY VVPAGAVVPL PAGVPAARAV LAANLETAIN ACWDGVPALG DRIAVVGAGV VGSLVAWLCA RLPGVELELI DTDPGRAGLA AALGLVHRLP EQARGNCDLV FHASGNPAGL VRALELAGQD ATVVEMSWYG RRSAELPLGA AFHARRLRLQ SSQVGRLPPP RSPRWDYRRR MELALALLVD PRLDALISGE TDFTDLPALM QRLAEAPAGA LCERIRYASP ST
|
| |