Gene Tmz1t_2453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2453 
Symbol 
ID7874137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2646404 
End bp2647402 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content75% 
IMG OID643699376 
Productputative dehydrogenase 
Protein accessionYP_002889433 
Protein GI237653119 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAGCA CCAACCTCGC CTTCTGGACC GTCCGCCCGG GCTACGGCGA ATTGCGCCCG 
GCGCCGCTGC GCCCGCCCGC AGACGGCGAG CTGCGGGTGC GCAACCTCTT CGGCGCAGTC
AGCCGCGGCA GCGAGAGCCT GGTGTTCCGC GGCGAGGTGC CCGAAAGCGA ATACGAACGC
ATGCGCGCCC CCTTCCAGGA GGGCGACTTC CCCGGGCCGC TCAAGTACGG CTACATCGGC
GTCGGCGTGG TGGAGGACGG CGTCGGCACC GCGGCCACCG CCTTGCGCGG CCGCACGGTG
TTCTGCCTGC ACCCGCATCA GCAGCGCTAT GTGGTACCCG CCGGCGCCGT CGTCCCCCTC
CCCGCCGGCG TGCCGGCCGC GCGCGCGGTG CTGGCCGCCA ACCTCGAGAC CGCGATCAAC
GCCTGCTGGG ACGGCGTCCC CGCGCTGGGC GACCGCATCG CGGTGGTCGG CGCCGGCGTG
GTCGGCAGCC TGGTGGCCTG GCTGTGCGCG CGCCTCCCCG GCGTCGAGCT CGAGCTGATC
GACACCGACC CCGGCCGCGC CGGCCTCGCC GCCGCGCTCG GCCTCGTTCA CCGCCTCCCG
GAGCAGGCGC GTGGCAACTG CGACCTCGTC TTCCACGCCA GCGGCAACCC CGCCGGCCTG
GTGCGCGCGC TCGAACTCGC CGGACAGGAC GCCACCGTCG TGGAGATGAG CTGGTACGGC
CGCCGCAGCG CGGAGCTGCC GCTCGGCGCC GCCTTCCACG CCCGCCGCCT GCGCCTGCAG
TCCAGCCAGG TCGGCCGCCT GCCGCCGCCA CGCAGCCCGC GCTGGGACTA CCGTCGCCGC
ATGGAACTCG CGCTCGCGCT GCTCGTCGAT CCACGTCTGG ACGCACTGAT CAGCGGCGAG
ACCGACTTCA CCGACCTGCC CGCGCTGATG CAGCGCCTCG CCGAAGCCCC CGCCGGGGCG
CTGTGCGAGC GCATCCGCTA TGCCAGTCCG AGCACCTGA
 
Protein sequence
MSSTNLAFWT VRPGYGELRP APLRPPADGE LRVRNLFGAV SRGSESLVFR GEVPESEYER 
MRAPFQEGDF PGPLKYGYIG VGVVEDGVGT AATALRGRTV FCLHPHQQRY VVPAGAVVPL
PAGVPAARAV LAANLETAIN ACWDGVPALG DRIAVVGAGV VGSLVAWLCA RLPGVELELI
DTDPGRAGLA AALGLVHRLP EQARGNCDLV FHASGNPAGL VRALELAGQD ATVVEMSWYG
RRSAELPLGA AFHARRLRLQ SSQVGRLPPP RSPRWDYRRR MELALALLVD PRLDALISGE
TDFTDLPALM QRLAEAPAGA LCERIRYASP ST