Gene Tmz1t_2079 details


Gene Information

Locus tag: Tmz1t_2079
Symbol:
ID: 7085349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 2353235
End bp: 2354272
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 66%
IMG OID: 643699099
Product: 4-hydroxy-2-ketovalerate aldolase
Protein accession: YP_002355716
Protein GI: 217970482
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR03217] 4-hydroxy-2-oxovalerate aldolase
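
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2353235, 2354272)
print(length, protein_length_aa(length))  # 1038 345
```

Both results match the Gene Length and Protein Length values listed in this record.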


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.636454
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACTTC GCGGCAAGAA GATCACCGTC CACGACATGA CCCTGCGTGA CGGCATGCAC 
CCCAAGCGCC ACCTGATGAC GCTCGAGCAG ATGAAGACCA TCGCCGTCGG CCTGGACGAA
GCGGGCATCC CGCTGATCGA GGTCACCCAC GGCGATGGTC TGGGCGGCAG CTCGGTGAAC
TACGGCTTCC CCGCCCACAG CGACGAGGAA TACCTCGGCG CGGTGATCCC GCTGATGAAG
CAGGCCAAGG TCTCGGCGCT GCTGCTGCCG GGCATCGGCA CCGTCGATCA CCTGAAGATG
GCGCACGAGA TCGGGGTCTC CACCATCCGG GTGGCCACCC ACTGCACCGA GGCCGACGTC
TCCGAGCAGC ACATCGGCAT GGCCCGCAAG CTCGGCATGG ACACCGTCGG CTTCCTGATG
ATGGCGCACA TGAACAGCCC CGAAGGGCTC GTGAAGCAGG CCAAGCTCAT GGAGAGCTAC
GGCGCCAACT GCATCTACGT CACCGACTCG GCCGGGCACC TGCTGCCCGA CACGGTCAAG
TCGCGCCTCA GTGCCGTGCG GGACGCGCTG AAACCGGAAA CGGAACTGGG CTTTCACGGC
CACCACAACC TCGCCATGGG CGTGGCCAAC AGCCTCGCGG CGCTCGAAGT CGGCGCCACC
CGTATCGACG CCGCCGCCGC CGGGCTGGGT GCCGGTGCGG GCAACACCCC GATGGAGGTC
TTCATCGCGG TGTGCGACCT GATGGGAATC GAGACCGGCG TGGACGTGTT CAAGATCCAG
GACGTGGCCG AGGACCTGGT GGTGCCGATC ATGGACTTCC CGATCCGCAT CGACCGCGAC
GCGCTCACGC TGGGCTATGC CGGGGTGTAT GGCTCCTTCC TGCTGTTCGC CAAGCGGGCC
GAGAAGAAGT ACGGCGTGCC CGCGCGCGAG ATCCTGGTCG AGATGGGCCG GCGCGGCATG
GTCGGCGGGC AGGAGGACAT GATCGAGGAT ACGGCGCTGA ATCTGGCGCG GGCGAAGGGG
ATTGCTCCAT CTGCATAA
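
The GC content field in this record can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is simply the percentage of G and C bases over the whole sequence, and that the spaces and line breaks in the listing are only formatting. The helper name gc_percent is hypothetical, and how the database itself rounds the value is not stated here.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence, as a small worked example.
print(gc_percent("ATGGAACTTC"))  # 40.0
```

Passing the full 1038-base sequence to the same function gives a value that can be compared against the listed GC content.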
 
Protein sequence
MELRGKKITV HDMTLRDGMH PKRHLMTLEQ MKTIAVGLDE AGIPLIEVTH GDGLGGSSVN 
YGFPAHSDEE YLGAVIPLMK QAKVSALLLP GIGTVDHLKM AHEIGVSTIR VATHCTEADV
SEQHIGMARK LGMDTVGFLM MAHMNSPEGL VKQAKLMESY GANCIYVTDS AGHLLPDTVK
SRLSAVRDAL KPETELGFHG HHNLAMGVAN SLAALEVGAT RIDAAAAGLG AGAGNTPMEV
FIAVCDLMGI ETGVDVFKIQ DVAEDLVVPI MDFPIRIDRD ALTLGYAGVY GSFLLFAKRA
EKKYGVPARE ILVEMGRRGM VGGQEDMIED TALNLARAKG IAPSA
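
The protein sequence follows from the gene sequence by codon-by-codon translation. Below is a minimal sketch, assuming the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the tables differ in which codons may act as start codons, which does not matter for this gene because it begins with ATG). The function translate and the table-building code are illustrative and do not come from any particular library.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGGAACTTC GCGGCAAGAA GATCACCGTC"))  # MELRGKKITV
```

The output matches the first block of the protein sequence above, and the final codons of the gene (ATT GCT CCA TCT GCA TAA) translate to IAPSA followed by a stop, which matches the end of the protein.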