Gene Tmz1t_3105 details

Gene Information

Locus tag: Tmz1t_3105
Symbol:
ID: 7874574
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3360323
End bp: 3361354
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 67%
IMG OID: 643700027
Product: 4-hydroxy-2-ketovalerate aldolase
Protein accession: YP_002890079
Protein GI: 237653765
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR03217] 4-hydroxy-2-oxovalerate aldolase
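
The coordinates and lengths listed above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive (which matches the listed 1032 bp) and that the CDS is complete with a single terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 3360323
END_BP = 3361354


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints: 1032 343
```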


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0368513
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACTTC GCGGCAAGAA GATCACCGTC CACGACATGA CCCTGCGGGA CGGCATGCAC 
CCCAAGCGCC ACCTGATGAC GCTCGAGCAG ATGAAATCGA TCGCCACCGG GCTCGATGCG
GCGGGCGTAC CGCTGATCGA GGTCACCCAC GGCGACGGCC TCGGTGGCAG CTCGGTCAAT
TACGGCTTCC CGGCGCACAG CGACGAGGAA TACCTCGGCA CCGTGATCCC GCTGATGAAG
CAGGCCAAGG TCTCGGCGCT GCTGCTGCCG GGCATCGGCA CCGTCGATCA CCTCAAGATG
GCGCACGAGC TCGGCGTGTC CACCATCCGC GTCGCCACCC ACTGCACCGA GGCCGACGTC
TCCGAGCAGC ACATCGGCAT GGCGCGCAAG CTGGGCATGG ACACCGTCGG CTTCCTGATG
ATGGCGCACA TGAACAGCCC CGAAGGCCTG GTCACGCAGG CCAGGCTGAT GGAGAGCTAC
GGCGCCAACT GCATCTACGT CACCGACTCG GCCGGCCACC TGCTGCCCGA CACGGTGAAG
GCACGCCTGT CCGCGGTGCG TGACGCGCTC AAGCCCGAGA CCGAACTCGG CTTCCACGGC
CACCACAACC TCGCCATGGG CGTGGCCAAC AGCCTGGCCG CGCTGGAAGT GGGTGCCACC
CGCATCGACG CGGCCGCCGC GGGCCTGGGT GCCGGCGCGG GCAACACCCC GCTGGAGGTC
TTCATCGCGG TGTGCGACCT GATGGGCATC GAGACCGGCG TGGATGTGTT CAAGATCCAG
GACGTGGCCG AAGACCTGGT GGTGCCGATC ATGGACTTCC CGATCCGCAT CGACCGCGAT
GCGCTCACGC TCGGCTACGC CGGGGTGTAC GGCTCCTTCC TGCTGTTCGC CAAGCGCGCC
GAGAAGAAGT ACGGCGTGCC GGCGCGCGAG ATCCTGGTCG AGATGGGCAA GCGCGGCATG
GTCGGCGGCC AGGAAGACAT GATCGAGGAC ACCGCGCTCA ACCTCGCCAA GGCGCGCGGC
CTGGCGGTGT GA
 
Protein sequence
MELRGKKITV HDMTLRDGMH PKRHLMTLEQ MKSIATGLDA AGVPLIEVTH GDGLGGSSVN 
YGFPAHSDEE YLGTVIPLMK QAKVSALLLP GIGTVDHLKM AHELGVSTIR VATHCTEADV
SEQHIGMARK LGMDTVGFLM MAHMNSPEGL VTQARLMESY GANCIYVTDS AGHLLPDTVK
ARLSAVRDAL KPETELGFHG HHNLAMGVAN SLAALEVGAT RIDAAAAGLG AGAGNTPLEV
FIAVCDLMGI ETGVDVFKIQ DVAEDLVVPI MDFPIRIDRD ALTLGYAGVY GSFLLFAKRA
EKKYGVPARE ILVEMGKRGM VGGQEDMIED TALNLAKARG LAV
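
The relationship between the gene and protein sequences above can be checked with a short script. The sketch below is illustrative only: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and it relies on the fact that translation table 11 assigns codons to amino acids the same way as the standard code (the two differ in permitted start codons). It strips the spacing used in the 10-base blocks, translates up to the first stop codon, and computes the GC fraction. Only the first line of the gene sequence is used in the demonstration.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); table 11 shares
# these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, as listed in the record.
    first_line = clean(
        "ATGGAACTTC GCGGCAAGAA GATCACCGTC CACGACATGA CCCTGCGGGA CGGCATGCAC"
    )
    print(translate(first_line))  # prints: MELRGKKITVHDMTLRDGMH
    print(round(gc_fraction(first_line), 2))
```

Passing the full gene sequence through `clean` and `translate` in the same way should reproduce the listed protein sequence, and `gc_fraction` can be compared against the GC content reported in the record.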