Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3105 |
Symbol | |
ID | 7874574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3360323 |
End bp | 3361354 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643700027 |
Product | 4-hydroxy-2-ketovalerate aldolase |
Protein accession | YP_002890079 |
Protein GI | 237653765 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR03217] 4-hydroxy-2-oxovalerate aldolase |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0368513 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACTTC GCGGCAAGAA GATCACCGTC CACGACATGA CCCTGCGGGA CGGCATGCAC CCCAAGCGCC ACCTGATGAC GCTCGAGCAG ATGAAATCGA TCGCCACCGG GCTCGATGCG GCGGGCGTAC CGCTGATCGA GGTCACCCAC GGCGACGGCC TCGGTGGCAG CTCGGTCAAT TACGGCTTCC CGGCGCACAG CGACGAGGAA TACCTCGGCA CCGTGATCCC GCTGATGAAG CAGGCCAAGG TCTCGGCGCT GCTGCTGCCG GGCATCGGCA CCGTCGATCA CCTCAAGATG GCGCACGAGC TCGGCGTGTC CACCATCCGC GTCGCCACCC ACTGCACCGA GGCCGACGTC TCCGAGCAGC ACATCGGCAT GGCGCGCAAG CTGGGCATGG ACACCGTCGG CTTCCTGATG ATGGCGCACA TGAACAGCCC CGAAGGCCTG GTCACGCAGG CCAGGCTGAT GGAGAGCTAC GGCGCCAACT GCATCTACGT CACCGACTCG GCCGGCCACC TGCTGCCCGA CACGGTGAAG GCACGCCTGT CCGCGGTGCG TGACGCGCTC AAGCCCGAGA CCGAACTCGG CTTCCACGGC CACCACAACC TCGCCATGGG CGTGGCCAAC AGCCTGGCCG CGCTGGAAGT GGGTGCCACC CGCATCGACG CGGCCGCCGC GGGCCTGGGT GCCGGCGCGG GCAACACCCC GCTGGAGGTC TTCATCGCGG TGTGCGACCT GATGGGCATC GAGACCGGCG TGGATGTGTT CAAGATCCAG GACGTGGCCG AAGACCTGGT GGTGCCGATC ATGGACTTCC CGATCCGCAT CGACCGCGAT GCGCTCACGC TCGGCTACGC CGGGGTGTAC GGCTCCTTCC TGCTGTTCGC CAAGCGCGCC GAGAAGAAGT ACGGCGTGCC GGCGCGCGAG ATCCTGGTCG AGATGGGCAA GCGCGGCATG GTCGGCGGCC AGGAAGACAT GATCGAGGAC ACCGCGCTCA ACCTCGCCAA GGCGCGCGGC CTGGCGGTGT GA
|
Protein sequence | MELRGKKITV HDMTLRDGMH PKRHLMTLEQ MKSIATGLDA AGVPLIEVTH GDGLGGSSVN YGFPAHSDEE YLGTVIPLMK QAKVSALLLP GIGTVDHLKM AHELGVSTIR VATHCTEADV SEQHIGMARK LGMDTVGFLM MAHMNSPEGL VTQARLMESY GANCIYVTDS AGHLLPDTVK ARLSAVRDAL KPETELGFHG HHNLAMGVAN SLAALEVGAT RIDAAAAGLG AGAGNTPLEV FIAVCDLMGI ETGVDVFKIQ DVAEDLVVPI MDFPIRIDRD ALTLGYAGVY GSFLLFAKRA EKKYGVPARE ILVEMGKRGM VGGQEDMIED TALNLAKARG LAV
|
| |