Gene Information |
Locus tag | Tmz1t_2079 |
Symbol | |
ID | 7085349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2353235 |
End bp | 2354272 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643699099 |
Product | 4-hydroxy-2-ketovalerate aldolase |
Protein accession | YP_002355716 |
Protein GI | 217970482 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR03217] 4-hydroxy-2-oxovalerate aldolase |
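
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the terminal stop codon is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2353235
end_bp = 2354272

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```

Under these assumptions the computed values match the Gene Length (1038 bp) and Protein Length (345 aa) fields.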
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.636454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACTTC GCGGCAAGAA GATCACCGTC CACGACATGA CCCTGCGTGA CGGCATGCAC CCCAAGCGCC ACCTGATGAC GCTCGAGCAG ATGAAGACCA TCGCCGTCGG CCTGGACGAA GCGGGCATCC CGCTGATCGA GGTCACCCAC GGCGATGGTC TGGGCGGCAG CTCGGTGAAC TACGGCTTCC CCGCCCACAG CGACGAGGAA TACCTCGGCG CGGTGATCCC GCTGATGAAG CAGGCCAAGG TCTCGGCGCT GCTGCTGCCG GGCATCGGCA CCGTCGATCA CCTGAAGATG GCGCACGAGA TCGGGGTCTC CACCATCCGG GTGGCCACCC ACTGCACCGA GGCCGACGTC TCCGAGCAGC ACATCGGCAT GGCCCGCAAG CTCGGCATGG ACACCGTCGG CTTCCTGATG ATGGCGCACA TGAACAGCCC CGAAGGGCTC GTGAAGCAGG CCAAGCTCAT GGAGAGCTAC GGCGCCAACT GCATCTACGT CACCGACTCG GCCGGGCACC TGCTGCCCGA CACGGTCAAG TCGCGCCTCA GTGCCGTGCG GGACGCGCTG AAACCGGAAA CGGAACTGGG CTTTCACGGC CACCACAACC TCGCCATGGG CGTGGCCAAC AGCCTCGCGG CGCTCGAAGT CGGCGCCACC CGTATCGACG CCGCCGCCGC CGGGCTGGGT GCCGGTGCGG GCAACACCCC GATGGAGGTC TTCATCGCGG TGTGCGACCT GATGGGAATC GAGACCGGCG TGGACGTGTT CAAGATCCAG GACGTGGCCG AGGACCTGGT GGTGCCGATC ATGGACTTCC CGATCCGCAT CGACCGCGAC GCGCTCACGC TGGGCTATGC CGGGGTGTAT GGCTCCTTCC TGCTGTTCGC CAAGCGGGCC GAGAAGAAGT ACGGCGTGCC CGCGCGCGAG ATCCTGGTCG AGATGGGCCG GCGCGGCATG GTCGGCGGGC AGGAGGACAT GATCGAGGAT ACGGCGCTGA ATCTGGCGCG GGCGAAGGGG ATTGCTCCAT CTGCATAA
|
Protein sequence | MELRGKKITV HDMTLRDGMH PKRHLMTLEQ MKTIAVGLDE AGIPLIEVTH GDGLGGSSVN YGFPAHSDEE YLGAVIPLMK QAKVSALLLP GIGTVDHLKM AHEIGVSTIR VATHCTEADV SEQHIGMARK LGMDTVGFLM MAHMNSPEGL VKQAKLMESY GANCIYVTDS AGHLLPDTVK SRLSAVRDAL KPETELGFHG HHNLAMGVAN SLAALEVGAT RIDAAAAGLG AGAGNTPMEV FIAVCDLMGI ETGVDVFKIQ DVAEDLVVPI MDFPIRIDRD ALTLGYAGVY GSFLLFAKRA EKKYGVPARE ILVEMGRRGM VGGQEDMIED TALNLARAKG IAPSA
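
The sequences above are printed in space-separated groups of 10 characters. A minimal sketch of how such a field could be cleaned and sanity-checked follows. The helper names are hypothetical, the start-codon check accepts only ATG (translation table 11 also permits alternative start codons, which this sketch ignores), and the stop codons listed are those of table 11.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace separating the 10-character groups."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """Check length is a multiple of 3, ATG start, and a table 11 stop codon.

    Simplification: alternative start codons are not accepted here.
    """
    stop_codons = {"TAA", "TAG", "TGA"}
    return (
        len(dna) % 3 == 0
        and dna.startswith("ATG")
        and dna[-3:] in stop_codons
    )


# Usage with the first two groups of the gene sequence shown above.
prefix = clean_sequence("ATGGAACTTC GCGGCAAGAA")
print(prefix)
print(round(gc_percent(prefix), 1))
```

Pasting the full Gene sequence field into `clean_sequence` and passing the result to `gc_percent` and `looks_like_complete_cds` gives values that can be compared with the GC content and Gene Length fields.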