Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3129 |
Symbol | |
ID | 7874271 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3384862 |
End bp | 3387045 |
Gene Length | 2184 bp |
Protein Length | 727 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643700057 |
Product | malate synthase G |
Protein accession | YP_002890103 |
Protein GI | 237653789 |
COG category | [C] Energy production and conversion |
COG ID | [COG2225] Malate synthase |
TIGRFAM ID | [TIGR01345] malate synthase G |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.936124 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAAC GAGTGAGTGT CCAGCGTCTG CAGGTCGCAG CGAACCTGAA GCGTTTCATT GAAGAAGAAG CCCTGCCGGG CAGCGGTGTC GACGCTGCCG CCTTCTGGTC CGGGTTCGAT GCGCTGGTAC ACGATCTCGC ACCGAAGAAC GCCGCTCTGC TCGCCGAACG CGATCGCATC CAGGCCGAGA TCGACGCCTG GCATCGCGCC CACCCGGGGC CGATCGCCGA CATGGCCGCG TACAAGGCCT TCCTGTCCTC GATCGGCTAC CTGCTGCCGG TGCCCGCCGG TGCCAAGGCC ACCACCACCA ACGTCGATGA CGAGCTCGCC GTGCAGGCCG GCCCGCAGCT CGTGGTGCCG GTGATGAACG CGCGCTACGC GCTCAACGCC GCCAACGCGC GCTGGGGCTC GCTCTACGAC GCGCTCTACG GTACCGACGC CATCCCGCAG AAGGACGGCG CCGAGCTCAC CAAGGGCTAC AACCCGGTGC GCGGCGCGAA GGTCATCGCC TTCGGCCGCC AGGTGCTCGA CCAGGCCGCG CCGCTCGCCG GTGCCTCGCA TGCCGACGCC GCGGGCTACG CGGTCGAGGC CGGCCGCCTG GTGGTGAAGC TGCAGAGCGG TGCCAGCACC GGCCTGCAGC AGGCCGAGAA GTTCGTCGGC TTCCAGGGCG AGGCCGCCGC GCCGCGCGCC GTGCTGCTGA AGAACAACGG CTTGCACATC GAGATCCAGA TCGACCGCAG CGGCGCCATC GGCAAGTCCG ATGCCGCGGG CGTGAACGAC CTGCTGATGG AAGCCGCGCT GTCGACCATC ATGGACTGCG AGGACTCGGT CGCCGCGGTG GATGCCGACG ACAAGGTCGT GGTGTACCGC AACTGGCTCG GCTTGATGGA CGGTACGCTG GAAGACACCT TCGACAAGGG CGGCAAGCCG ATGACGCGGC GTCTGAACGC CGACCGCGAG TACACCGGTG CCGACGGCAA GCCGGTAAAG CTGCACGGTC GCGCGCTGCT CTTCGTGCGC AACGTCGGCC ACCTGATGAC CAACCCGGCC ATCCTTTGGG GCGAGGGCAA GGAAATCCCC GAAGGCATCA TGGATGCGGT GGTCACCACG CTGATCGCCA AGCGCGACCT CGAGCGCCGT GGCAACTCGC GCAAGGGCAG CATCTACATC GTCAAGCCCA AGATGCACGG CCCGGCGGAG ATCGGCTTCG CCGACGAGCT GTTCACCCGC GTCGAGCAGC TGCTCGGGCT GCCGGCCAAC ACCGTCAAGC TCGGCATCAT GGACGAGGAG CGCCGCACCA GCGTCAACCT GCGTGCCTGC ATCGCCGCCG CGCCGGCGCG CGTGGCCTTC ATCAATACCG GCTTCCTCGA TCGCACCGGC GACGAGATGC ACACCTCGAT GGAGGCCGGC GCGATGATGC GCAAGGGCGA CATCAAGAAC AGCAAGTGGA TCGCCGCCTA CGAGCGCCGC AACGTGCTCA TCGGTCTGGC CGCCGGCCTG CGCGGCCGCG CCCAGATCGG CAAGGGCATG TGGGCCATGC CCGACCTGAT GGCCGCGATG CTCGAGCAGA AGATCGGCCA TCCCAAGGCG GGCGCCAACA CCGCCTGGGT GCCGTCGCCG ACCGCCGCCA CCCTGCATGC GCTGCATTAC CACCAGGTCA GCGTGGCCGC GGTGCAGCAG GACATCGAGA AGCTCGACCT CGACAAGGAA GCCGAGGCGC TGCTCGACGA CCTGCTCACC ATCCCGGTGG TGGCCAAGGC CGAGTGGAGC GAGACCGAGC GCCAGCAGGA ACTCGACAAC AACTGCCAGG GCATCCTCGG CTACGTGGTG CGCTGGGTGG AGCAGGGCGT GGGCTGCTCG AAGGTGCCCG ACATCAACGA CATCGGCCTG ATGGAAGACC GCGCCACGCT GCGCATCTCC AGCCAGCACA TCGCCAACTG GCTGCGCCAT GGCGTGGTGC CCGCCGAGCA GGTCGAGGCC ACGCTCGAGC GCATGGCCGC GGTGGTCGAT GGCCAGAACG CGGGCGACCC GCTGTACCGC CCGATGGCGC CGAACTTCGA CGACTCGGTG GCCTACCAGG CCGCGCGCGC GCTGATCTTC GAGGGCTGCG CCCAGCCGAG CGGCTACACC GAGCCGCTGC TGCACCGCTA TCGCCAGCAG TTCAAGGCCA AGGTCGGCGC CTGA
|
Protein sequence | MTERVSVQRL QVAANLKRFI EEEALPGSGV DAAAFWSGFD ALVHDLAPKN AALLAERDRI QAEIDAWHRA HPGPIADMAA YKAFLSSIGY LLPVPAGAKA TTTNVDDELA VQAGPQLVVP VMNARYALNA ANARWGSLYD ALYGTDAIPQ KDGAELTKGY NPVRGAKVIA FGRQVLDQAA PLAGASHADA AGYAVEAGRL VVKLQSGAST GLQQAEKFVG FQGEAAAPRA VLLKNNGLHI EIQIDRSGAI GKSDAAGVND LLMEAALSTI MDCEDSVAAV DADDKVVVYR NWLGLMDGTL EDTFDKGGKP MTRRLNADRE YTGADGKPVK LHGRALLFVR NVGHLMTNPA ILWGEGKEIP EGIMDAVVTT LIAKRDLERR GNSRKGSIYI VKPKMHGPAE IGFADELFTR VEQLLGLPAN TVKLGIMDEE RRTSVNLRAC IAAAPARVAF INTGFLDRTG DEMHTSMEAG AMMRKGDIKN SKWIAAYERR NVLIGLAAGL RGRAQIGKGM WAMPDLMAAM LEQKIGHPKA GANTAWVPSP TAATLHALHY HQVSVAAVQQ DIEKLDLDKE AEALLDDLLT IPVVAKAEWS ETERQQELDN NCQGILGYVV RWVEQGVGCS KVPDINDIGL MEDRATLRIS SQHIANWLRH GVVPAEQVEA TLERMAAVVD GQNAGDPLYR PMAPNFDDSV AYQAARALIF EGCAQPSGYT EPLLHRYRQQ FKAKVGA
|
| |