Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2041 |
Symbol | |
ID | 7083801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2303294 |
End bp | 2305981 |
Gene Length | 2688 bp |
Protein Length | 895 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643699068 |
Product | CoA-binding domain protein |
Protein accession | YP_002355685 |
Protein GI | 217970451 |
COG category | [C] Energy production and conversion |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming) |
TIGRFAM ID | [TIGR02717] acetyl coenzyme A synthetase (ADP forming), alpha domain |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.171993 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAAA AGCACTATCT GACCCCGCTC CTCGAGCCGC GCTCCGTCGG CATCATCGGT GCCAGCGAAC GCGAGGCCTC CCTCGGCAGC GTGCTCATGC GCAACATGCT CGAGGCCGGC TACAAGGGCA AGCTGTTCGC GATCAACCCG AAGCACGAGA AGGTGCACGG GGTCGCCTGC TACAAGTCGG TGGAAGACGT GCCGCAGCGC CTGGACCTGG TGGTGATGGC AATCCGCGCC GAGAAGACCC CGGCGCTGAT GGAGGCCTGC GGCCGCGCCG GGGTGAAGGC GGTGATCCTG CTGTCGGGCG GCTTCTCCGA ATCGGGCGCG CGCGGCGCGC TGCTCGAGCG CCAGGTGGTC GAAGCCGCGC ACCGCCACCG CATCCGCCTG CTCGGCCCCA ATTGTCTGGG CGTCATGCGG CCGCAGCTCG GCGTCAACGC CACCTTCGCG CACGCCAGCG CACTCAAGGG CAGCATCGGC CTGATCTCGC AGTCGGGCGC GCTGTGCGCG GCCATCCTCG ACTGGGCCAA ACCGAACAAC GTCGGCTTCT CGACCGTGGT GTCCCTGGGC TCCTCGTCCG ACATCGACTT CGGCGAGGTG CTCGAATACA TGATCTCGGA CCCGCGCACC GAGAGCATCT TCCTCTACGT CGAAGGCATC CGCGACGCCC GCCGCTTCAT GAGCGCGCTG CGCGGCGCCG CGCGCGTCAA GCCGGTGCTG CTGGTCAAGG CCGGCCGCCA CCCCGGCGCC TCGCGCGCGA TCCTGTCGCA CAGCGGCGCG CCGATGGGCG AGGACGCGGT GTTCGACGCC GCGCTGCGTC GCGCCGGCGT GATCCGCCTG TACAACATGG GCCAGCTCTT CGCCGCGGCC AACGCGCTGT TCTCGCACTT CCGCCCGCGC GGCAACCGCC TCGCGATCAT CACCAACGGC GGCGGCCCGG GCGTCATGGC GGCCGACCGC GCGGCCGACA TCGGCATCCC GCTCGCGGAG TTCGCCGAGA GCACGGTCGA GAAGCTCAAC GCCTGCCTGC CCAGCGGCTG GTCGCACGGC AACCCGGTGG ACATCCTCGG CGACGCCGGC CCGGAACGTT ATCGCGCGGC GCTCAAGGCG GTGCTCGAAG GCCCCAACGT CGACGGCGTG CTGGTGATGC TGACCCCGCA GGCGGTGACC GACCCGAGCG GCGTCGCCGA CGTCGTGATC GAGCTCGAGA AGACCGCCGA CAAGCCGGTG CTGGTGTGCT GGATGGGCGA GGAGCTCGTC GCCGAGGCGC GCGCCAAGTT CAGCGCCGCG GGCATCCCGC ACTTCCGCAC GCCGGAGCCG GCGGTCGAGC TCTTCAGCCA CATCTCGGCC TACTACCAGA ACCAGAAGCT GCTGATGCAG ACGCCGTCCT CGCTGTCGCA CCTGCCGCCG CCCTCGATCG AGAGCGCGCG CCTGGTGATC GAGATGGCCC TGTCCGAGCG CCGCAAGAAG CTCAACGAGA TGGAGTCGAA GGCGCTGCTC GCGGCCTTCC GCATTCCGAT CGCGCAGACC GTGGTCGCGC GCAGTGCCGC CGAGGCGATG GTGCTCTCGG CCGAGATCGG CCTGCCGGTG GTGATGAAGA TCGACTCGCC GAACATCGTG CACAAGTCCG AGGTGGGCGG CGTGCGCCTG AACATCCGCA GCCTCGCCGC GGTGCGCTCG ACCTATCAGG AGATCCTCGA CGAGGTCAAG CGCGTGCAGC CCGAGGCGGT GATCAACGGC ATCGCGATCG AGCCGATGAT CCAGAAGCGC AACGGCCGCG AGCTGGTGGT GAGCGTGCGC CGCGACCCGG TGTTCGGCCC CGCGATCACC TTCGGCGAGG GCGGCAACCT GGTCGAGGAG AACCGCGACG TCGCGGTGGC CCTGCCGCCG CTCAACAGCT TCCTCGTCAA GGACATGATC CGCTCGACCC GCATCTCCAC CCGCCTCGGC GAGTTCCGCA ACATGCCGGC GGTGGACATG AACGCGCTCG AGCTGGTGCT GTTGCGCATC TCCGAGATGG TCTGCGAGCT GCCGTGGATC ACCGCGATGG AGATCAACCC GCTGATCGTG GACGAGAACG GCGTGGTCGC GGTCGACGCG AACATCAGTG TGGAAAACGT GTCTCCGACG GCGGACCGCT ACGACCACGT CGCGATCCAC CCGTATCCGT CGCACCTGAT CAGCACCTGG ACCGTGCCCG ACGGCACCAC GGTGACGATC CGTCCGATCA AGCCGGAGGA CGCCGAGCTC GAGGTCGACT TCGTGCGCCG CCTGTCGGCC GAGACCAAGT ACTACCGCTT CATGAACACC ATGCGCGAGC TGCCGCCCGC GATGGTCGCC CGCCTCACCC AGATCGACTA CGACCGCGAG ATGGCCTTCG TCGCCACCCT CGAGGCCGAC GGCGTAGAGA ACGAGATCGG TGTGTGCCGC TACGCGGTGA ACCCCGACGG CGAGTCCTGC GAGTTCGCGG TCGTGGTGGC CGACGACTGG CAGCACCGCG GCCTCGCCCG CAAGCTCATG GGCGTACTGA TCGAGACCGC GCGCAGCCGT GGCATCCAGT ACATGAACGG CGTCTTCCTC GCCAACAACG AGCGCATGCT CAAGTTCGTG CAGAAGCTCG GCTTCGTGCT CAGCAACGAC CCGGAAGACA GCTCGGTCAA GCTAGGCATT CTGGCGCTGC AGGACTGA
|
Protein sequence | MKEKHYLTPL LEPRSVGIIG ASEREASLGS VLMRNMLEAG YKGKLFAINP KHEKVHGVAC YKSVEDVPQR LDLVVMAIRA EKTPALMEAC GRAGVKAVIL LSGGFSESGA RGALLERQVV EAAHRHRIRL LGPNCLGVMR PQLGVNATFA HASALKGSIG LISQSGALCA AILDWAKPNN VGFSTVVSLG SSSDIDFGEV LEYMISDPRT ESIFLYVEGI RDARRFMSAL RGAARVKPVL LVKAGRHPGA SRAILSHSGA PMGEDAVFDA ALRRAGVIRL YNMGQLFAAA NALFSHFRPR GNRLAIITNG GGPGVMAADR AADIGIPLAE FAESTVEKLN ACLPSGWSHG NPVDILGDAG PERYRAALKA VLEGPNVDGV LVMLTPQAVT DPSGVADVVI ELEKTADKPV LVCWMGEELV AEARAKFSAA GIPHFRTPEP AVELFSHISA YYQNQKLLMQ TPSSLSHLPP PSIESARLVI EMALSERRKK LNEMESKALL AAFRIPIAQT VVARSAAEAM VLSAEIGLPV VMKIDSPNIV HKSEVGGVRL NIRSLAAVRS TYQEILDEVK RVQPEAVING IAIEPMIQKR NGRELVVSVR RDPVFGPAIT FGEGGNLVEE NRDVAVALPP LNSFLVKDMI RSTRISTRLG EFRNMPAVDM NALELVLLRI SEMVCELPWI TAMEINPLIV DENGVVAVDA NISVENVSPT ADRYDHVAIH PYPSHLISTW TVPDGTTVTI RPIKPEDAEL EVDFVRRLSA ETKYYRFMNT MRELPPAMVA RLTQIDYDRE MAFVATLEAD GVENEIGVCR YAVNPDGESC EFAVVVADDW QHRGLARKLM GVLIETARSR GIQYMNGVFL ANNERMLKFV QKLGFVLSND PEDSSVKLGI LALQD
|
| |