Gene Information |
Locus tag | Tmz1t_2879 |
Symbol | |
ID | 7873781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3117950 |
End bp | 3119773 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643699800 |
Product | thiamine pyrophosphate protein TPP binding domain protein |
Protein accession | YP_002889855 |
Protein GI | 237653541 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
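
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked directly. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(3117950, 3119773)
print(length)                  # 1824
print(protein_length(length))  # 607
```

Both results match the Gene Length (1824 bp) and Protein Length (607 aa) fields.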
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.62815 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCATCC GCGTTGCCGA CTGGCTGCTC GCCCGCCTTG CCGACGAGGG CATCCACCAC ATCTTCATGC TGCCCGGCGG CGGTGCCATG TACCTCAACG ACGCCCTCGC CTGTGAACCC CGCGTGAACG CCGTGCCCTG CCACCATGAG CAGGCTGCCG CCATCGCCGC CGAAGCCGCC GGCCGGACCG GCAACGCCGG CAACCCGGGT TTCGGCGTGG CGATGGTTAC CACCGGCCCC GGCGCCACCA ATGCCATCAC CCCGGTGGCC GGCGCGTGGA TCGACTCGGT GCCGATGCTG GTGCTCTCCG GCCAGGCCAA GCGCCCCGAC CGCCTGGGCG GCCGCCCGAT CCGCCAGGGC GGCGTGCAGG AGGTGGACAT CGTCCCCATC GTCAGTCCGA TCACCAAGTA CGCCGTGACG CTGGACGACC CACAGTCGGT GCGCGTCCAT CTCGAAAAAG CCCTGCACCT CATGAAGACC GGCCGCCCCG GCCCGGTGTG GATCGACGTG CCCCTCGACG TGCAGGCCGC GCCCATCGAC CCGGCCACCC TGCCCGGCTG GACGCCGCCG GCCGACGCCA CCGCACCGAT GCCCGATCTC ACCCCGGTGC TGACGATGCT CGCCGAGGCC AAGCGCCCGC TCATTCTCGC CGGCCACGGC GTGCGCCTCG CCGGTGCCGC CGACGCCTTC CGCCAACTCG TGGATCACCT CCAGGTGCCC GCCGTGCTCA CCTGGAACGC GCTCGACCTG CTGCCCTACG ATCATCCGCT CAACATCGGT CGCCCCGGCG TGGTCGCCGC CCGCGCTGCG AACTTCGCGG TGCAGAACTG CGACCTGCTG ATCAGCATCG GCGCCCGCCT CGACATGATC GTCACCGCCT ACAACCCCAA AGGCTTCGCC CGCGCCGCGC GCAAGGTGGT GGTGGACGTG GACGCCAACG AGCTGGCCGA CAAGACCGCG ATGGCCATCG ACCAACCCCT GGCGATGGAC GCCGGCGACT TCATCCAAAC CCTGCTGGCC GCCGCCATAC CCGGCGACAC CACCGACTGG CGCGCCCGCT GCACCCGCTG GAAGGCCCGC TACACGCAGA ACGAGGGCCG CGTCTTCCCG CCTTCCGGCC CCATCGGCCA CGCCCACTTC GTCGAGGCCC TGTCAGATGC TGCGCCGGCC GACACTCTTA TTGCCACCGG CAGCTCCGGC CTGGCAGTGG AGTTCTTCTA CGCTGGTTTT CGCAACAAAC GGGGCCAGCG CACCTTCCTC ACCTCCGGCC TCGGCGCGAT GGGCTACGGT CTGCCGGCAG CGATCGGCGC CTGCCTCGGC AACGACCGCA AGCCCATGCT GGCCGTGGAA TCGGATGGCA GTCTGCAGCT CAACCTGCAG GAGCTTGCCA CCCTCACCGG TCTGCAGCTG CCGATCTGCC TGTTCATCAT GAACAACGGC GGCTACGCCT CCATTCGCAA CACCCAGCGC AACTACTTCA ATGGGCGCTA CGTGGGCAGC GGCCCGGCCT CGGGGCTATT CATGCCCGAT CTCGAAAAAC TCGCCGCAGT GTATGGTCTG CCGTACCTGC GCATCGACGA CTGCGCCGAA CTCGCCGCCG CGCTGGCCCG CGCCCAGGCG CTGCCGCGCC CCTGCCTCAT CGACGTACGC CTGATTCCGG AAGAGAGCCT GCAGCCCAAG TGCGCCGCCA TTCCCCGGGC GGACGGCTCC ATCATTTCAA TGCCGCTGGA GGACATGAGC CCCCTGCTGC CCCTGGAAAC CCTCGAGGCC GAGATGATCG TGCCGCTCCT GCCTGCTTCG 
CTCGACGCCC CGCGGCCCGC CTGA
Protein sequence | MTIRVADWLL ARLADEGIHH IFMLPGGGAM YLNDALACEP RVNAVPCHHE QAAAIAAEAA GRTGNAGNPG FGVAMVTTGP GATNAITPVA GAWIDSVPML VLSGQAKRPD RLGGRPIRQG GVQEVDIVPI VSPITKYAVT LDDPQSVRVH LEKALHLMKT GRPGPVWIDV PLDVQAAPID PATLPGWTPP ADATAPMPDL TPVLTMLAEA KRPLILAGHG VRLAGAADAF RQLVDHLQVP AVLTWNALDL LPYDHPLNIG RPGVVAARAA NFAVQNCDLL ISIGARLDMI VTAYNPKGFA RAARKVVVDV DANELADKTA MAIDQPLAMD AGDFIQTLLA AAIPGDTTDW RARCTRWKAR YTQNEGRVFP PSGPIGHAHF VEALSDAAPA DTLIATGSSG LAVEFFYAGF RNKRGQRTFL TSGLGAMGYG LPAAIGACLG NDRKPMLAVE SDGSLQLNLQ ELATLTGLQL PICLFIMNNG GYASIRNTQR NYFNGRYVGS GPASGLFMPD LEKLAAVYGL PYLRIDDCAE LAAALARAQA LPRPCLIDVR LIPEESLQPK CAAIPRADGS IISMPLEDMS PLLPLETLEA EMIVPLLPAS LDAPRPA
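
The relationship between the two sequences can be illustrated with a short translation sketch. It is a minimal example, not the pipeline used to produce this record. It strips the 10-base grouping spaces, translates with the sense-codon assignments shared by NCBI translation tables 1 and 11, and computes a GC fraction. Only the first 30 bases of the gene sequence are used, so the GC value printed is for that prefix and not the 70% reported for the full gene. The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino-acid assignments
# are identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace used for 10-base grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon, if any."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGACCATCC GCGTTGCCGA CTGGCTGCTC"
print(translate(prefix))              # MTIRVADWLL
print(round(gc_fraction(prefix), 3))  # 0.633
```

The translated prefix matches the first ten residues of the protein sequence above.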