Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1096 |
Symbol | |
ID | 7084625 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1195813 |
End bp | 1197687 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643698112 |
Product | thiamine pyrophosphate protein domain protein TPP-binding |
Protein accession | YP_002354752 |
Protein GI | 217969518 |
COG category | [C] Energy production and conversion |
COG ID | [COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits |
TIGRFAM ID | [TIGR03336] indolepyruvate ferredoxin oxidoreductase, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.12158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGCTG CCCTGCCCGA AACCGCCCCG AGCCGCCTGT TCCTGTCCGG CAACGAGGCC GTGGCGCTCG CCGTGCGCGA CGCCGGCTGC CGCGTCGCCG CCGCCTATCC GGGCACGCCG GCGACCGAGA TCCTCGAGGA GCTGTCGCGC TTTCCCGACC TGTACACCGA ATGGTCGGTG AACGAGAAGG TCTCGCTCGA GGTCGCGCTC GGCGCCTCGA TGGTGGGCGC GCGCGCCTTC TGCGCGATGA AGCACGTCGG CCTCAACGTC GCCGCCGACG CGCTGATGAC CATGACCGTC ACCGGCACCG AAGGCGGCCT CGTGATCGCG GTCGCCGACG ACGTCGGCAT GTCCTCGTCG CAGAACGAGC AGGACTCGCG CTTCTGGGCG CGCTTCGCCC ACCTGCCGCT GTTCGAGCCC GCCGACGCGC AGGAGTCCTA CGACATGGCG CGCGAGGCCT TCGCGGTGTC CGAGCGCTTC CGCTGTCCGG CGATCGTGCG CCTGACCACC CGCATCTGCC ACGTCAAGGG CCGCGTCGTC GCGGGCGAGC GCGAGGCCCA CGAGCCCGCC GGCTTCGTCA AGGATCCGCA GCGCTGGGTC ATGGTGCCCG GCCACGCCAA GCCGCGCGTG GCGCTGATGT ACGAGCGCGA GGAGGCCTTG CGCGCCGCCT CCGCGCAGAG CCCGTTCAAC CTGCTCGTCG AGGGCAGCGA CCGCCGCATC GGCTTCGTGA CCTCGGGCCC GGCCTACATG CACGTGCGCG AGACCTTCCC CGACGCGCCG GTGTTCAAGC TCGGCCAGTC CTACCCGCTG CCGCTCGAGC GCCTGCGCGA GTTCGCCGCC GGCGTCGACC AGCTCCTGGT GGTCGAAGAG ACCGAGCCCC TGGTCGAAAG CGAGCTGCGC GCCGGCGGCA TCGCCTGCAC GGGCAAGGAC GTGCTGCCGC GCGTCGGCGA GCTCTCGCCC GACCGCCTGC GCCCGGCCGT GGCCCGCCTG CGCGGCGAGG AAATCCCAGT CACCGCGCAG CCCCGCCTGG TGCCGCAGCA GGTCTTCCCG CGCCCGCCCA CCATGTGCGT CGCCTGTCCG CACCTGGGCA TCTATTACAC CCTCGCGCAG CTGCGCAACC TCACCATCTC GGGCGACATC GGCTGCTACA CCCTGGGCGC CGGCCACCCC TGGAACGCGC TCGACACCTG CATCTCCATG GGCGCCTCGA TGGGCGTGGC GCTCGGCATG GACAAGGGCC GCGGCCAGGC CGACGCCAAG AAGGCGGTGA TCGCCGTGAT CGGCGACTCC ACCTTCATGC ACATGGGCAT GCAGGGCCTG CTCGACATCA CCTGGAACCG CGGCAACGTC ACCGTGCTGC TGCTCGACAA CCGCGCGGTG GGCATGACCG GCGGCCAGGA CAACCCCGGC ACCGGCCGCG ACATCCACGG CGAGAGCGCG CAGCGGGTCG ACTTCGCCAA GCTGTGCGAG GCGCTCGGCG TCAAGAAGGA GCGCATCCAC ACGCTCGACC CCTACGAGCT GCCCACGCTG TTCAAGACCC TGCGCGAGGA GATCAAGATC CCCGAGCCCT CGGTGATCAT CACCGACCGG CCCTGCGTGC TGATCGACCA CTACAAGCCC ACGCAGCCCT ACAAGGTGAT CGCGGACAAG TGCACCGGCT GCGCCAACTG CATCGACGTC GGCTGCCCGG CGATCCACGT CACCCGGCGC GAGACCCAGG TCAAGCCCTC GGGCAAGGAA GTCGAGCTCG CCTTCGTGCG CATCGAGACC TCCGCCTGCA CCGGCTGCGG CCTGTGCGTG CAGCCGTGTG CGCCCGAAGC CATCGTCCAT GCCCTGCCGG AGCATCCGGT GAAGTTCCTG CACGCCAAGG TCTGA
|
Protein sequence | MGAALPETAP SRLFLSGNEA VALAVRDAGC RVAAAYPGTP ATEILEELSR FPDLYTEWSV NEKVSLEVAL GASMVGARAF CAMKHVGLNV AADALMTMTV TGTEGGLVIA VADDVGMSSS QNEQDSRFWA RFAHLPLFEP ADAQESYDMA REAFAVSERF RCPAIVRLTT RICHVKGRVV AGEREAHEPA GFVKDPQRWV MVPGHAKPRV ALMYEREEAL RAASAQSPFN LLVEGSDRRI GFVTSGPAYM HVRETFPDAP VFKLGQSYPL PLERLREFAA GVDQLLVVEE TEPLVESELR AGGIACTGKD VLPRVGELSP DRLRPAVARL RGEEIPVTAQ PRLVPQQVFP RPPTMCVACP HLGIYYTLAQ LRNLTISGDI GCYTLGAGHP WNALDTCISM GASMGVALGM DKGRGQADAK KAVIAVIGDS TFMHMGMQGL LDITWNRGNV TVLLLDNRAV GMTGGQDNPG TGRDIHGESA QRVDFAKLCE ALGVKKERIH TLDPYELPTL FKTLREEIKI PEPSVIITDR PCVLIDHYKP TQPYKVIADK CTGCANCIDV GCPAIHVTRR ETQVKPSGKE VELAFVRIET SACTGCGLCV QPCAPEAIVH ALPEHPVKFL HAKV
|
| |