Gene Information |
Locus tag | Tmz1t_0101 |
Symbol | |
ID | 7083484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 115164 |
End bp | 116792 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643697148 |
Product | indolepyruvate/phenylpyruvate decarboxylase |
Protein accession | YP_002353797 |
Protein GI | 217968563 |
COG category | [G] Carbohydrate transport and metabolism; [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG3961] Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes |
TIGRFAM ID | [TIGR03394] indolepyruvate/phenylpyruvate decarboxylase, Azospirillum family |
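The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the numbers in this record, but the record does not state them. The function names `cds_span` and `protein_length` are illustrative only.

```python
def cds_span(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(span_bp):
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    codons, remainder = divmod(span_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


# Values copied from the record above.
span = cds_span(115164, 116792)
print(span)                  # 1629, matches "Gene Length"
print(protein_length(span))  # 542, matches "Protein Length"
```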
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.438272 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCTGA CCGAATCGCT GCTGCACGCG CTGGTCGACC ACGGCGCACG CCAGATCTTC GGCATCCCGG GCGACTTCGC CTTGCCGTAT TTCCGCATCA TCGAGCAGAC CGGCATCCTG CCGCTGCACA CGCTGTCGCA CGAGCCCGGC GTGGGCTTCG CCGCCGACGC GGCCGCGCGC GTCCAGGGTG GCATCGGGGT GGCGGCGGTC ACCTACGGCG CCGGCGCGCT CAACATGGTC AACGCGGTCG CCGCCGCGTA TGCCGAGAAG TCGCCGCTGG TGGTCATCTC GGGCGGACCG GGGCTGGGCG AATCGCAATC CGGCCTGCTG CTGCACCACC AGGCCAAGAC GCTGGACTCG CAGTTGCGCA TCTTCCAGGA GATCACCTGC GACCAGGTGC GGCTCGACGA CGCGGCACGC GCGCCGGCCG ACATCGCGCG CGTGCTCGGC AACTGCGTGC GCAATTCACT GCCGGTGTAC ATCGAGATCC CGCGCGACAT GGTCGCCCGG CCATGCGCGC CGGTAGTGCG CGAGGCGCCG CGCGCGGTCG ATGCCGACGC GCTCGCGGCC TGTGTGGACG AGATCCTCGC ACGCCTGCGC GCGGCCCGTG CCCCGGTGCT GATGGCCGGC GTCGAGGTGC GCCGCTTCGG GCTGGAGGAC AAGGTCGCAG AGCTGTCGCG CCGGCTCGGC ATCCCGGTGG TGACCAGCTT CATGGGCCGC GGCCTGCTCG CCGACCAGGA CGCCCCGCTG ATGGGCACCT ACATGGGGCT CGCCGGCCTG CCCGAGGTGA GCGCGCTGGT GGAGGATTCC GACGGACTGT TCCTGGTCGG GGTGATCATC TCCGACACCA ACTTCGCCGT CTCCGGCAAG CACATCGACC TGCGCCACAC CATCCAGGCG CTCGAGGCCC GGGTCACGAT GGGCTACCAC ACCTATGCCG ACATCCCGCT CGACGCCCTG ATCGACGGGC TTCTCGCGCG CGTTCCGCAC GCCGACGCGC ACTTCACGGT CGACCGCCCC GCCTTTCCGC ACGGCCTGCA GGCGGACGAA GCGACGATCG CGCCCGCCGA CATCGCCTGC GCGGTAAACG ACCTGATGGC GGCCCACGGC AAGCTGCCGA TCGCCTCCGA CATGGGCGAC TGCCTGTTCA CCGCGATGGA CATCGAGCAC ACCGCGCTGG TCGCGCCCGG CTACTACGCC ACCATGGGCT TCGGCGTGCC CGCCGGTCTC GGTGTGCAGG CCGCCACCGG GCAGCGGCCG CTGATCCTGG TCGGCGACGG CGCCTTCCAG ATGACCGGCT GGGAGCTCGG CAACTGCCGG CGCTACGGCT GGGATCCGAT CGTGCTGTTG TTCAATAACG CGAGCTGGGA GATGCTGCGC ACCTTCCAGC CCGAGTCAGG CTTCAACGAC CTCGACGACT GGGGCTTCGC GCAGATGGCC GCGGGCCTGG GTGGCGACGG CGTGCGCGTG CACACGCGCG CGCAGCTGAA GGCTGCGCTC GACAAGGCGA TCGCCACGCG CGGACGCTTC CAGCTCATCG AGGTGATGAT CCCGCGCGGC GTGCTGTCGG AGTCGCTGTC GCGCTTCGTC AACGCGGTCA AGCGCCTCAA CGCCGCCAAG GCCGCCTGA
|
Protein sequence | MNLTESLLHA LVDHGARQIF GIPGDFALPY FRIIEQTGIL PLHTLSHEPG VGFAADAAAR VQGGIGVAAV TYGAGALNMV NAVAAAYAEK SPLVVISGGP GLGESQSGLL LHHQAKTLDS QLRIFQEITC DQVRLDDAAR APADIARVLG NCVRNSLPVY IEIPRDMVAR PCAPVVREAP RAVDADALAA CVDEILARLR AARAPVLMAG VEVRRFGLED KVAELSRRLG IPVVTSFMGR GLLADQDAPL MGTYMGLAGL PEVSALVEDS DGLFLVGVII SDTNFAVSGK HIDLRHTIQA LEARVTMGYH TYADIPLDAL IDGLLARVPH ADAHFTVDRP AFPHGLQADE ATIAPADIAC AVNDLMAAHG KLPIASDMGD CLFTAMDIEH TALVAPGYYA TMGFGVPAGL GVQAATGQRP LILVGDGAFQ MTGWELGNCR RYGWDPIVLL FNNASWEMLR TFQPESGFND LDDWGFAQMA AGLGGDGVRV HTRAQLKAAL DKAIATRGRF QLIEVMIPRG VLSESLSRFV NAVKRLNAAK AA
|
| |
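The gene and protein sequences above can be checked against each other, and against the GC content field, with a few lines of code. The following is a minimal sketch, not the database's own method: it builds the codon table from the compact amino-acid string used for NCBI translation table 11, strips the spaces that separate the 10-character blocks, and translates until the first stop codon. Table 11 also allows alternative start codons, which this sketch does not handle; that is not an issue here because the gene begins with ATG. The helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
from itertools import product

# Codon order T, C, A, G with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate("ATGAACCTGA CCGAATCGCT GCTGCACGCG"))  # MNLTESLLHA
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) gives an end-to-end check, and `gc_percent` on the full gene sequence can be compared with the GC content field.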