Gene Tmz1t_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0101 
Symbol 
ID7083484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp115164 
End bp116792 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content70% 
IMG OID643697148 
Productindolepyruvate/phenylpyruvate decarboxylase 
Protein accessionYP_002353797 
Protein GI217968563 
COG category[G] Carbohydrate transport and metabolism
[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG3961] Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes 
TIGRFAM ID[TIGR03394] indolepyruvate/phenylpyruvate decarboxylase, Azospirillum family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.438272 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCTGA CCGAATCGCT GCTGCACGCG CTGGTCGACC ACGGCGCACG CCAGATCTTC 
GGCATCCCGG GCGACTTCGC CTTGCCGTAT TTCCGCATCA TCGAGCAGAC CGGCATCCTG
CCGCTGCACA CGCTGTCGCA CGAGCCCGGC GTGGGCTTCG CCGCCGACGC GGCCGCGCGC
GTCCAGGGTG GCATCGGGGT GGCGGCGGTC ACCTACGGCG CCGGCGCGCT CAACATGGTC
AACGCGGTCG CCGCCGCGTA TGCCGAGAAG TCGCCGCTGG TGGTCATCTC GGGCGGACCG
GGGCTGGGCG AATCGCAATC CGGCCTGCTG CTGCACCACC AGGCCAAGAC GCTGGACTCG
CAGTTGCGCA TCTTCCAGGA GATCACCTGC GACCAGGTGC GGCTCGACGA CGCGGCACGC
GCGCCGGCCG ACATCGCGCG CGTGCTCGGC AACTGCGTGC GCAATTCACT GCCGGTGTAC
ATCGAGATCC CGCGCGACAT GGTCGCCCGG CCATGCGCGC CGGTAGTGCG CGAGGCGCCG
CGCGCGGTCG ATGCCGACGC GCTCGCGGCC TGTGTGGACG AGATCCTCGC ACGCCTGCGC
GCGGCCCGTG CCCCGGTGCT GATGGCCGGC GTCGAGGTGC GCCGCTTCGG GCTGGAGGAC
AAGGTCGCAG AGCTGTCGCG CCGGCTCGGC ATCCCGGTGG TGACCAGCTT CATGGGCCGC
GGCCTGCTCG CCGACCAGGA CGCCCCGCTG ATGGGCACCT ACATGGGGCT CGCCGGCCTG
CCCGAGGTGA GCGCGCTGGT GGAGGATTCC GACGGACTGT TCCTGGTCGG GGTGATCATC
TCCGACACCA ACTTCGCCGT CTCCGGCAAG CACATCGACC TGCGCCACAC CATCCAGGCG
CTCGAGGCCC GGGTCACGAT GGGCTACCAC ACCTATGCCG ACATCCCGCT CGACGCCCTG
ATCGACGGGC TTCTCGCGCG CGTTCCGCAC GCCGACGCGC ACTTCACGGT CGACCGCCCC
GCCTTTCCGC ACGGCCTGCA GGCGGACGAA GCGACGATCG CGCCCGCCGA CATCGCCTGC
GCGGTAAACG ACCTGATGGC GGCCCACGGC AAGCTGCCGA TCGCCTCCGA CATGGGCGAC
TGCCTGTTCA CCGCGATGGA CATCGAGCAC ACCGCGCTGG TCGCGCCCGG CTACTACGCC
ACCATGGGCT TCGGCGTGCC CGCCGGTCTC GGTGTGCAGG CCGCCACCGG GCAGCGGCCG
CTGATCCTGG TCGGCGACGG CGCCTTCCAG ATGACCGGCT GGGAGCTCGG CAACTGCCGG
CGCTACGGCT GGGATCCGAT CGTGCTGTTG TTCAATAACG CGAGCTGGGA GATGCTGCGC
ACCTTCCAGC CCGAGTCAGG CTTCAACGAC CTCGACGACT GGGGCTTCGC GCAGATGGCC
GCGGGCCTGG GTGGCGACGG CGTGCGCGTG CACACGCGCG CGCAGCTGAA GGCTGCGCTC
GACAAGGCGA TCGCCACGCG CGGACGCTTC CAGCTCATCG AGGTGATGAT CCCGCGCGGC
GTGCTGTCGG AGTCGCTGTC GCGCTTCGTC AACGCGGTCA AGCGCCTCAA CGCCGCCAAG
GCCGCCTGA
 
Protein sequence
MNLTESLLHA LVDHGARQIF GIPGDFALPY FRIIEQTGIL PLHTLSHEPG VGFAADAAAR 
VQGGIGVAAV TYGAGALNMV NAVAAAYAEK SPLVVISGGP GLGESQSGLL LHHQAKTLDS
QLRIFQEITC DQVRLDDAAR APADIARVLG NCVRNSLPVY IEIPRDMVAR PCAPVVREAP
RAVDADALAA CVDEILARLR AARAPVLMAG VEVRRFGLED KVAELSRRLG IPVVTSFMGR
GLLADQDAPL MGTYMGLAGL PEVSALVEDS DGLFLVGVII SDTNFAVSGK HIDLRHTIQA
LEARVTMGYH TYADIPLDAL IDGLLARVPH ADAHFTVDRP AFPHGLQADE ATIAPADIAC
AVNDLMAAHG KLPIASDMGD CLFTAMDIEH TALVAPGYYA TMGFGVPAGL GVQAATGQRP
LILVGDGAFQ MTGWELGNCR RYGWDPIVLL FNNASWEMLR TFQPESGFND LDDWGFAQMA
AGLGGDGVRV HTRAQLKAAL DKAIATRGRF QLIEVMIPRG VLSESLSRFV NAVKRLNAAK
AA