Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0826 |
Symbol | |
ID | 7084218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 914416 |
End bp | 915906 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643697850 |
Product | UbiD family decarboxylase |
Protein accession | YP_002354491 |
Protein GI | 217969257 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATACG ACGACCTCCG CGACTTCCTC GCCCAGCTCG AAGCCCGTGG CGAGCTCCGG CGCATCAAGA CCCCCGTCGA CACCCACCTC GAGATGACCG AGATCGCCGA CCGCGTGCTG CGCGCCGGCG GGCCGGCGCT GCTGTTCGAG AGGCCGGTGA CCAGGGGCGT GGCGCAGGCG ATCCCCGTGC TCGCCAACCT CTTCGGCACG CCGCAGCGGG TGGCGATGGG CATGGGGGAG GAGGTCGCCG ACGGCGACTG GAGCACGCCC CTGCGCGAGG TCGGCAAGCT GCTCGCCTAT CTCAAGGAGC CCGAGCCGCC CAAGGGGCTG AAGGACGCCT GGGACAAGCT GCCGGTGCTG AAGCAGGTGC TCAACATGGC GCCCAAGGAG GTGCGCTCCG CGCCCTGCCA GCAGGTGGTG TGGTCGGGCG ACGAGGTCGA CCTGGCGAAG CTGCCGATCC AGCACTGCTG GCCGGGCGAC GCCGCGCCGC TGATCACCTG GGGCCTGGTG GTGACGCGCG GGCCGCACAA GAAGCGCCAG AACCTGGGCA TCTACCGCCA GCAGGTGATC GGCCGCAACC GGGTGATCAT GCGCTGGCTG GCGCACCGGG GCGGGGCGAT CGACTTCCTC GAGCACCAGC GCGCGCATCC GGGCGAGCCT TTCCCGGTCG CGGTGGTGCT GGGCTGCGAT CCGGCGACCA TCCTCGGCGC GGTGACGCCG GTGCCCGATT CGCTCTCGGA GTACCAGTTC GCCGGCCTCC TGCGCGGTGC CAAGACCGAG CTGGTGAAAT GTCTCGGCAG CGACCTGCAG GTGCCGGCGT CGGCCGAGAT CGTGCTCGAG GGCGCGATTC ACCCGGGCGA CATGGCGCCC GAAGGCCCCT ATGGCGACCA CACCGGCTAC TACAACGAGG TCTCCGATTT CCCGGTGTTC ACGATCGAGC GCATCACCAT GCGGCGCGAT CCGATCTATC ACAGCACCTA CACCGGCAAG CCGCCCGACG AACCGGCGAT GCTCGGCGTC GCCCTGAACG AGGTCTTCGT GCCGCTGCTG CAGAAGCAGT TCACCGAGAT CGTCGACTTC TACCTGCCCC CGGAGGGCTG CTCGTATCGC CTGGCGGTGG TCAGCATCCG CAAGCAGTAC CCGGGCCACG CCAAGCGGGT GATGTTCGGC ATCTGGAGCT TCCTGCGCCA GTTCATGTAC ACCAAGTTCA TCATCGTGGT GGACGAGGAT GTGAACATCC GCGACTGGAA GGAGGTGATC TGGGCGCTCA CGACGCGCAT GGACGCCACG CGCGACACCA CGCTGGTCGA CAACACGCCG ATCGACTATC TCGACTTCGC CAGCCCGGTC GCCGGACTGG GCAGCAAGAT GGGGCTGGAC GCGACCAACA AGTGGCCGGG CGAGACCAGC CGCGAGTGGG GGACGCCGAT CGTGATGGAT GCGGCCGTGA AGGCGAAGGT GGATGCGATG TGGGGCGAGC TGGGGCTGTA G
|
Protein sequence | MKYDDLRDFL AQLEARGELR RIKTPVDTHL EMTEIADRVL RAGGPALLFE RPVTRGVAQA IPVLANLFGT PQRVAMGMGE EVADGDWSTP LREVGKLLAY LKEPEPPKGL KDAWDKLPVL KQVLNMAPKE VRSAPCQQVV WSGDEVDLAK LPIQHCWPGD AAPLITWGLV VTRGPHKKRQ NLGIYRQQVI GRNRVIMRWL AHRGGAIDFL EHQRAHPGEP FPVAVVLGCD PATILGAVTP VPDSLSEYQF AGLLRGAKTE LVKCLGSDLQ VPASAEIVLE GAIHPGDMAP EGPYGDHTGY YNEVSDFPVF TIERITMRRD PIYHSTYTGK PPDEPAMLGV ALNEVFVPLL QKQFTEIVDF YLPPEGCSYR LAVVSIRKQY PGHAKRVMFG IWSFLRQFMY TKFIIVVDED VNIRDWKEVI WALTTRMDAT RDTTLVDNTP IDYLDFASPV AGLGSKMGLD ATNKWPGETS REWGTPIVMD AAVKAKVDAM WGELGL
|
| |