Gene Tmz1t_0826 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0826 
Symbol 
ID7084218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp914416 
End bp915906 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content68% 
IMG OID643697850 
ProductUbiD family decarboxylase 
Protein accessionYP_002354491 
Protein GI217969257 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATACG ACGACCTCCG CGACTTCCTC GCCCAGCTCG AAGCCCGTGG CGAGCTCCGG 
CGCATCAAGA CCCCCGTCGA CACCCACCTC GAGATGACCG AGATCGCCGA CCGCGTGCTG
CGCGCCGGCG GGCCGGCGCT GCTGTTCGAG AGGCCGGTGA CCAGGGGCGT GGCGCAGGCG
ATCCCCGTGC TCGCCAACCT CTTCGGCACG CCGCAGCGGG TGGCGATGGG CATGGGGGAG
GAGGTCGCCG ACGGCGACTG GAGCACGCCC CTGCGCGAGG TCGGCAAGCT GCTCGCCTAT
CTCAAGGAGC CCGAGCCGCC CAAGGGGCTG AAGGACGCCT GGGACAAGCT GCCGGTGCTG
AAGCAGGTGC TCAACATGGC GCCCAAGGAG GTGCGCTCCG CGCCCTGCCA GCAGGTGGTG
TGGTCGGGCG ACGAGGTCGA CCTGGCGAAG CTGCCGATCC AGCACTGCTG GCCGGGCGAC
GCCGCGCCGC TGATCACCTG GGGCCTGGTG GTGACGCGCG GGCCGCACAA GAAGCGCCAG
AACCTGGGCA TCTACCGCCA GCAGGTGATC GGCCGCAACC GGGTGATCAT GCGCTGGCTG
GCGCACCGGG GCGGGGCGAT CGACTTCCTC GAGCACCAGC GCGCGCATCC GGGCGAGCCT
TTCCCGGTCG CGGTGGTGCT GGGCTGCGAT CCGGCGACCA TCCTCGGCGC GGTGACGCCG
GTGCCCGATT CGCTCTCGGA GTACCAGTTC GCCGGCCTCC TGCGCGGTGC CAAGACCGAG
CTGGTGAAAT GTCTCGGCAG CGACCTGCAG GTGCCGGCGT CGGCCGAGAT CGTGCTCGAG
GGCGCGATTC ACCCGGGCGA CATGGCGCCC GAAGGCCCCT ATGGCGACCA CACCGGCTAC
TACAACGAGG TCTCCGATTT CCCGGTGTTC ACGATCGAGC GCATCACCAT GCGGCGCGAT
CCGATCTATC ACAGCACCTA CACCGGCAAG CCGCCCGACG AACCGGCGAT GCTCGGCGTC
GCCCTGAACG AGGTCTTCGT GCCGCTGCTG CAGAAGCAGT TCACCGAGAT CGTCGACTTC
TACCTGCCCC CGGAGGGCTG CTCGTATCGC CTGGCGGTGG TCAGCATCCG CAAGCAGTAC
CCGGGCCACG CCAAGCGGGT GATGTTCGGC ATCTGGAGCT TCCTGCGCCA GTTCATGTAC
ACCAAGTTCA TCATCGTGGT GGACGAGGAT GTGAACATCC GCGACTGGAA GGAGGTGATC
TGGGCGCTCA CGACGCGCAT GGACGCCACG CGCGACACCA CGCTGGTCGA CAACACGCCG
ATCGACTATC TCGACTTCGC CAGCCCGGTC GCCGGACTGG GCAGCAAGAT GGGGCTGGAC
GCGACCAACA AGTGGCCGGG CGAGACCAGC CGCGAGTGGG GGACGCCGAT CGTGATGGAT
GCGGCCGTGA AGGCGAAGGT GGATGCGATG TGGGGCGAGC TGGGGCTGTA G
 
Protein sequence
MKYDDLRDFL AQLEARGELR RIKTPVDTHL EMTEIADRVL RAGGPALLFE RPVTRGVAQA 
IPVLANLFGT PQRVAMGMGE EVADGDWSTP LREVGKLLAY LKEPEPPKGL KDAWDKLPVL
KQVLNMAPKE VRSAPCQQVV WSGDEVDLAK LPIQHCWPGD AAPLITWGLV VTRGPHKKRQ
NLGIYRQQVI GRNRVIMRWL AHRGGAIDFL EHQRAHPGEP FPVAVVLGCD PATILGAVTP
VPDSLSEYQF AGLLRGAKTE LVKCLGSDLQ VPASAEIVLE GAIHPGDMAP EGPYGDHTGY
YNEVSDFPVF TIERITMRRD PIYHSTYTGK PPDEPAMLGV ALNEVFVPLL QKQFTEIVDF
YLPPEGCSYR LAVVSIRKQY PGHAKRVMFG IWSFLRQFMY TKFIIVVDED VNIRDWKEVI
WALTTRMDAT RDTTLVDNTP IDYLDFASPV AGLGSKMGLD ATNKWPGETS REWGTPIVMD
AAVKAKVDAM WGELGL