Gene Tmz1t_3632 details


Gene Information

Locus tag: Tmz1t_3632
Symbol:
ID: 7873137
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3989554
End bp: 3990897
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 68%
IMG OID: 643700573
Product: amidohydrolase
Protein accession: YP_002890602
Protein GI: 237654288
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID:
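
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention. The function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(3989554, 3990897)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the coordinates give 1344 bp and 447 aa, matching the Gene Length and Protein Length fields.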


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGCTGA AAACCTGTCT CGGTTGCGGG GTCGTCGTGC TCGCATCGCT GTGGCTGTCG 
CTGGCGCATG CGGCGGACAA GGCCGTCCTG TTCGAGAATG TCTTCGTCTT CGACGGCAGG
AGCAGCGAGC GGGGCCGCTC GCCGGTCAAC GTGCTGGTGG TGGGCGACAC CATCCAGACC
ATCTCGGCGG CGCCGATCGC GCCGCCCGCG GACGCCGACC TGACGCGTGT CGCCGGCGGG
GGCCGCACGC TGATGCCCGG CCTGATCGAT GCGCACTGGC ACTCGATGCT GATCGGCCAG
AAGCTGGTCG ATGCGATGAC CAGCGACGTC GGCTATACCA ATCTGCGTGC GGCGCAGGTC
GCCGAGGCGA CGCTCATGCG CGGCTTCACC ACGGTGCGCG ACATGGGCGG GCCGGTGCAG
GGCCTGCGGC GTGCGATCGA GGAGGGGCGT TTCCCCGGGC CGCGCATCTT CCCGTCGGGC
GCGATGATCT CGCAGACCGG CGGCCATGGC GATTTCCGTC TGCCGCACGA GGTGCCGCGC
GCCGCCAGCC AGGGTCTCAG TCATACCGAA CTTACCCGCA CCGCCGCCAT TGCCGATGGC
GCCGACGAGG TGCTGCGGCG CACCCGCGAA CAGCTGATGC TCGGCGCGAC GCAGATCAAG
CTGATGGCTG GCGGTGGGGT GACGTCGATC TACGACCCGA TCGACGCCAC CCAATACACC
CAGGACGAGA TCCGCGCCGC AGTGGCTGCG GCAGAGAACT GGGGCACTTA CGTTACGGTG
CACGCCTACA CCAGCCGTGC GGTTCAGGTC GCCATCGAGG CCGGGGTCAA GGCGGTGGAG
CACGGCCAGC TGGTCGATGA GGAGACGGTC AGGCTGATGG CGCAGAAGGG GATCTGGTGG
TCGCTCCAGC CCTTCCTCGA CAACGAGCTC GCCAACCCGC AGGCCGGGGC CAACCGGGTC
AAGCAACTGA TGGTGGCGGC CGGCACCGAT CGCGCCTACG CACTGGCGCG CAAGCACGCG
GTGAAGGTGG CCTTCGGCAC CGACATCCTG TTCTCGGGGG ACAACGGAGA GGTGCAGAAC
GCACGCCTGG TCTCGCTCGA GCGCTGGTAT CCCCCCGGCG AGGTGCTGCA GATCGCGACC
GGCAACAACG GCGCCCTGCT CGAGCTTACC GGCGAGCGCA ACCCGTATCG CAAGCCGCTC
GGGGTCGTCG CCGAGGGCGC GCTCGCCGAC CTGTTGCTGG TCGATGGCGA CCCGACGGCG
GACCTGTCGC TCATCAAGCG TCCCGAGTCG AGCTTCGTCC TCATCATGAA GAACGGGCGC
ATCTACAAGA ACCTGCTGCC CTGA
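
The gene sequence is printed in space-separated blocks of 10 nucleotides. A minimal sketch for stripping that layout and computing GC content is given here; clean_sequence and gc_content are hypothetical helper names, and the input is assumed to contain only the letters A, C, G and T.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Excerpt only: the first two blocks of the gene sequence above.
excerpt = clean_sequence("ATGAGGCTGA AAACCTGTCT")
print(len(excerpt), round(gc_content(excerpt), 2))
```

Applied to the full gene sequence, the result can be compared with the 68% GC content reported in the record.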
 
Protein sequence
MRLKTCLGCG VVVLASLWLS LAHAADKAVL FENVFVFDGR SSERGRSPVN VLVVGDTIQT 
ISAAPIAPPA DADLTRVAGG GRTLMPGLID AHWHSMLIGQ KLVDAMTSDV GYTNLRAAQV
AEATLMRGFT TVRDMGGPVQ GLRRAIEEGR FPGPRIFPSG AMISQTGGHG DFRLPHEVPR
AASQGLSHTE LTRTAAIADG ADEVLRRTRE QLMLGATQIK LMAGGGVTSI YDPIDATQYT
QDEIRAAVAA AENWGTYVTV HAYTSRAVQV AIEAGVKAVE HGQLVDEETV RLMAQKGIWW
SLQPFLDNEL ANPQAGANRV KQLMVAAGTD RAYALARKHA VKVAFGTDIL FSGDNGEVQN
ARLVSLERWY PPGEVLQIAT GNNGALLELT GERNPYRKPL GVVAEGALAD LLLVDGDPTA
DLSLIKRPES SFVLIMKNGR IYKNLLP
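
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than calling a library; it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons, which does not matter here because the gene starts with ATG). The name translate is a hypothetical helper, and the input is assumed to be an in-frame CDS containing only A, C, G and T.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) and their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # tolerate the 10-base block layout
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT input
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence above.
print(translate("ATGAGGCTGA AAACCTGTCT CGGTTGCGGG"))  # MRLKTCLGCG
```

The ten residues printed for the first 30 nucleotides agree with the start of the protein sequence listed above.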