Gene Tmz1t_0520 details

Gene Information

Locus tag: Tmz1t_0520
Symbol:
ID: 7085134
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 585397
End bp: 586449
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 75%
IMG OID: 643697548
Product: urea amidolyase related protein
Protein accession: YP_002354190
Protein GI: 217968956
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1984] Allophanate hydrolase subunit 2
TIGRFAM ID: [TIGR00724] biotin-dependent carboxylase uncharacterized domain
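
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below shows one way to check them, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(585397, 586449)
print(length, protein_length(length))  # 1053 350
```

Both values agree with the Gene Length and Protein Length fields reported for Tmz1t_0520.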


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACCC GGACCGAAGT GGAAATCCTC TCCCCCGGCG CCTTCGCCTC CATCCAGGAC 
GGCGGCCGCC GCGGCCACCG CCGCATCGGC GTGCCTTGGG CGGGCGTGCT CGACCGCCGC
CTGATGCGCA TCGCCAATGC GCTCGCCGGT CGCGCCGAGG ACGCCGCGGT GATCGAATGC
TTCGACGGCG GCCTGCACGT CGCCGCGAGC GGCGGCGCGG TGAAGCTCGC GGTGGCCGGC
GACGCGGTGG TCGAGGTCGA AGGCACCGAG GGCCGGCGCC AGCTCGCGCC GTGGCGCTCG
GTGACGCTGG CCGACGGCGA GCAGCTGCGC ATCCGCAAGA TGGAGGGCGG ACGCATCGCC
ATGGTCGCGA TCGTCGGCCT CGAGCCGGCG GCGGTGATGG GCAGCGCCTC GACCTATGCG
CGCGCCGGCA TCGGCGGCGT GGATGGCCGT GCGCTCGGCG CCGGCACGCG CCTGGCGCTC
TCCGCCGACG CCGACCCCTG GGACAGCGAC CGCGTGCTCG CCCAGTCGCC CGCGGCCGAC
ACCGGTCCGA TCCGCCTGGT GCCCGGTCCG CAGGCCGACC ACTTCAGCCC CACCGCGCTC
GACGCCCTGG TGGGCGGCGA GTATCGCGTC ACCACCGAGG CCGACCGCAT GGGCATCCGC
CTCGAGGGCG CGCAGCTGGA GCACGCCGGC GCCGCCGAGA TCGTCTCCGA CGCCACCGTG
CCCGGCTCCA TCCAGGTGCC CGGTGCCGGC CAGCCCATCG TGCTGCTCGC CGACGCGCAG
ACCGCCGGCG GCTATCCCAA GATCGCCACC GTGATCGGCG CCGACCTCGG CCGTCTCGCC
GCGCTGCGCC CCGGCCAGAG CCTGCGCTTC GCCGCCGTGA GCGCCGCCGA GGGCGCGTGC
ATCGCGCGCG CCGCAGAGAC CGAGACCCGG GCGTTGATCG CCTCGATCCG CGCCCTGCCG
CCCGATGGCA TCGACCTGAT GGCGCTGTAC ACCGGGAACC TGGTCGACGG CGTCGTGCAT
GCCCTCGGCA CCGAATACCG ACCGCTGTAT TGA
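
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The following is a minimal sketch of how a GC content figure like the one reported above could be computed from such text. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the exact method used by the source database is not stated on this page.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks between blocks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGAGCACCC GGACCGAAGT GGAAATCCTC TCCCCCGGCG CCTTCGCCTC CATCCAGGAC"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC content of this first line only
```

Passing the full 1053 bp sequence to the same functions would give the GC content for the whole gene.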
 
Protein sequence
MSTRTEVEIL SPGAFASIQD GGRRGHRRIG VPWAGVLDRR LMRIANALAG RAEDAAVIEC 
FDGGLHVAAS GGAVKLAVAG DAVVEVEGTE GRRQLAPWRS VTLADGEQLR IRKMEGGRIA
MVAIVGLEPA AVMGSASTYA RAGIGGVDGR ALGAGTRLAL SADADPWDSD RVLAQSPAAD
TGPIRLVPGP QADHFSPTAL DALVGGEYRV TTEADRMGIR LEGAQLEHAG AAEIVSDATV
PGSIQVPGAG QPIVLLADAQ TAGGYPKIAT VIGADLGRLA ALRPGQSLRF AAVSAAEGAC
IARAAETETR ALIASIRALP PDGIDLMALY TGNLVDGVVH ALGTEYRPLY
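
The protein sequence corresponds to the gene sequence translated with translation table 11. As a hedged illustration, the sketch below builds the codon table from the compact NCBI-style amino acid string (table 11 assigns the same amino acids as the standard code and differs only in permitted start codons) and translates the first and last lines of the gene sequence. The `translate` helper is illustrative; it does not handle alternative start codons, which is not an issue here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(text: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGAGCACCC GGACCGAAGT GGAAATCCTC TCCCCCGGCG CCTTCGCCTC CATCCAGGAC"
last_line = "GCCCTCGGCA CCGAATACCG ACCGCTGTAT TGA"
print(translate(first_line))  # MSTRTEVEILSPGAFASIQD
print(translate(last_line))   # ALGTEYRPLY
```

Each full line of the gene sequence holds 60 bases, a multiple of 3, so every line starts in frame and can be translated on its own for a spot check against the protein listing.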