Gene Tmz1t_1336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1336 
Symbol 
ID7084457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1478723 
End bp1480030 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content72% 
IMG OID643698353 
Productguanine deaminase 
Protein accessionYP_002354991 
Protein GI217969757 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase
[TIGR02967] guanine deaminase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.521939 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCCT CCGCCCCACC CTCCCCGCGG CTCTCCGCCG TACGCGGCGA GATCGTCCAC 
TTCCTCGCCG ACCCCGCGGC CGACCCGCGC GCGCTCGAGC ACTTCGCCGA CGGCGTGCTG
ATCGTGCGCG ACGGCCACGT CGCCGAATGC GGCCCCGCCG CCGCGCTGTT GCCCAAGCTG
CCCGCCGGCA CGCCGCTCGC CGACCACCGC GGCAAGCTGA TCCTGCCCGG CTTCGTCGAC
ACCCACGTGC ACTACCCGCA GACCGACATC ATCGCCAGCC ACGGCGAGCA GCTGCTCGAA
TGGCTGGAGA AATACACCTT CCCCGCCGAG CGCCGCTTCG CCGACCGCGC GCATGCCGCC
GAGGTCGCCG GCTTCTTCTG CGACGAGCTG CTGCGCAACG GCACCACCAC CGCGGCGGCC
TTCGCCACGG TGCACGCGGC CTCGGTGGAT GCGCTCTTCG AGGCCGCGCG CGCGCGCCGC
ATGCGCATGA TCACCGGCAA GGTGCTGATG GACCGCAACT GCCCCGACTT CCTGCGCGAC
ACGGCGGAGA CCGGCTACGC GGAGTCGAAG GCCTTGATCG AGCGCTGGCA CAACCGCGAC
CGCCTGCTGT ACGCCATCAC CCCGCGCTTC GCGCCCACCT CCACGCCCGC GCAGATGACG
CTCGCCGGTC GCCTCTTCAA CGAGCACCCG GGCGTCTTCC TGCAGTCGCA CCTCGCCGAG
AACCGCGCCG AGGTGGCCTG GGTGGCGCAG CTGTACCCGC AGGCGCGCAG CTACCTCGAC
GTGTATGCCC GCGCCGGCCA GCTCGGCACG CGCGCGGTGT TCGCGCACTG CATCTGGCTC
GACGACGCCG ACCGCCGGCA CATGGCCGAG ACCGGCGCGG CGATCAGCTT CTGCCCCACC
TCCAACCTCT TCCTCGGCTC GGGGCTCTTC GACCTGCGGC GCGCGCGCGC GCTCGGCGTG
CGCGTGGGAC TGGGCACGGA CGTGGGGGGC GGCACCAGCT TCTCGATGCT GCAGACCATG
AACGAGGCCT ACAAGGTGCT GCAGCTCGCC GGCCAGTCGC TGTCGGCGGC GAGCGCTTTC
CACCTCGCCA CCCTGGGCGG CGCGCACAGC CTCTACCTCG ACGACCGCAT CGGCAACCTC
GCGCCCGGCA AGGAGGCCGA CTTCGTCGTC CTGAACCCGC GCGCAACGCC CCTGCTCGAG
CGCCGCAGCG CCGCCTGCGC GACGCTGGAG GAGCGCCTCT TCGTGCTGAT GATGCTTGGG
GACGACCGTG CGGTGGCGGC GACGCACGTG CTGGGCGTGC CGGTGTAG
 
Protein sequence
MKPSAPPSPR LSAVRGEIVH FLADPAADPR ALEHFADGVL IVRDGHVAEC GPAAALLPKL 
PAGTPLADHR GKLILPGFVD THVHYPQTDI IASHGEQLLE WLEKYTFPAE RRFADRAHAA
EVAGFFCDEL LRNGTTTAAA FATVHAASVD ALFEAARARR MRMITGKVLM DRNCPDFLRD
TAETGYAESK ALIERWHNRD RLLYAITPRF APTSTPAQMT LAGRLFNEHP GVFLQSHLAE
NRAEVAWVAQ LYPQARSYLD VYARAGQLGT RAVFAHCIWL DDADRRHMAE TGAAISFCPT
SNLFLGSGLF DLRRARALGV RVGLGTDVGG GTSFSMLQTM NEAYKVLQLA GQSLSAASAF
HLATLGGAHS LYLDDRIGNL APGKEADFVV LNPRATPLLE RRSAACATLE ERLFVLMMLG
DDRAVAATHV LGVPV