Gene Tmz1t_0443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0443 
Symbol 
ID7084953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp505002 
End bp506081 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content75% 
IMG OID643697475 
ProductA/G-specific adenine glycosylase 
Protein accessionYP_002354118 
Protein GI217968884 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGATT TTTCCATCCG CCTGATCGAA TGGCAGCGCA AGCACGGCCG CCACGACCTG 
CCCTGGCAGG GCGGCCACGA CCCCTACCGC ATCTGGCTGT CGGAGATCAT GCTGCAGCAG
ACCCGGGTCG AGACCGTGAT CCCCTACTAC GAGCGCTTCC TCGCGCGCTT CCCCGACGTC
GCCGCGCTCG CCGCGGCGCC GGTCGAGGAC GTCATGGCGT TGTGGAGCGG CCTGGGCTAC
TACGCCCGCG CGCGCAACCT GCACCGCGCG GCGCGGGTGG TCATGGACGC GCACGGCGGC
GCCTTTCCGC GCAGCGCCGC GGCGATCGCC GGGCTGCCCG GCATCGGTCG CTCCACCGCG
GCGGCGATCG CCGCCTTCGC CTGGGGCGAG CGTGCGGCGA TCCTCGACGG CAACGTCAAG
CGCGTGCTGT GCCGCGTCTT CGGCATCGAG GGCTTTCCCG GCGACAAGGC GGTGGAGACG
CGGCTGTGGG CGCTCGCCGA GTCGCTGCTG CCGGAGCGCG GGATCGGCCG CTACATCCAG
GCGCAGATGG ATCTCGGCGC CACGCTGTGC ACCCGCGCCC GCCCCGCCTG CGCGCGCTGC
CCCTTCCACG ACGACTGCGT CGCCCGTCGC GACGGGCGCG TGGCCGCGTT GCCGACCGCG
CGCCCGAAGA AGGTGGTGCC GCGGCGTGGT GCGCGCTGTG CGGTGATCCT GCACCAGGGC
GCGGTGCTGC TGGAGCGTCG CCCGCCGGCG GGGATCTGGG GCGGCCTGCT GGCGCTGCCG
GAATTGCCCG CCGAGGTGGA CGACGCCCAG GCCTGGAGCG CCCAGCGTTT CGGCCTGGCC
ACCGCCGCGC CCCGGCCGCT CGCACCACTC ACCCACGCCT TCACCCACTT CGTGCTCGAG
CTGCAGCCGC TGCTGCTGCA CGCCAGTGCC ATCCAAGGCC TGGCCGACGA CGGCGCGCTG
TGCTGGCTGC CGCTGGGCGC CCACGCCGAG GCCGCCCTGC CCGCGCCGGT GCGGCGCATC
CTCGACGGCC TCGCGGCACC GGGCCTCTTC GACGAGGGCG CGCCCGCGCG CGGCGCCTGA
 
Protein sequence
MSDFSIRLIE WQRKHGRHDL PWQGGHDPYR IWLSEIMLQQ TRVETVIPYY ERFLARFPDV 
AALAAAPVED VMALWSGLGY YARARNLHRA ARVVMDAHGG AFPRSAAAIA GLPGIGRSTA
AAIAAFAWGE RAAILDGNVK RVLCRVFGIE GFPGDKAVET RLWALAESLL PERGIGRYIQ
AQMDLGATLC TRARPACARC PFHDDCVARR DGRVAALPTA RPKKVVPRRG ARCAVILHQG
AVLLERRPPA GIWGGLLALP ELPAEVDDAQ AWSAQRFGLA TAAPRPLAPL THAFTHFVLE
LQPLLLHASA IQGLADDGAL CWLPLGAHAE AALPAPVRRI LDGLAAPGLF DEGAPARGA