Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0443 |
Symbol | |
ID | 7084953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 505002 |
End bp | 506081 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643697475 |
Product | A/G-specific adenine glycosylase |
Protein accession | YP_002354118 |
Protein GI | 217968884 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATT TTTCCATCCG CCTGATCGAA TGGCAGCGCA AGCACGGCCG CCACGACCTG CCCTGGCAGG GCGGCCACGA CCCCTACCGC ATCTGGCTGT CGGAGATCAT GCTGCAGCAG ACCCGGGTCG AGACCGTGAT CCCCTACTAC GAGCGCTTCC TCGCGCGCTT CCCCGACGTC GCCGCGCTCG CCGCGGCGCC GGTCGAGGAC GTCATGGCGT TGTGGAGCGG CCTGGGCTAC TACGCCCGCG CGCGCAACCT GCACCGCGCG GCGCGGGTGG TCATGGACGC GCACGGCGGC GCCTTTCCGC GCAGCGCCGC GGCGATCGCC GGGCTGCCCG GCATCGGTCG CTCCACCGCG GCGGCGATCG CCGCCTTCGC CTGGGGCGAG CGTGCGGCGA TCCTCGACGG CAACGTCAAG CGCGTGCTGT GCCGCGTCTT CGGCATCGAG GGCTTTCCCG GCGACAAGGC GGTGGAGACG CGGCTGTGGG CGCTCGCCGA GTCGCTGCTG CCGGAGCGCG GGATCGGCCG CTACATCCAG GCGCAGATGG ATCTCGGCGC CACGCTGTGC ACCCGCGCCC GCCCCGCCTG CGCGCGCTGC CCCTTCCACG ACGACTGCGT CGCCCGTCGC GACGGGCGCG TGGCCGCGTT GCCGACCGCG CGCCCGAAGA AGGTGGTGCC GCGGCGTGGT GCGCGCTGTG CGGTGATCCT GCACCAGGGC GCGGTGCTGC TGGAGCGTCG CCCGCCGGCG GGGATCTGGG GCGGCCTGCT GGCGCTGCCG GAATTGCCCG CCGAGGTGGA CGACGCCCAG GCCTGGAGCG CCCAGCGTTT CGGCCTGGCC ACCGCCGCGC CCCGGCCGCT CGCACCACTC ACCCACGCCT TCACCCACTT CGTGCTCGAG CTGCAGCCGC TGCTGCTGCA CGCCAGTGCC ATCCAAGGCC TGGCCGACGA CGGCGCGCTG TGCTGGCTGC CGCTGGGCGC CCACGCCGAG GCCGCCCTGC CCGCGCCGGT GCGGCGCATC CTCGACGGCC TCGCGGCACC GGGCCTCTTC GACGAGGGCG CGCCCGCGCG CGGCGCCTGA
|
Protein sequence | MSDFSIRLIE WQRKHGRHDL PWQGGHDPYR IWLSEIMLQQ TRVETVIPYY ERFLARFPDV AALAAAPVED VMALWSGLGY YARARNLHRA ARVVMDAHGG AFPRSAAAIA GLPGIGRSTA AAIAAFAWGE RAAILDGNVK RVLCRVFGIE GFPGDKAVET RLWALAESLL PERGIGRYIQ AQMDLGATLC TRARPACARC PFHDDCVARR DGRVAALPTA RPKKVVPRRG ARCAVILHQG AVLLERRPPA GIWGGLLALP ELPAEVDDAQ AWSAQRFGLA TAAPRPLAPL THAFTHFVLE LQPLLLHASA IQGLADDGAL CWLPLGAHAE AALPAPVRRI LDGLAAPGLF DEGAPARGA
|
| |