Gene Information |
Locus tag | Tmz1t_1336 |
Symbol | |
ID | 7084457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1478723 |
End bp | 1480030 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643698353 |
Product | guanine deaminase |
Protein accession | YP_002354991 |
Protein GI | 217969757 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase [TIGR02967] guanine deaminase |
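The coordinate, gene length, and protein length fields above are mutually consistent, and a short sketch can make the arithmetic explicit. It assumes the coordinates are 1-based and inclusive, and that the gene length counts one terminal stop codon that is not translated. The function names are hypothetical and are not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length
    includes exactly one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1478723, 1480030)
print(length, protein_length_aa(length))  # 1308 435
```

Both results match the Gene Length (1308 bp) and Protein Length (435 aa) fields.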
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.521939 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCCT CCGCCCCACC CTCCCCGCGG CTCTCCGCCG TACGCGGCGA GATCGTCCAC TTCCTCGCCG ACCCCGCGGC CGACCCGCGC GCGCTCGAGC ACTTCGCCGA CGGCGTGCTG ATCGTGCGCG ACGGCCACGT CGCCGAATGC GGCCCCGCCG CCGCGCTGTT GCCCAAGCTG CCCGCCGGCA CGCCGCTCGC CGACCACCGC GGCAAGCTGA TCCTGCCCGG CTTCGTCGAC ACCCACGTGC ACTACCCGCA GACCGACATC ATCGCCAGCC ACGGCGAGCA GCTGCTCGAA TGGCTGGAGA AATACACCTT CCCCGCCGAG CGCCGCTTCG CCGACCGCGC GCATGCCGCC GAGGTCGCCG GCTTCTTCTG CGACGAGCTG CTGCGCAACG GCACCACCAC CGCGGCGGCC TTCGCCACGG TGCACGCGGC CTCGGTGGAT GCGCTCTTCG AGGCCGCGCG CGCGCGCCGC ATGCGCATGA TCACCGGCAA GGTGCTGATG GACCGCAACT GCCCCGACTT CCTGCGCGAC ACGGCGGAGA CCGGCTACGC GGAGTCGAAG GCCTTGATCG AGCGCTGGCA CAACCGCGAC CGCCTGCTGT ACGCCATCAC CCCGCGCTTC GCGCCCACCT CCACGCCCGC GCAGATGACG CTCGCCGGTC GCCTCTTCAA CGAGCACCCG GGCGTCTTCC TGCAGTCGCA CCTCGCCGAG AACCGCGCCG AGGTGGCCTG GGTGGCGCAG CTGTACCCGC AGGCGCGCAG CTACCTCGAC GTGTATGCCC GCGCCGGCCA GCTCGGCACG CGCGCGGTGT TCGCGCACTG CATCTGGCTC GACGACGCCG ACCGCCGGCA CATGGCCGAG ACCGGCGCGG CGATCAGCTT CTGCCCCACC TCCAACCTCT TCCTCGGCTC GGGGCTCTTC GACCTGCGGC GCGCGCGCGC GCTCGGCGTG CGCGTGGGAC TGGGCACGGA CGTGGGGGGC GGCACCAGCT TCTCGATGCT GCAGACCATG AACGAGGCCT ACAAGGTGCT GCAGCTCGCC GGCCAGTCGC TGTCGGCGGC GAGCGCTTTC CACCTCGCCA CCCTGGGCGG CGCGCACAGC CTCTACCTCG ACGACCGCAT CGGCAACCTC GCGCCCGGCA AGGAGGCCGA CTTCGTCGTC CTGAACCCGC GCGCAACGCC CCTGCTCGAG CGCCGCAGCG CCGCCTGCGC GACGCTGGAG GAGCGCCTCT TCGTGCTGAT GATGCTTGGG GACGACCGTG CGGTGGCGGC GACGCACGTG CTGGGCGTGC CGGTGTAG
|
Protein sequence | MKPSAPPSPR LSAVRGEIVH FLADPAADPR ALEHFADGVL IVRDGHVAEC GPAAALLPKL PAGTPLADHR GKLILPGFVD THVHYPQTDI IASHGEQLLE WLEKYTFPAE RRFADRAHAA EVAGFFCDEL LRNGTTTAAA FATVHAASVD ALFEAARARR MRMITGKVLM DRNCPDFLRD TAETGYAESK ALIERWHNRD RLLYAITPRF APTSTPAQMT LAGRLFNEHP GVFLQSHLAE NRAEVAWVAQ LYPQARSYLD VYARAGQLGT RAVFAHCIWL DDADRRHMAE TGAAISFCPT SNLFLGSGLF DLRRARALGV RVGLGTDVGG GTSFSMLQTM NEAYKVLQLA GQSLSAASAF HLATLGGAHS LYLDDRIGNL APGKEADFVV LNPRATPLLE RRSAACATLE ERLFVLMMLG DDRAVAATHV LGVPV
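The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is a minimal illustration, not the database's own method. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with the stop codon TAG even though the strand is listed as "-"), so no reverse complement is applied. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code; its alternative start codons are not handled here. All function names are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGAAGCCCT CCGCCCCACC CTCCCCGCGG"
print(translate(first_blocks))  # MKPSAPPSPR
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.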