Gene Information |
Locus tag | Tmz1t_0520 |
Symbol | |
ID | 7085134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 585397 |
End bp | 586449 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643697548 |
Product | urea amidolyase related protein |
Protein accession | YP_002354190 |
Protein GI | 217968956 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2 |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain |
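The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int, int]:
    """Return (gene length in bp, leftover bases, protein length in aa).

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1        # inclusive interval
    codons, remainder = divmod(gene_length, 3)
    protein_length = codons - 1                # drop the stop codon
    return gene_length, remainder, protein_length


# Values copied from the Start bp and End bp rows of this record.
print(cds_lengths(585397, 586449))  # (1053, 0, 350)
```

Under those assumptions the result agrees with the Gene Length (1053 bp) and Protein Length (350 aa) rows, and the zero remainder shows the CDS is a whole number of codons.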
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCC GGACCGAAGT GGAAATCCTC TCCCCCGGCG CCTTCGCCTC CATCCAGGAC GGCGGCCGCC GCGGCCACCG CCGCATCGGC GTGCCTTGGG CGGGCGTGCT CGACCGCCGC CTGATGCGCA TCGCCAATGC GCTCGCCGGT CGCGCCGAGG ACGCCGCGGT GATCGAATGC TTCGACGGCG GCCTGCACGT CGCCGCGAGC GGCGGCGCGG TGAAGCTCGC GGTGGCCGGC GACGCGGTGG TCGAGGTCGA AGGCACCGAG GGCCGGCGCC AGCTCGCGCC GTGGCGCTCG GTGACGCTGG CCGACGGCGA GCAGCTGCGC ATCCGCAAGA TGGAGGGCGG ACGCATCGCC ATGGTCGCGA TCGTCGGCCT CGAGCCGGCG GCGGTGATGG GCAGCGCCTC GACCTATGCG CGCGCCGGCA TCGGCGGCGT GGATGGCCGT GCGCTCGGCG CCGGCACGCG CCTGGCGCTC TCCGCCGACG CCGACCCCTG GGACAGCGAC CGCGTGCTCG CCCAGTCGCC CGCGGCCGAC ACCGGTCCGA TCCGCCTGGT GCCCGGTCCG CAGGCCGACC ACTTCAGCCC CACCGCGCTC GACGCCCTGG TGGGCGGCGA GTATCGCGTC ACCACCGAGG CCGACCGCAT GGGCATCCGC CTCGAGGGCG CGCAGCTGGA GCACGCCGGC GCCGCCGAGA TCGTCTCCGA CGCCACCGTG CCCGGCTCCA TCCAGGTGCC CGGTGCCGGC CAGCCCATCG TGCTGCTCGC CGACGCGCAG ACCGCCGGCG GCTATCCCAA GATCGCCACC GTGATCGGCG CCGACCTCGG CCGTCTCGCC GCGCTGCGCC CCGGCCAGAG CCTGCGCTTC GCCGCCGTGA GCGCCGCCGA GGGCGCGTGC ATCGCGCGCG CCGCAGAGAC CGAGACCCGG GCGTTGATCG CCTCGATCCG CGCCCTGCCG CCCGATGGCA TCGACCTGAT GGCGCTGTAC ACCGGGAACC TGGTCGACGG CGTCGTGCAT GCCCTCGGCA CCGAATACCG ACCGCTGTAT TGA
Protein sequence | MSTRTEVEIL SPGAFASIQD GGRRGHRRIG VPWAGVLDRR LMRIANALAG RAEDAAVIEC FDGGLHVAAS GGAVKLAVAG DAVVEVEGTE GRRQLAPWRS VTLADGEQLR IRKMEGGRIA MVAIVGLEPA AVMGSASTYA RAGIGGVDGR ALGAGTRLAL SADADPWDSD RVLAQSPAAD TGPIRLVPGP QADHFSPTAL DALVGGEYRV TTEADRMGIR LEGAQLEHAG AAEIVSDATV PGSIQVPGAG QPIVLLADAQ TAGGYPKIAT VIGADLGRLA ALRPGQSLRF AAVSAAEGAC IARAAETETR ALIASIRALP PDGIDLMALY TGNLVDGVVH ALGTEYRPLY
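The relationship between the two sequence rows can be spot-checked by translating the gene sequence. The following is a minimal sketch, not the method the database used: it builds the codon table from the conventional TCAG ordering and assumes that, for this purpose, translation table 11 assigns the same amino acids to codons as the standard code (it differs mainly in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate("ATGAGCACCC GGACCGAAGT GGAAATCCTC"))  # MSTRTEVEIL
```

The printed peptide matches the first ten residues of the protein sequence row. Passing the full gene sequence to `translate` and comparing the result with the protein row (after removing spaces) would extend the check to the whole record.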