Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3632 |
Symbol | |
ID | 7873137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3989554 |
End bp | 3990897 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643700573 |
Product | amidohydrolase |
Protein accession | YP_002890602 |
Protein GI | 237654288 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGCTGA AAACCTGTCT CGGTTGCGGG GTCGTCGTGC TCGCATCGCT GTGGCTGTCG CTGGCGCATG CGGCGGACAA GGCCGTCCTG TTCGAGAATG TCTTCGTCTT CGACGGCAGG AGCAGCGAGC GGGGCCGCTC GCCGGTCAAC GTGCTGGTGG TGGGCGACAC CATCCAGACC ATCTCGGCGG CGCCGATCGC GCCGCCCGCG GACGCCGACC TGACGCGTGT CGCCGGCGGG GGCCGCACGC TGATGCCCGG CCTGATCGAT GCGCACTGGC ACTCGATGCT GATCGGCCAG AAGCTGGTCG ATGCGATGAC CAGCGACGTC GGCTATACCA ATCTGCGTGC GGCGCAGGTC GCCGAGGCGA CGCTCATGCG CGGCTTCACC ACGGTGCGCG ACATGGGCGG GCCGGTGCAG GGCCTGCGGC GTGCGATCGA GGAGGGGCGT TTCCCCGGGC CGCGCATCTT CCCGTCGGGC GCGATGATCT CGCAGACCGG CGGCCATGGC GATTTCCGTC TGCCGCACGA GGTGCCGCGC GCCGCCAGCC AGGGTCTCAG TCATACCGAA CTTACCCGCA CCGCCGCCAT TGCCGATGGC GCCGACGAGG TGCTGCGGCG CACCCGCGAA CAGCTGATGC TCGGCGCGAC GCAGATCAAG CTGATGGCTG GCGGTGGGGT GACGTCGATC TACGACCCGA TCGACGCCAC CCAATACACC CAGGACGAGA TCCGCGCCGC AGTGGCTGCG GCAGAGAACT GGGGCACTTA CGTTACGGTG CACGCCTACA CCAGCCGTGC GGTTCAGGTC GCCATCGAGG CCGGGGTCAA GGCGGTGGAG CACGGCCAGC TGGTCGATGA GGAGACGGTC AGGCTGATGG CGCAGAAGGG GATCTGGTGG TCGCTCCAGC CCTTCCTCGA CAACGAGCTC GCCAACCCGC AGGCCGGGGC CAACCGGGTC AAGCAACTGA TGGTGGCGGC CGGCACCGAT CGCGCCTACG CACTGGCGCG CAAGCACGCG GTGAAGGTGG CCTTCGGCAC CGACATCCTG TTCTCGGGGG ACAACGGAGA GGTGCAGAAC GCACGCCTGG TCTCGCTCGA GCGCTGGTAT CCCCCCGGCG AGGTGCTGCA GATCGCGACC GGCAACAACG GCGCCCTGCT CGAGCTTACC GGCGAGCGCA ACCCGTATCG CAAGCCGCTC GGGGTCGTCG CCGAGGGCGC GCTCGCCGAC CTGTTGCTGG TCGATGGCGA CCCGACGGCG GACCTGTCGC TCATCAAGCG TCCCGAGTCG AGCTTCGTCC TCATCATGAA GAACGGGCGC ATCTACAAGA ACCTGCTGCC CTGA
|
Protein sequence | MRLKTCLGCG VVVLASLWLS LAHAADKAVL FENVFVFDGR SSERGRSPVN VLVVGDTIQT ISAAPIAPPA DADLTRVAGG GRTLMPGLID AHWHSMLIGQ KLVDAMTSDV GYTNLRAAQV AEATLMRGFT TVRDMGGPVQ GLRRAIEEGR FPGPRIFPSG AMISQTGGHG DFRLPHEVPR AASQGLSHTE LTRTAAIADG ADEVLRRTRE QLMLGATQIK LMAGGGVTSI YDPIDATQYT QDEIRAAVAA AENWGTYVTV HAYTSRAVQV AIEAGVKAVE HGQLVDEETV RLMAQKGIWW SLQPFLDNEL ANPQAGANRV KQLMVAAGTD RAYALARKHA VKVAFGTDIL FSGDNGEVQN ARLVSLERWY PPGEVLQIAT GNNGALLELT GERNPYRKPL GVVAEGALAD LLLVDGDPTA DLSLIKRPES SFVLIMKNGR IYKNLLP
|
| |