Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3803 |
Symbol | |
ID | 7874045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4196194 |
End bp | 4197330 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643700745 |
Product | UDP-N-acetylglucosamine 2-epimerase |
Protein accession | YP_002890769 |
Protein GI | 237654455 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0381] UDP-N-acetylglucosamine 2-epimerase |
TIGRFAM ID | [TIGR00236] UDP-N-acetylglucosamine 2-epimerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAC TCAAAGTCAT GTCCGTGGTC GGCACCCGGC CCGAAATCAT CCGCCTGTCG CGCGTGCTGG CGGCGCTCGA CGAGCACTGC GAGCACGTAC TGGTGCACAC CGGTCAGAAC TACGACTACG AGCTCAACCA GGTCTTCTTC GACGACCTGG GGGTGCGCAA GCCGGATCAC TTCCTCAACA GCGCCGAGGG CAGCACCGGC GCGGCGCACA CCATCGGCAA CCTGATCATC GCGGTCGACC GCGTGCTGGG CGAGGTGCAG CCCGAGGCCA TGCTGGTGCT GGGCGACACC AATAGCTGCC TGTCGGTGAT CCCGGCCAAG CGGCGCAAGA TCCCGATCTT CCACATGGAG GCGGGCAATC GCTGCTTCGA CCAGCGCGTG CCGGAAGAGA CCAATCGCCG CATCGTCGAC CACACCGCCG ACATCAACCT CACCTACAGC ACCATCGCGC GCGACTACCT GCTGCGCGAG GGCCTGCCGC CCGACCAGGT GATCAAGACC GGCAGCCCGA TGTTCGAGGT GCTGACGCAC TATCGCCCGC GCATCGAGGC GTCGGACGTG CTGCAGCGCC TGGCGCTGGA GGCGGGGCGC TACTTCGTGG TGAGCGCGCA CCGGGAAGAG AACATCGAAT CCGAGAAGTC CTTCACCAAG CTGGTGGCGG TGCTCAACGC AGTGGCGGAA GACCACGGCC TGCCGGTGAT CGTGTCGACC CACCCGCGCA CGCAGAAGCG CGTGGATGCC ACCGGCGCGA AGTTCCACCC GATGGTGCGG CTGCTCAAGC CGCTGGGCTT TCACGACTAC GTGAAGCTGC AGCTTTCGGC CAAGGCGGTG CTGTCGGACA GCGGCACGAT CAACGAGGAG TCGTCGATCC TCAACTTCCC GGCGCTGAAC CTGCGCGAGG CGCACGAGCG GCCGGAGGGC ATGGAAGAGG CGGCGGTGAT GATGGTGGGG CTGGAGGTCG ACCGGGTGCG CCAGGGGCTG GCGGTGCTCG CGTCGCAGTC GCGCGGTGAG GAACGCAGCC TGCGCCAGGT GGCCGACTAC AGCATGCCGA ACGTGTCGGA CAAGGTGGTG CGCATCATCC ACAGCTACAC GGATTACGTG AAGCGGGTGG TGTGGAGGCA GTACTAA
|
Protein sequence | MKKLKVMSVV GTRPEIIRLS RVLAALDEHC EHVLVHTGQN YDYELNQVFF DDLGVRKPDH FLNSAEGSTG AAHTIGNLII AVDRVLGEVQ PEAMLVLGDT NSCLSVIPAK RRKIPIFHME AGNRCFDQRV PEETNRRIVD HTADINLTYS TIARDYLLRE GLPPDQVIKT GSPMFEVLTH YRPRIEASDV LQRLALEAGR YFVVSAHREE NIESEKSFTK LVAVLNAVAE DHGLPVIVST HPRTQKRVDA TGAKFHPMVR LLKPLGFHDY VKLQLSAKAV LSDSGTINEE SSILNFPALN LREAHERPEG MEEAAVMMVG LEVDRVRQGL AVLASQSRGE ERSLRQVADY SMPNVSDKVV RIIHSYTDYV KRVVWRQY
|
| |