Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3606 |
Symbol | |
ID | 7873111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3957836 |
End bp | 3959197 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643700546 |
Product | UDP-N-acetylglucosamine pyrophosphorylase |
Protein accession | YP_002890576 |
Protein GI | 237654262 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) |
TIGRFAM ID | [TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.311729 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGTCG TCGTTCTCGC TGCCGGCCAG GGCAAGCGCA TGCGCTCGGT CCTGCCCAAG GTCCTCCAAC CCATCGCCGG CAAGCCGATG CTGGCCCACG TGCTCGATGC CGCCCGCACG CTCGACGCCC AGCGCGTCTG CGTGGTGTAC GGCCATGGGG GTGAAGTGGT GCGCGAGCGC CTCGATGCCG TCGACCTGGC CTGGGCGCGG CAGGAGCCGC AGCTCGGTAC CGGTCATGCG GTGCAGCAGG CGCTGCCCCA CCTCGACGAC GAGCACGTGG CCCTGGTGCT GTACGGCGAC GTGCCGCTGA TCGGCGTGCC CACGCTGCGC CGCCTGGTCG CCGCCGCCGG CGACGCGCGC CTGGCGCTGC TCACCGTCGA GATGGACAAC CCGACCGGCT ACGGCCGCAT CCTGCGCGAC GCCGCCGGCA AGGTGGTGCG CATCGTCGAG GAGAAGGACG CCTCCGACGC GGAGCGCAAG GTGCGCGAGG TCAACACCGG CATCCTGGTC GCCCCGGTCC GCCACCTGCG CGCCTGGCTC GGCCGCATCG GCAACGACAA CGCCCAGGGC GAGTACTACC TCACCGACAT CATCGGGCTG GCCGTGGCCG ACGGCGTCGA GGTGACGACC GTGCAGCCCG ACGCCTTCGC CGAGACGCTG GGCGTCAACA ACAAGATGCA GCTCGCCGAG CTCGAGCGCA TCCACCAGGG CAACATCGCC CGCCGCCTGA TGGAAGAGGG CGTCACCCTG ATCGACCCGG CGCGCATCGA CGTGCGCGGC ACGCTCACGG TCGGGCGCGA CGTCGAGATC GACGTCAATT GCGTCTTCGA AGGCGAGGTC GAGCTCGGCG ACGGCGTACG CATCGGTGCC AACTGCGTCG TCCGCGACGC CCGCATCGGC GCCGGTACCC GGCTCGAGCC CTTCAGCCAC GTCGACAGCA CCACCATGGG CCAGGCCTGC GTGATCGGGC CCTACGCCCG CACCCGCCCC GGCACCGTGC TCGGCACCGA CGTGCACCTG GGCAACTTCG TCGAGATCAA GAACAGCGTC ATCGCCGACC ACTCCAAGGC CAACCACCTC GCCTACGTGG GCGACGCCGA CGTCGGCAGC AAGGTCAACA TCGGCGCCGG CACCATCACC TGCAACTACG ACGGCGCCAA CAAGCACCGC ACCATCATCG AGGACGAGGT CTTCATCGGC TCCGACACCC AGCTCGTCGC GCCGGTGCGC GTGGGCCGCG GCGCCACCCT GGGCGCCGGC ACCACGCTCA CCAAGGACGC CCCCGCCGGA CAGCTCACCG TGTCGCGCAG CAAGCAGATC AGCCTCGAAC ACTGGAAGCG CCCGGTCAAG CAGCCGCGCT GA
|
Protein sequence | MQVVVLAAGQ GKRMRSVLPK VLQPIAGKPM LAHVLDAART LDAQRVCVVY GHGGEVVRER LDAVDLAWAR QEPQLGTGHA VQQALPHLDD EHVALVLYGD VPLIGVPTLR RLVAAAGDAR LALLTVEMDN PTGYGRILRD AAGKVVRIVE EKDASDAERK VREVNTGILV APVRHLRAWL GRIGNDNAQG EYYLTDIIGL AVADGVEVTT VQPDAFAETL GVNNKMQLAE LERIHQGNIA RRLMEEGVTL IDPARIDVRG TLTVGRDVEI DVNCVFEGEV ELGDGVRIGA NCVVRDARIG AGTRLEPFSH VDSTTMGQAC VIGPYARTRP GTVLGTDVHL GNFVEIKNSV IADHSKANHL AYVGDADVGS KVNIGAGTIT CNYDGANKHR TIIEDEVFIG SDTQLVAPVR VGRGATLGAG TTLTKDAPAG QLTVSRSKQI SLEHWKRPVK QPR
|
| |