Gene Information |
Locus tag | ECH74115_1499 |
Symbol | nagK |
ID | 6968342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1481309 |
End bp | 1482220 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643385470 |
Product | N-acetyl-D-glucosamine kinase |
Protein accession | YP_002269964 |
Protein GI | 209397162 |
COG category | [G] Carbohydrate transport and metabolism; [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
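
The coordinate and length fields in this record can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that the listed gene length is consistent with) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1481309
end_bp = 1482220
protein_length_aa = 303

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
expected_protein_aa = codons - 1

print(gene_length_bp)       # 912
print(remainder)            # 0
print(expected_protein_aa)  # 303
```

Under these assumptions the computed values agree with the Gene Length (912 bp) and Protein Length (303 aa) fields.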
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000306944 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATTACG GGTTTGATAT TGGTGGAACA AAAATTGCGC TTGGCGTGTT TGATAGCGGT CGGCAGTTGC AGTGGGAAAA GCGGGTGCCG ACACCGCGTG ACAGCTATGA CGCATTTTTA GATGCAGTGT GCGAGCTGGT AGCCGAAGCT GATCAACGTT TTGGCTGTAA AGGCTCTGTC GGCATCGGTA TTCCGGGAAT GCCGGAAACA GAAGATGGTA CGCTGTATGC CGCCAATGTC CCTGCTGCCA GCGGTAAACC GCTGCGTGCC GACCTGAGCG CACGTCTTGA TCGCGATGTA CGCCTTGATA ACGATGCCAA CTGTTTTGCC CTTTCAGAAG CCTGGGATGA CGAATTTACG CAATATCCGT TGGTGATGGG GTTGATTCTC GGCACCGGCG TTGGCGGCGG GCTGATTTTC AACGGCAAAC CGATTACCGG GAAAAGCTAC ATTACCGGCG AGTTTGGCCA TATGCGTCTG CCGGTTGATG CGTTAACCAT GATGGGGCTG GATTTCCCGT TACGCCGCTG CGGCTGTGGT CAGCATGGCT GCATTGAAAA TTATCTGTCT GGTCGCGGTT TTGCGTGGCT GTATCAACAC TATTATCATC AACCGTTGCC GGCTCCCGAA ATTATTGCGC TTTATGATCA AGGCGATGAG CAGGCAAGGG CGCACGTTGA GCGTTATCTG GATTTATTAG CGGTTTGTCT GGGAAATATC CTGACCATTG TTGACCCTGA CCTGGTCGTC ATTGGTGGTG GCTTATCGAA TTTCCCGGCA ATCACAACGC AACTGGCGGA CAGGCTGCCT CGTCATCTCT TACCTGTAGC TCGTGTTCCG CGCATTGAAC GCGCGCGCCA CGGTGATGCG GGAGGAATGC GTGGTGCGGC CTTCCTACAT CTAACCGATT AA
|
Protein sequence | MYYGFDIGGT KIALGVFDSG RQLQWEKRVP TPRDSYDAFL DAVCELVAEA DQRFGCKGSV GIGIPGMPET EDGTLYAANV PAASGKPLRA DLSARLDRDV RLDNDANCFA LSEAWDDEFT QYPLVMGLIL GTGVGGGLIF NGKPITGKSY ITGEFGHMRL PVDALTMMGL DFPLRRCGCG QHGCIENYLS GRGFAWLYQH YYHQPLPAPE IIALYDQGDE QARAHVERYL DLLAVCLGNI LTIVDPDLVV IGGGLSNFPA ITTQLADRLP RHLLPVARVP RIERARHGDA GGMRGAAFLH LTD
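
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and it only translates the first 30 and last 12 nucleotides of the gene sequence as a spot check. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and do not come from the record or from any library.

```python
# Codon table built from the compact standard-code string, with bases
# ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 12 nucleotides of the gene sequence in this record.
first_30 = "ATGTATTACG GGTTTGATAT TGGTGGAACA"
last_12 = "CTAACCGATT AA"

print(translate(first_30))  # MYYGFDIGGT
print(translate(last_12))   # LTD
```

The two printed fragments match the first ten and last three residues of the protein sequence. Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content field, although that result is not shown here.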