Gene Information |
Locus tag | EcSMS35_2007 |
Symbol | nagK |
ID | 6142817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2027006 |
End bp | 2027917 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616883 |
Product | N-acetyl-D-glucosamine kinase |
Protein accession | YP_001744059 |
Protein GI | 170682022 |
COG category | [G] Carbohydrate transport and metabolism; [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
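The coordinate and length fields in this block are mutually consistent, which a short calculation can confirm. The Python sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2027006
end_bp = 2027917

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count, leftover = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, leftover, protein_length)  # prints: 912 0 303
```

A leftover of 0 means the gene length is a whole number of codons, as expected for a CDS that is not spliced and not a pseudogene.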
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.988883 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.0770572 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATTACG GGTTTGATAT TGGTGGAACA AAAATTGCGC TAGGCGTGTT TGATAGCGGT CGGCAGTTGC AGTGGGAAAA GCGGGTGCCG ACACCGCGTG ACAGCTATGA CGCATTTTTA GATGCAGTGT GTGAGCTGGT AGCTGAAGCT GACCAACGTT TTGGCTGTAA AGGCTCTGTC GGCATCGGTA TTCCGGGAAT GCCGGAAACA GAAGATGGTA CGCTGTATGC CGCCAATGTC CCCGCTGCCA GCGGTAAACC GCTGCGTGCC GACCTGAGCG CACGTCTTGA TCGCGATGTA CGCCTTGATA ACGATGCCAA CTGTTTTGCC CTTTCAGAAG CCTGGGATGA TGAATTTACT CAATATCCAC TGGTGATGGG GTTGATTCTC GGCACCGGCG TTGGCGGCGG GCTGATTTTC AACGGTAAAC CAATTACCGG TAAAAGCTAT ATTACCGGCG AATTTGGCCA TATGCGTCTG CCGGTTGATG CGTTAACCAT GATGGGGCTG GATTTCCCGT TACGCCGCTG CGGCTGTGGT CAGCATGGCT GCATTGAAAA TTATCTGTCT GGTCGCGGTT TTGCGTGGCT GTATCAACAC TATTATCATC AACCGTTGCA GGCTCCTGAG ATCATTGCGC TTTATGATCA AGGCGATGAG CAGGCAAGGG CGCACGTTGA GCGTTATCTG GATTTATTAG CGGTTTGTCT GGGAAATATC CTGACCATTG TTGACCCTGA CCTGGTCGTC ATTGGTGGAG GCTTATCGAA TTTCCCGGCA ATCACAACGC AACTGGCGGA CAGGCTGCCT CGTCATCTCT TACCTGTAGC TCGTGTTCCG CGCATTGAAC GCGCGCGCCA CGGTGATGCG GGGGGAATGC GTGGTGCGGC CTTCCTACAT CTAACCGATT AA
|
Protein sequence | MYYGFDIGGT KIALGVFDSG RQLQWEKRVP TPRDSYDAFL DAVCELVAEA DQRFGCKGSV GIGIPGMPET EDGTLYAANV PAASGKPLRA DLSARLDRDV RLDNDANCFA LSEAWDDEFT QYPLVMGLIL GTGVGGGLIF NGKPITGKSY ITGEFGHMRL PVDALTMMGL DFPLRRCGCG QHGCIENYLS GRGFAWLYQH YYHQPLQAPE IIALYDQGDE QARAHVERYL DLLAVCLGNI LTIVDPDLVV IGGGLSNFPA ITTQLADRLP RHLLPVARVP RIERARHGDA GGMRGAAFLH LTD
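The gene and protein sequences are printed in space-separated blocks of 10 characters. As listed, the gene sequence begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the Strand field is "-". The Python sketch below shows one way to strip the block spacing, compute GC content, and translate with the amino-acid assignments of NCBI translation table 11 (which are the same as those of the standard table; the two differ in permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the code assumes the input contains only A, C, G, and T.

```python
from itertools import product

# NCBI translation table 11 amino-acid assignments, with codons
# enumerated in T, C, A, G order (third base varying fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases (assumes a non-empty sequence)."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame 0, stopping at the first stop codon.

    Raises KeyError on any codon containing a character other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two and last two blocks of the gene sequence in this record.
first_blocks = "ATGTATTACG GGTTTGATAT"
last_blocks = "CTAACCGATT AA"

print(translate(first_blocks))  # prints: MYYGFD
print(translate(last_blocks))   # prints: LTD
```

Both fragments agree with the start (MYYGFD...) and end (...LTD) of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.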