Gene Information |
Locus tag | SeD_A2148 |
Symbol | nagK |
ID | 6872606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2060245 |
End bp | 2061156 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642785254 |
Product | N-acetyl-D-glucosamine kinase |
Protein accession | YP_002215917 |
Protein GI | 198245162 |
COG category | [G] Carbohydrate transport and metabolism; [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
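The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2060245
END_BP = 2061156
GENE_LENGTH_BP = 912
PROTEIN_LENGTH_AA = 303


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 912
print(expected_protein_length(GENE_LENGTH_BP))  # 303
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.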
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.964696 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.000723972 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTATTACG GGTTTGACAT TGGCGGAACA AAGATTGCGT TAGGCGTATT TGACTCAACG CGGCGGCTGC AGTGGGAAAA ACGGGTTCCC ACGCCCCATA CCAGCTATAG CGCCTTTTTA GACGCCGTAT GCGAACTGGT CGAGGAAGCC GACCAGCGAT TTGGCGTAAA AGGGTCGGTA GGGATTGGCA TACCCGGTAT GCCGGAAACC GAAGACGGCA CGCTGTACGC CGCGAATGTC CCGGCAGCCA GCGGTAAGCC GTTGCGCGCC GATCTCAGCG CCCGACTGGA TCGCGATGTG CGTCTGGACA ATGACGCTAA CTGTTTTGCC TTGTCCGAAG CCTGGGATGA TGAATTCACG CAATATCCTT TGGTTATGGG GCTTATCCTC GGCACCGGCG TCGGCGGCGG CCTGGTGCTA AACGGGAAGC CGATTACTGG TCAGAGCTAT ATCACCGGCG AGTTTGGTCA TATGCGTTTG CCGGTTGACG CGCTAACGTT GATGGGGTTT GATTTTCCTC TCCGCCGTTG TGGATGCGGC CAGATGGGCT GCATTGAGAA TTATCTGTCC GGGCGCGGGT TTGCGTGGCT ATATCAGCAT TATTATCATC AATCGCTGCA GGCGCCGGAG ATTATCGCGT TGTGGGAGCA GGGCGATGAG CAGGCGCACG CGCATGTTGA GCGCTATCTG GATTTACTGG CGGTTTGTCT GGGGAATATA CTGACGATTG TCGATCCCGA TTTATTGGTG ATCGGCGGCG GACTATCGAA CTTTACGGCA ATAACAACGC AACTGGCGGA AAGACTGCCG CGCCATCTCC TCCCTGTTGC CCGCGCGCCG CGCATTGAGC GTGCGCGGCA TGGGGATGCA GGTGGGAGGC GCGGTGCTGC TTTTTTACAT CTTACCGACT AA
|
Protein sequence | MYYGFDIGGT KIALGVFDST RRLQWEKRVP TPHTSYSAFL DAVCELVEEA DQRFGVKGSV GIGIPGMPET EDGTLYAANV PAASGKPLRA DLSARLDRDV RLDNDANCFA LSEAWDDEFT QYPLVMGLIL GTGVGGGLVL NGKPITGQSY ITGEFGHMRL PVDALTLMGF DFPLRRCGCG QMGCIENYLS GRGFAWLYQH YYHQSLQAPE IIALWEQGDE QAHAHVERYL DLLAVCLGNI LTIVDPDLLV IGGGLSNFTA ITTQLAERLP RHLLPVARAP RIERARHGDA GGRRGAAFLH LTD
|
| |
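The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is an illustration only, assuming the gene sequence is listed 5' to 3' in coding orientation (it begins with ATG although the gene lies on the minus strand) and that the spaces are display grouping. Translation table 11 uses the same amino acid assignments as the standard genetic code, so the standard assignment string is used here; the helper names are hypothetical.

```python
from itertools import product

# Codons in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base groups of the gene sequence above.
prefix = "ATGTATTACG GGTTTGACAT TGGCGGAACA"
print(translate(prefix))  # MYYGFDIGGT
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be checked in the same way.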