Gene EcSMS35_2007 details

Gene Information

Locus tag: EcSMS35_2007
Symbol: nagK
ID: 6142817
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2027006
End bp: 2027917
Gene Length: 912 bp
Protein Length: 303 aa
Translation table: 11
GC content: 53%
IMG OID: 641616883
Product: N-acetyl-D-glucosamine kinase
Protein accession: YP_001744059
Protein GI: 170682022
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID:
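
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that does not appear in the protein; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2027006
end_bp = 2027917

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue; the last codon is assumed to be a stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 912 0 303
```

Under these assumptions the result agrees with the listed gene length (912 bp) and protein length (303 aa).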


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.988883
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.0770572
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATTACG GGTTTGATAT TGGTGGAACA AAAATTGCGC TAGGCGTGTT TGATAGCGGT 
CGGCAGTTGC AGTGGGAAAA GCGGGTGCCG ACACCGCGTG ACAGCTATGA CGCATTTTTA
GATGCAGTGT GTGAGCTGGT AGCTGAAGCT GACCAACGTT TTGGCTGTAA AGGCTCTGTC
GGCATCGGTA TTCCGGGAAT GCCGGAAACA GAAGATGGTA CGCTGTATGC CGCCAATGTC
CCCGCTGCCA GCGGTAAACC GCTGCGTGCC GACCTGAGCG CACGTCTTGA TCGCGATGTA
CGCCTTGATA ACGATGCCAA CTGTTTTGCC CTTTCAGAAG CCTGGGATGA TGAATTTACT
CAATATCCAC TGGTGATGGG GTTGATTCTC GGCACCGGCG TTGGCGGCGG GCTGATTTTC
AACGGTAAAC CAATTACCGG TAAAAGCTAT ATTACCGGCG AATTTGGCCA TATGCGTCTG
CCGGTTGATG CGTTAACCAT GATGGGGCTG GATTTCCCGT TACGCCGCTG CGGCTGTGGT
CAGCATGGCT GCATTGAAAA TTATCTGTCT GGTCGCGGTT TTGCGTGGCT GTATCAACAC
TATTATCATC AACCGTTGCA GGCTCCTGAG ATCATTGCGC TTTATGATCA AGGCGATGAG
CAGGCAAGGG CGCACGTTGA GCGTTATCTG GATTTATTAG CGGTTTGTCT GGGAAATATC
CTGACCATTG TTGACCCTGA CCTGGTCGTC ATTGGTGGAG GCTTATCGAA TTTCCCGGCA
ATCACAACGC AACTGGCGGA CAGGCTGCCT CGTCATCTCT TACCTGTAGC TCGTGTTCCG
CGCATTGAAC GCGCGCGCCA CGGTGATGCG GGGGGAATGC GTGGTGCGGC CTTCCTACAT
CTAACCGATT AA
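
The gene sequence is listed in space-separated blocks of ten bases, so it has to be cleaned before any per-base statistic such as GC content is computed. The sketch below shows one way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any tool referenced by this record, and the example uses only the first row of the listing.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, exactly as laid out in the listing.
first_row = "ATGTATTACG GGTTTGATAT TGGTGGAACA AAAATTGCGC TAGGCGTGTT TGATAGCGGT"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full listing (all rows joined) to the same two functions gives a value that can be compared with the GC content field of the record.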
 
Protein sequence
MYYGFDIGGT KIALGVFDSG RQLQWEKRVP TPRDSYDAFL DAVCELVAEA DQRFGCKGSV 
GIGIPGMPET EDGTLYAANV PAASGKPLRA DLSARLDRDV RLDNDANCFA LSEAWDDEFT
QYPLVMGLIL GTGVGGGLIF NGKPITGKSY ITGEFGHMRL PVDALTMMGL DFPLRRCGCG
QHGCIENYLS GRGFAWLYQH YYHQPLQAPE IIALYDQGDE QARAHVERYL DLLAVCLGNI
LTIVDPDLVV IGGGLSNFPA ITTQLADRLP RHLLPVARVP RIERARHGDA GGMRGAAFLH
LTD
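
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. Here is a minimal, dependency-free sketch of that translation. The amino-acid string is the standard assignment for NCBI table 11 in TCAG codon order; `translate` is an illustrative helper, and it does not handle ambiguous bases or the alternative start codons that table 11 allows (not needed here, since this gene starts with ATG).

```python
BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(seq: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final row of the gene sequence listing.
print(translate("ATGTATTACG GGTTTGATAT TGGTGGAACA"))  # MYYGFDIGGT
print(translate("CTAACCGATT AA"))  # LTD
```

The two fragments reproduce the first block and the last three residues of the protein sequence shown above.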