Gene EcSMS35_2540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2540 
Symbolglk 
ID6146062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2599268 
End bp2600233 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content51% 
IMG OID641617412 
Productglucokinase 
Protein accessionYP_001744583 
Protein GI170681946 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0837] Glucokinase 
TIGRFAM ID[TIGR00749] glucokinase, proteobacterial type 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.692132 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAGT ATGCATTAGT CGGTGATGTG GGCGGCACCA ACGCACGTCT TGCTCTGTGT 
GATATTGCCA GTGGTGAAAT CTCGCAGGCT AAGACCTATT CAGGGCTTGA TTACCCCAGC
CTCGAAGCGG TCATTCGCGT TTATCTTGAA GAACATAAGG TCGAGGTGAA AGACGGCTGT
ATTGCCATCG CTTGCCCAAT TACCGGTGAC TGGGTGGCAA TGACCAACCA TACCTGGGCG
TTCTCAATTG CCGAAATGAA AAAGAATCTC GGTTTTAGCT ATCTGGAAAT TATTAACGAT
TTTACTGCTG TATCGATGGC GATCCCGATG CTGAAAAAAG AGCATCTGAT TCAGTTTGGT
GGCGCAGAAC CGGTAGAAGG TAAGCCTATT GCGGTTTACG GTGCCGGAAC GGGGCTAGGG
GTTGCGCATC TGGTCCATGT CGATAAGCGT TGGGTAAGCT TGCCAGGCGA AGGCGGTCAC
GTTGATTTTG CGCCGAATAG TGAAGAAGAG GGCATTATCC TCGAAATACT GCGTGCAGAA
ATTGGTCATG TTTCGGCAGA GCGCGTGCTT TCTGGCCCTG GGCTGGTGAA TTTGTATCGC
GCAATTGTTA AAGCCGACAA CCGCCTGCCA GAAAATCTCA AGCCAAAAGA TATTACCGAA
CGCGCGCTGG CTGACAGCTG CACCGATTGC CGCCGCGCGT TGTCGCTGTT TTGCGTCATT
ATGGGCCGTT TTGGCGGCAA TCTGGCGCTC AATCTCGGGA CATTTGGCGG CGTGTTTATT
GCCGGTGGTA TCGTGCCGCG CTTCCTTGAG TTCTTCAAAG CCTCTGGTTT CCGTGCCGCA
TTTGAAGATA AAGGGCGCTT TAAAGAATAT GTCCATGATA TTCCGGTGTA TCTCATCGTC
CATGACAATC CGGGCCTTCT CGGTTCCGGC GCACATTTAC GCCAGACCTT AGGCCACATT
CTGTAA
 
Protein sequence
MTKYALVGDV GGTNARLALC DIASGEISQA KTYSGLDYPS LEAVIRVYLE EHKVEVKDGC 
IAIACPITGD WVAMTNHTWA FSIAEMKKNL GFSYLEIIND FTAVSMAIPM LKKEHLIQFG
GAEPVEGKPI AVYGAGTGLG VAHLVHVDKR WVSLPGEGGH VDFAPNSEEE GIILEILRAE
IGHVSAERVL SGPGLVNLYR AIVKADNRLP ENLKPKDITE RALADSCTDC RRALSLFCVI
MGRFGGNLAL NLGTFGGVFI AGGIVPRFLE FFKASGFRAA FEDKGRFKEY VHDIPVYLIV
HDNPGLLGSG AHLRQTLGHI L