Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2540 |
Symbol | glk |
ID | 6146062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2599268 |
End bp | 2600233 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617412 |
Product | glucokinase |
Protein accession | YP_001744583 |
Protein GI | 170681946 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0837] Glucokinase |
TIGRFAM ID | [TIGR00749] glucokinase, proteobacterial type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.692132 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAAGT ATGCATTAGT CGGTGATGTG GGCGGCACCA ACGCACGTCT TGCTCTGTGT GATATTGCCA GTGGTGAAAT CTCGCAGGCT AAGACCTATT CAGGGCTTGA TTACCCCAGC CTCGAAGCGG TCATTCGCGT TTATCTTGAA GAACATAAGG TCGAGGTGAA AGACGGCTGT ATTGCCATCG CTTGCCCAAT TACCGGTGAC TGGGTGGCAA TGACCAACCA TACCTGGGCG TTCTCAATTG CCGAAATGAA AAAGAATCTC GGTTTTAGCT ATCTGGAAAT TATTAACGAT TTTACTGCTG TATCGATGGC GATCCCGATG CTGAAAAAAG AGCATCTGAT TCAGTTTGGT GGCGCAGAAC CGGTAGAAGG TAAGCCTATT GCGGTTTACG GTGCCGGAAC GGGGCTAGGG GTTGCGCATC TGGTCCATGT CGATAAGCGT TGGGTAAGCT TGCCAGGCGA AGGCGGTCAC GTTGATTTTG CGCCGAATAG TGAAGAAGAG GGCATTATCC TCGAAATACT GCGTGCAGAA ATTGGTCATG TTTCGGCAGA GCGCGTGCTT TCTGGCCCTG GGCTGGTGAA TTTGTATCGC GCAATTGTTA AAGCCGACAA CCGCCTGCCA GAAAATCTCA AGCCAAAAGA TATTACCGAA CGCGCGCTGG CTGACAGCTG CACCGATTGC CGCCGCGCGT TGTCGCTGTT TTGCGTCATT ATGGGCCGTT TTGGCGGCAA TCTGGCGCTC AATCTCGGGA CATTTGGCGG CGTGTTTATT GCCGGTGGTA TCGTGCCGCG CTTCCTTGAG TTCTTCAAAG CCTCTGGTTT CCGTGCCGCA TTTGAAGATA AAGGGCGCTT TAAAGAATAT GTCCATGATA TTCCGGTGTA TCTCATCGTC CATGACAATC CGGGCCTTCT CGGTTCCGGC GCACATTTAC GCCAGACCTT AGGCCACATT CTGTAA
|
Protein sequence | MTKYALVGDV GGTNARLALC DIASGEISQA KTYSGLDYPS LEAVIRVYLE EHKVEVKDGC IAIACPITGD WVAMTNHTWA FSIAEMKKNL GFSYLEIIND FTAVSMAIPM LKKEHLIQFG GAEPVEGKPI AVYGAGTGLG VAHLVHVDKR WVSLPGEGGH VDFAPNSEEE GIILEILRAE IGHVSAERVL SGPGLVNLYR AIVKADNRLP ENLKPKDITE RALADSCTDC RRALSLFCVI MGRFGGNLAL NLGTFGGVFI AGGIVPRFLE FFKASGFRAA FEDKGRFKEY VHDIPVYLIV HDNPGLLGSG AHLRQTLGHI L
|
| |