Gene EcSMS35_4366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4366 
SymbolglpK 
ID6146035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4453142 
End bp4454650 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content54% 
IMG OID641619187 
Productglycerol kinase 
Protein accessionYP_001746311 
Protein GI170680579 
COG category[C] Energy production and conversion 
COG ID[COG0554] Glycerol kinase 
TIGRFAM ID[TIGR01311] glycerol kinase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.569877 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAAA AAAAATATAT CGTTGCGCTC GACCAGGGCA CCACCAGCTC CCGCGCGGTC 
GTAATGGATC ACGATGCCAA TATCATTAGC GTGTCGCAGC GCGAATTTGA GCAAATCTAC
CCAAAACCAG GCTGGGTAGA ACACGACCCA ATGGAAATCT GGGCCACCCA AAGCTCCACG
CTGGTAGAAG TGCTGGCGAA AGCCGATATC AGTTCCGATC AAATTGCAGC TATCGGTATT
ACGAACCAGC GTGAAACCAC TATTGTCTGG GAAAAAGAAA CCGGCAAGCC TATCTATAAC
GCCATTGTCT GGCAGTGCCG TCGTACCGCA GAAATCTGCG AGCATTTAAA ACGTGACGGT
TTAGAAGATT ATATCCGCAG CAATACCGGT CTGGTGATTG ACCCGTACTT TTCTGGCACC
AAAGTGAAGT GGATCCTCGA CCATGTGGAA GGCTCTCGCG AGCGTGCGCG TCGTGGTGAA
TTGCTGTTTG GTACGGTTGA TACGTGGCTT ATCTGGAAAA TGACTCAGGG CCGTGTCCAT
GTGACCGATT ACACCAACGC CTCTCGTACC ATGTTGTTCA ACATCCATAC CCTGGACTGG
GACGACAAAA TGCTCGAGGT GCTGGATATT CCGCGCGAGA TGCTGCCAGA AGTGCGTCGT
TCTTCCGAAG TATACGGTCA GACTAACATT GGCGGCAAAG GCGGCACGCG TATTCCAATC
TCCGGGATCG CCGGTGACCA GCAGGCGGCG CTGTTTGGTC AGTTGTGCGT GAAAGAAGGG
ATGGCGAAGA ACACCTATGG CACTGGCTGC TTTATGCTGA TGAACACTGG CGAGAAAGCG
GTGAAATCAG AAAACGGCCT GCTGACCACC ATCGCCTGTG GCCCAACTGG CGAAGTAAAC
TATGCGCTGG AAGGTGCGGT GTTTATGGCG GGCGCATCCA TTCAGTGGCT GCGTGATGAG
ATGAAGTTGA TTAACGACGC CTACGATTCC GAATACTTTG CCACGAAAGT GCAAAACACC
AATGGTGTGT ATGTGGTTCC GGCCTTTACC GGGCTGGGTG CGCCATACTG GGATCCGTAT
GCACGCGGGG CGATTTTCGG TCTGACTCGT GGGGTGAACG CTAACCACAT TATACGCGCG
ACGCTGGAGT CTATTGCTTA TCAGACGCGT GACGTGCTGG AAGCGATGCA GGCCGATTCT
GGTATTCGTC TGCACGCCCT GCGCGTGGAT GGCGGCGCAG TGGCGAACAA TTTCCTGATG
CAGTTCCAGT CCGATATTCT CGGTACGCGC GTTGAGCGCC CGGAAGTGCG CGAAGTCACC
GCATTGGGTG CGGCCTATCT TGCTGGTCTG GCGGTTGGCT TCTGGCAGAA CCTCGACGAG
CTGCAAGAGA AAGCGGTGAT TGAGCGCGAG TTCCGTCCAG GCATCGAAAC CACTGAGCGT
AATTACCGTT ACGCAGGCTG GAAAAAAGCG GTGAAACGCG CGATGGCGTG GGAAGAACAC
GACGAGTAA
 
Protein sequence
MTEKKYIVAL DQGTTSSRAV VMDHDANIIS VSQREFEQIY PKPGWVEHDP MEIWATQSST 
LVEVLAKADI SSDQIAAIGI TNQRETTIVW EKETGKPIYN AIVWQCRRTA EICEHLKRDG
LEDYIRSNTG LVIDPYFSGT KVKWILDHVE GSRERARRGE LLFGTVDTWL IWKMTQGRVH
VTDYTNASRT MLFNIHTLDW DDKMLEVLDI PREMLPEVRR SSEVYGQTNI GGKGGTRIPI
SGIAGDQQAA LFGQLCVKEG MAKNTYGTGC FMLMNTGEKA VKSENGLLTT IACGPTGEVN
YALEGAVFMA GASIQWLRDE MKLINDAYDS EYFATKVQNT NGVYVVPAFT GLGAPYWDPY
ARGAIFGLTR GVNANHIIRA TLESIAYQTR DVLEAMQADS GIRLHALRVD GGAVANNFLM
QFQSDILGTR VERPEVREVT ALGAAYLAGL AVGFWQNLDE LQEKAVIERE FRPGIETTER
NYRYAGWKKA VKRAMAWEEH DE