Gene EcSMS35_1943 details

Gene Information

Locus tag: EcSMS35_1943
Symbol: dhaM
ID: 6146765
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1964650
End bp: 1966068
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 50%
IMG OID: 641616819
Product: dihydroxyacetone kinase subunit M
Protein accession: YP_001743995
Protein GI: 170682048
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
TIGRFAM ID: [TIGR01003] Phosphotransferase System HPr (HPr) Family; [TIGR02364] dihydroxyacetone kinase, phosphotransfer subunit
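
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the gene length counts the terminal stop codon. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields above.
START_BP = 1964650
END_BP = 1966068
GENE_LENGTH_BP = 1419
PROTEIN_LENGTH_AA = 472


def span_length(start, end):
    """Length of a closed interval (assumes 1-based inclusive coordinates)."""
    return end - start + 1


def coding_length_aa(gene_length_bp):
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not add a residue


print(span_length(START_BP, END_BP))     # 1419
print(coding_length_aa(GENE_LENGTH_BP))  # 472
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields in the record.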


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.011305
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAAACC TGGTCATAGT TTCACATAGC AGCCGACTGG GAGAAGGTGT CGGTGAACTA 
GCCCGTCAGA TGTTAATGGG TGATATTTGT AAAATCGCCA TTGCCGCGGG AATTGACGAT
CCACAAAATC CCATTGGTAC CGATGCCGTC AAAGTGATGG AGGCCATCGA ATCTGTTGCT
GATGCCGACC ATGTGCTGGT CATGATGGAT ATGGGTAGCG CATTATTGAG TGCTGAAACT
GCGCTGGAAT TGCTGGCTCC CGAGATCGCC GCAAAAGTAC GTTTGTGTGC TGCGCCGTTG
GTCGAAGGTA CACTGGCAGC AACGGTCAGC GCGGCCTCGG GGGCGGATAT CGACAAAGTC
ATCTTTGACG CCATGCATGC GCTGGAAGCC AAACGTGAAC AACTGGGTTT ACCGTCCTCC
GACACTGAAA TCTCTGACAC ATGTCCTCCG TACGATGAAG AAGCCCGTTC TCTGTCGGTG
GTCATAAAAA ACCGTAACGG CCTGCATGTA CGTCCGGCCT CCCGGCTGGT TTATACCTTA
TCGACATTTA ATGCCGATAT GTTGCTGGAG AAAAACGGCA AATGCGTTAC ACCGGAGAGT
ATTAACCAGA TTGCGTTACT ACAAGTTCGC TATAACGATA CGCTGCGCCT GATTGCGAAA
GGACCAGAAG CTGAAGAGGC ACTGATCGCC TTCCGTCAGC TGGCTGAAGA TAACTTTGGT
GAAACGGAGG AAGTTGCTCC ACCTACTCTG CGTCCCGTTC CGCCTGTTTC GGGTAAAGCC
TTTTATTATC AACCTGTTTT ATGTACGGTA CAGGCAAAAT CAATCCTGAC CGTGGAAGAA
GAACAAGAAC GATTACGCCA GGCCATTGAC TTCACGTTAT TAGATCTGAT GACGTTAACA
GCGAAAGCAG AAACCAGCGG GCTTGACGAT ATTGCCGCAA TCTTTTCTGG TCACCATACA
CTGTTAGATG ATCCGGAACT GCAGGCGGCG GCAAGCGAAC TCCTTCAGCA TGAACATTGC
ACGGCGGAAT ATGCCTGGCA ACACGTTCTT AAAGAACTTA GCCAGCAATA CCAACAGCTG
GATGATGAAT ATCTACAGGC TCGCTATATC GATGTGGACG ATCTTCTGCA TCGCACCCTG
GTCCACCTGA CCCAAACGAA AGAAGAACTC CCGCAGTTTA ACTCACCAAC TATTCTACTG
GCGGAGAACA TTTATCCCTC CACAGTACTG CAACTGGATC CAGCGGTTGT AAAAGGTATC
TGCCTTAGCG CCGGAAGTCC GCTATCCCAC AGTGCCCTAA TCGCCCGTGA ACTGGGAATT
GGCTGGATTT GCCAGCAGGG TGAGAAACTG TATGCGATAC AACCAGAAGA AACGCTAACG
CTGGACGTTA AAACGCAACG TTTCAACCGT CAGGGTTAA
 
Protein sequence
MVNLVIVSHS SRLGEGVGEL ARQMLMGDIC KIAIAAGIDD PQNPIGTDAV KVMEAIESVA 
DADHVLVMMD MGSALLSAET ALELLAPEIA AKVRLCAAPL VEGTLAATVS AASGADIDKV
IFDAMHALEA KREQLGLPSS DTEISDTCPP YDEEARSLSV VIKNRNGLHV RPASRLVYTL
STFNADMLLE KNGKCVTPES INQIALLQVR YNDTLRLIAK GPEAEEALIA FRQLAEDNFG
ETEEVAPPTL RPVPPVSGKA FYYQPVLCTV QAKSILTVEE EQERLRQAID FTLLDLMTLT
AKAETSGLDD IAAIFSGHHT LLDDPELQAA ASELLQHEHC TAEYAWQHVL KELSQQYQQL
DDEYLQARYI DVDDLLHRTL VHLTQTKEEL PQFNSPTILL AENIYPSTVL QLDPAVVKGI
CLSAGSPLSH SALIARELGI GWICQQGEKL YAIQPEETLT LDVKTQRFNR QG
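
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard one, which assigns the same amino acids as translation table 11 (the two tables differ only in which codons may act as start codons). Only the first and last 10-base-grouped lines of the gene sequence are used as input here.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(block):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(block.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record above.
FIRST_BLOCK = "ATGGTAAACC TGGTCATAGT TTCACATAGC AGCCGACTGG GAGAAGGTGT CGGTGAACTA"
LAST_BLOCK = "CTGGACGTTA AAACGCAACG TTTCAACCGT CAGGGTTAA"

print(translate(clean(FIRST_BLOCK)))  # MVNLVIVSHSSRLGEGVGEL
print(translate(clean(LAST_BLOCK)))   # LDVKTQRFNRQG
```

The two outputs match the start and end of the protein sequence in the record. Translating the last line on its own works because the preceding lines total a multiple of three bases, so the reading frame is preserved. Applying `gc_fraction` to the full cleaned gene sequence would give the value that the GC content field reports in rounded form.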