Gene Information |
Locus tag | EcSMS35_1943 |
Symbol | dhaM |
ID | 6146765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1964650 |
End bp | 1966068 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641616819 |
Product | dihydroxyacetone kinase subunit M |
Protein accession | YP_001743995 |
Protein GI | 170682048 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR01003] Phosphotransferase System HPr (HPr) Family; [TIGR02364] dihydroxyacetone kinase, phosphotransfer subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.011305 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAACC TGGTCATAGT TTCACATAGC AGCCGACTGG GAGAAGGTGT CGGTGAACTA GCCCGTCAGA TGTTAATGGG TGATATTTGT AAAATCGCCA TTGCCGCGGG AATTGACGAT CCACAAAATC CCATTGGTAC CGATGCCGTC AAAGTGATGG AGGCCATCGA ATCTGTTGCT GATGCCGACC ATGTGCTGGT CATGATGGAT ATGGGTAGCG CATTATTGAG TGCTGAAACT GCGCTGGAAT TGCTGGCTCC CGAGATCGCC GCAAAAGTAC GTTTGTGTGC TGCGCCGTTG GTCGAAGGTA CACTGGCAGC AACGGTCAGC GCGGCCTCGG GGGCGGATAT CGACAAAGTC ATCTTTGACG CCATGCATGC GCTGGAAGCC AAACGTGAAC AACTGGGTTT ACCGTCCTCC GACACTGAAA TCTCTGACAC ATGTCCTCCG TACGATGAAG AAGCCCGTTC TCTGTCGGTG GTCATAAAAA ACCGTAACGG CCTGCATGTA CGTCCGGCCT CCCGGCTGGT TTATACCTTA TCGACATTTA ATGCCGATAT GTTGCTGGAG AAAAACGGCA AATGCGTTAC ACCGGAGAGT ATTAACCAGA TTGCGTTACT ACAAGTTCGC TATAACGATA CGCTGCGCCT GATTGCGAAA GGACCAGAAG CTGAAGAGGC ACTGATCGCC TTCCGTCAGC TGGCTGAAGA TAACTTTGGT GAAACGGAGG AAGTTGCTCC ACCTACTCTG CGTCCCGTTC CGCCTGTTTC GGGTAAAGCC TTTTATTATC AACCTGTTTT ATGTACGGTA CAGGCAAAAT CAATCCTGAC CGTGGAAGAA GAACAAGAAC GATTACGCCA GGCCATTGAC TTCACGTTAT TAGATCTGAT GACGTTAACA GCGAAAGCAG AAACCAGCGG GCTTGACGAT ATTGCCGCAA TCTTTTCTGG TCACCATACA CTGTTAGATG ATCCGGAACT GCAGGCGGCG GCAAGCGAAC TCCTTCAGCA TGAACATTGC ACGGCGGAAT ATGCCTGGCA ACACGTTCTT AAAGAACTTA GCCAGCAATA CCAACAGCTG GATGATGAAT ATCTACAGGC TCGCTATATC GATGTGGACG ATCTTCTGCA TCGCACCCTG GTCCACCTGA CCCAAACGAA AGAAGAACTC CCGCAGTTTA ACTCACCAAC TATTCTACTG GCGGAGAACA TTTATCCCTC CACAGTACTG CAACTGGATC CAGCGGTTGT AAAAGGTATC TGCCTTAGCG CCGGAAGTCC GCTATCCCAC AGTGCCCTAA TCGCCCGTGA ACTGGGAATT GGCTGGATTT GCCAGCAGGG TGAGAAACTG TATGCGATAC AACCAGAAGA AACGCTAACG CTGGACGTTA AAACGCAACG TTTCAACCGT CAGGGTTAA
|
Protein sequence | MVNLVIVSHS SRLGEGVGEL ARQMLMGDIC KIAIAAGIDD PQNPIGTDAV KVMEAIESVA DADHVLVMMD MGSALLSAET ALELLAPEIA AKVRLCAAPL VEGTLAATVS AASGADIDKV IFDAMHALEA KREQLGLPSS DTEISDTCPP YDEEARSLSV VIKNRNGLHV RPASRLVYTL STFNADMLLE KNGKCVTPES INQIALLQVR YNDTLRLIAK GPEAEEALIA FRQLAEDNFG ETEEVAPPTL RPVPPVSGKA FYYQPVLCTV QAKSILTVEE EQERLRQAID FTLLDLMTLT AKAETSGLDD IAAIFSGHHT LLDDPELQAA ASELLQHEHC TAEYAWQHVL KELSQQYQQL DDEYLQARYI DVDDLLHRTL VHLTQTKEEL PQFNSPTILL AENIYPSTVL QLDPAVVKGI CLSAGSPLSH SALIARELGI GWICQQGEKL YAIQPEETLT LDVKTQRFNR QG
|
| |
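The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the database's own tooling. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length (the gene sequence above ends in TAA). The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers introduced for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces between 10-base blocks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values taken from the Gene Information fields of this record.
length = gene_length(1964650, 1966068)
print(length)                  # 1419
print(protein_length(length))  # 472
```

Passing the Gene sequence field to `gc_fraction` gives a value that can be compared with the GC content field, which the record reports rounded to a whole percent.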