Gene EcHS_A1302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1302 
SymboldhaM 
ID5593672 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1296135 
End bp1297553 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content50% 
IMG OID640920459 
Productdihydroxyacetone kinase subunit M 
Protein accessionYP_001458020 
Protein GI157160702 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01003] Phosphotransferase System HPr (HPr) Family
[TIGR02364] dihydroxyacetone kinase, phosphotransfer subunit 


Plasmid Coverage information

Num covering plasmid clones59 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAAACC TGGTCATAGT TTCACATAGC AGCCGACTGG GAGAAGGTGT CGGTGAATTA 
GCCCGTCAGA TGTTAATGAG TGATAGTTGT AAAATCGCCA TTGCCGCGGG AATTGACGAT
CCACAAAATC CCATTGGTAC CGATGCCGTC AAAGTGATGG AGGCCATCGA ATCTGTTGCT
GATGCCGACC ATGTGCTGGT CATGATGGAT ATGGGTAGCG CATTATTGAG TGCTGAAACT
GCGCTGGAAT TGCTGGCTCC CGAGATCGCC GCAAAAGTAC GTTTGTGTGC TGCGCCGTTG
GTCGAAGGTA CACTGGCAGC AACGGTCAGC GCGGCCTCGG GAGCGGATAT CGACAAAGTT
ATCTTTGACG CTATGCATGC GCTGGAAGCC AAACGTGAAC AACTGGGTTT ACCGTCCTCC
GACACTGAAA TCTCTGACAC ATGTCCTGCG TACGATGAAG AAGCCCGTTC TCTGGCGGTG
GTCATAAAAA ACCGTAACGG CCTGCATGTA CGTCCGGCCT CCCGGCTGGT TTATACCTTA
TCGACATTTA ATGCCGATAT GTTGCTGGAA AAAAACGGCA AATGCGTCAC ACCAGAGAGT
ATTAACCAGA TTGCGTTACT ACAAGTTCGC TATAACGATA CGCTGCGCCT GATTGCGAAA
GGGCCAGAAG CTGAAGAGGC ACTGATCGCT TTCCGTCAGC TGGCTGAAGA TAACTTTGGT
GAAACGGAGG AAGTCGCTCC ACCTACTCTG CGTCCCGTTC CGCCTGTTTC GGGTAAAGCC
TTTTATTATC AACCAGTTTT ATGTACGGTA CAGGCAAAAT CAACCCTGAC CGTGGAAGAA
GAACAAGAAC GATTACGCCA GGCTATTGAC TTCACGTTAT TAGATCTGAT GACGTTAACA
GCGAAAGCAG AAGCCAGCGG GCTTGACGAT ATTGCCGCAA TCTTTTCTGG TCACCATACA
CTGTTAGATG ATCCGGAACT GCTGGCGGCG GCAAGCGAAC TCCTTCAGCA TGAACATTGC
ACGGCAGAAT ATGCCTGGCA GCAAGTTCTT AAAGAACTTA GCCAGCAATA CCAGCAACTG
GATGATGAAT ATCTACAAGC TCGCTATATT GATGTGGACG ATCTTCTGCA TCGCACCCTG
GTCCACCTGA CCCAAACGAA AGAAGAACTC CCGCAGTTTA ACTCGCCAAC TATTCTACTG
GCGGAGAACA TTTATCCTTC CACAGTACTG CAACTGGATC CGGCGGTTGT AAAAGGTATC
TGCCTTAGCG CCGGAAGTCC GGTATCCCAC AGCGCCCTAA TCGCCCGTGA ACTGGGGATT
GGCTGGATTT GCCAGCAGGG TGAGAAACTG TATGCGATAC AACCAGAAGA AACGCTAACG
CTGGACGTTA AAACGCAACG TTTCAACCGT CAGGGTTAA
 
Protein sequence
MVNLVIVSHS SRLGEGVGEL ARQMLMSDSC KIAIAAGIDD PQNPIGTDAV KVMEAIESVA 
DADHVLVMMD MGSALLSAET ALELLAPEIA AKVRLCAAPL VEGTLAATVS AASGADIDKV
IFDAMHALEA KREQLGLPSS DTEISDTCPA YDEEARSLAV VIKNRNGLHV RPASRLVYTL
STFNADMLLE KNGKCVTPES INQIALLQVR YNDTLRLIAK GPEAEEALIA FRQLAEDNFG
ETEEVAPPTL RPVPPVSGKA FYYQPVLCTV QAKSTLTVEE EQERLRQAID FTLLDLMTLT
AKAEASGLDD IAAIFSGHHT LLDDPELLAA ASELLQHEHC TAEYAWQQVL KELSQQYQQL
DDEYLQARYI DVDDLLHRTL VHLTQTKEEL PQFNSPTILL AENIYPSTVL QLDPAVVKGI
CLSAGSPVSH SALIARELGI GWICQQGEKL YAIQPEETLT LDVKTQRFNR QG