Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1941 |
Symbol | dhaK |
ID | 6146488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1962926 |
End bp | 1963996 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616817 |
Product | dihydroxyacetone kinase subunit DhaK |
Protein accession | YP_001743993 |
Protein GI | 170681763 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2376] Dihydroxyacetone kinase |
TIGRFAM ID | [TIGR02363] dihydroxyacetone kinase, DhaK subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.0517776 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAT TGATCAATGA TGTGCAAGAC GTACTGGACG AACAACTGGC AGGACTAGCG AAAGCGCATC CATCGCTGAC ACTGCACCAG GATCCGGTGT ATGTCACCCG AGCTGATGCC CCCGTTGCAG GTAAAGTCGC CCTGCTGTCG GGTGGCGGCA GCGGACACGA GCCGATGCAC TGTGGCTATA TCGGTCAGGG GATGCTTTCG GGAGCCTGTC CGGGCGAAAT TTTCACCTCA CCGACGCCCG ATAAAATCTT TGAATGCGCC ATGCAAATTG ATGGCGGCGA AGGTGTACTG TTGATTATCA AAAATTACAC CGGCGATATT CTTAACTTCG AAACAGCAAC CGAGTTACTG CACGATAGCG GCGTAAAAGT GACCACTGTG GTCATTGATG ACGACGTTGC AGTAAAAGAC AGTCTTTATA CCGCCGGGCG GCGCGGCGTT GCCAACACCG TATTAATTGA AAAACTCGTA GGTGCAGCGG CGGAGCGTGG CGACTCACTG GACGCCTGTG CGGAACTGGG GCGTAAGTTG AATAATCAAG GCCATTCAAT AGGTATCGCT CTCGGTGCCT GTACTGTTCC TGCCGCGGGC AAACCTTCTT TTACCCTGGC GGATAATGAG ATGGAATTTG GCGTCGGCAT TCATGGTGAG CCAGGTATTG ACCGCCGCCC CTTCTCTTCC CTTGATCAAA CCGTCGATGA AATGTTCGAC ACCCTGCTGG AAAATGGCTC ATACCATCGC ACTTTGCGTT TCTGGGATTA TCAACAAGGC AGCTGGCAGG AAGAACCACA AACCAAACAA CCGCTCCAGT CTGGCGATCG GGTGATTGCG CTGGTTAACA ATCTTGGCGC AACTCCGCTT TCTGAGCTGT ACGGCGTCTA TAACCGCCTG ACCACACGTT GCCAGCAAGC GGGATTGACT ATCGAACGTA ATTTAATTGG CGCGTACTGC ACCTCACTGG ATATGACCGG TTTCTCAATC ACCTTACTGA AAGTTGATGA CGAAACGCTG GCACTCTGGG ACGCCCCGGT CCACACCCCG GCCCTTAACT GGGGTAAATA A
|
Protein sequence | MKKLINDVQD VLDEQLAGLA KAHPSLTLHQ DPVYVTRADA PVAGKVALLS GGGSGHEPMH CGYIGQGMLS GACPGEIFTS PTPDKIFECA MQIDGGEGVL LIIKNYTGDI LNFETATELL HDSGVKVTTV VIDDDVAVKD SLYTAGRRGV ANTVLIEKLV GAAAERGDSL DACAELGRKL NNQGHSIGIA LGACTVPAAG KPSFTLADNE MEFGVGIHGE PGIDRRPFSS LDQTVDEMFD TLLENGSYHR TLRFWDYQQG SWQEEPQTKQ PLQSGDRVIA LVNNLGATPL SELYGVYNRL TTRCQQAGLT IERNLIGAYC TSLDMTGFSI TLLKVDDETL ALWDAPVHTP ALNWGK
|
| |