Gene Information |
Locus tag | SbBS512_E1359 |
Symbol | dhaK |
ID | 6271738 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 1237183 |
End bp | 1238292 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641725470 |
Product | dihydroxyacetone kinase subunit DhaK |
Protein accession | YP_001879980 |
Protein GI | 187731772 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2376] Dihydroxyacetone kinase |
TIGRFAM ID | [TIGR02363] dihydroxyacetone kinase, DhaK subunit |
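
The coordinate and length fields above agree with one another, which can be checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon. Neither convention is stated on the page, although both match the listed values. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1237183, 1238292)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1110 369
```

The strand does not affect this calculation, since the length depends only on the span of the coordinates.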
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TGATCAATGA TGTGCAAGAC GTACTGGACG AACAACTGGC AGGACTGGCG AAAGCGCATC CATCGCTGAC ACTGCATCAG GATCCGGTGT ATGTCACCCG AGCTGATGCC CCTGTTGCAG GAAAAGTCGC CCTGCTGTCG GGTGGCGGCA GCGGACACGA GCCGATGCAC TGTGGGTATA TCGGTCAGGG GATGCTTTCG GGGGCCTGTC CGGGCGAAAT TTTCACCTCA CCGACGCCCG ATAAAATCTT TGAATGCGCC ATGCAAGTTG ATGGCGGCGA AGGTGTACTG TTGATTATCA AAAATTACAC CGGCGATATT CTTAACTTTG AAACAGCGAC CGAGTTACTG CACGATAGCG GCGTAAAAGT GACCACTGTG GTCATTGATG ACGACGTTGC GGTAAAAGAC AGTCTTTATA CTGCCGGGCG ACGCGGCGTT GCCAACACCG TATTAATTGA AAAACTCGTA GGCGCAGCGG CGGAGCGTGG CGACTCACTG GACGCCTGTG CGGAACTGGG GCGTAAGTTG AATAATCAAG GCCACTCAAT AGGTATCGCT CTCGGTGCCT GTACCGTTCC TGCCGCGGGC AAACCTTCTT TTACCCTGGC GGATAATGAG ATGGAGTTTG GCGTCGGCAT TCATGGTGAG CCGGGTATTG ACCGCCGCTC CTTCTCTTCC CTTGATCAAA CCGTCGATGA AATGTTCGAC ACCCTGCTGG GAAATGGCTC ATACCATCGC ACTTTACGTT TCTGGGATTA TCAACAAGGC AGTTGGCAGG AAGAACAACA AACCAAACAA CCGCTCCAGT CTGGCGATCG GGTGATTGCG CTGGTTAACA ATCTTGGCGC AACTCCGCTT TCTGAGCTGT ACGGCGTCTA TAACCGCCTG ACCACACGTT GCCAGCAAGC GGGATTGACT ATCGAACGTA ATTTAATTGG CGCGTACTGC ACCTCACTGG ATATGACCGG TTTCTCAATC ACCTTACTGA AAGTTGATGA CGAAACGCTG GCACTCTGGG ACGCCCCGGT CCACACCCCG GCCCTTAACT GGGGTAAATT AGGAGAAAGC AATGTCACTG AGCAGAACTC AAATTGTTAA
|
Protein sequence | MKKLINDVQD VLDEQLAGLA KAHPSLTLHQ DPVYVTRADA PVAGKVALLS GGGSGHEPMH CGYIGQGMLS GACPGEIFTS PTPDKIFECA MQVDGGEGVL LIIKNYTGDI LNFETATELL HDSGVKVTTV VIDDDVAVKD SLYTAGRRGV ANTVLIEKLV GAAAERGDSL DACAELGRKL NNQGHSIGIA LGACTVPAAG KPSFTLADNE MEFGVGIHGE PGIDRRSFSS LDQTVDEMFD TLLGNGSYHR TLRFWDYQQG SWQEEQQTKQ PLQSGDRVIA LVNNLGATPL SELYGVYNRL TTRCQQAGLT IERNLIGAYC TSLDMTGFSI TLLKVDDETL ALWDAPVHTP ALNWGKLGES NVTEQNSNC
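
The sequences above are displayed in space-separated blocks of 10 characters, so the spaces must be removed before any analysis. The sketch below is an illustration under that assumption, with hypothetical function names. It joins the blocks and computes the G+C fraction, and for brevity it uses only the first two blocks of the gene sequence rather than the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that splits a sequence into display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence shown above (not the full gene).
sample = clean_sequence("ATGAAAAAAT TGATCAATGA")
print(len(sample), gc_fraction(sample))  # prints: 20 0.2
```

Running the same two functions on the complete gene sequence gives a value that can be compared with the Gene Length and GC content fields of the record.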
|
| |