Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1342 |
Symbol | |
ID | 7978145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1410225 |
End bp | 1411223 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644798279 |
Product | dihydroxyacetone kinase, DhaK subunit |
Protein accession | YP_002949452 |
Protein GI | 239826828 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2376] Dihydroxyacetone kinase |
TIGRFAM ID | [TIGR02363] dihydroxyacetone kinase, DhaK subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00155435 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAA AACTAATCAA CAACCCAAAT CAAGTTGTCA ATGATATGTT GGAAGGAATG GTTGCCGCCT ATCGTGATCG GTTGAGAAGG CTGCCGGGTA CGAATGTGAT TGTGAGAAAC GATTCCCCAG TAAAAGGAAA AGTCGGAATC GTGAGCGGTG GCGGCAGCGG ACATGAGCCG GCGCATGCGG GCTATGTCGG AAAAGGAATG TTAGATGCGG CGGTATGCGG GGAAGTGTTT ACTTCTCCGA CGCCTGACCA GGTGCTTGAG GCGATCAAAG CGGTCGATAG CGGAAAAGGC GTATTGCTTA TTATCAAAAA TTATACGGGA GACGTCATGA ATTTTGAAAT GGCCGCAGAG TTGGCGGAAG CAGAAGGAAT TCGTGTCGCG AAAGTCATTG TAAATGACGA CGTAGCGGTG GAAGATAGCA CGTTTACAAC AGGACGGCGC GGCATTGCGG GAACGGTGTT TGTTCATAAA ATCGCAGGGG CGCTGGCGGA GCGCGGCGCA TCGCTTGAAG AAGTAGAAGC GGTAGCGAAG AAGGTGGTGC AAAACGTCCG TTCCATGGGA ATGGCACTTA CTCCGTGCAC CGTGCCGGCA GCGGGGAAAC CAGGCTTTGA ACTTGGCGAA AATGAAATTG AAGTCGGCAT CGGCATTCAC GGAGAACCGG GAATTGAAAA AACAACGATC AAACCAGCAG ACGAAATTGC GGCAACGCTG CTTGTCAAAA TTTTCGATGA TATGAAACTA GAAAAAGGCG ATCGCGTCGC AGTGATGATT AACGGACTTG GCGCGACACC GTTAATGGAG CTATATATTG TGAATAAAAA AGTATCAGAA ATGTTGAAGG AAAAACAAAT TCACGTCCAT GAAACATTTG TTGGAGAATA TATGACCTCG CTAGAAATGG CGGGATGCTC GATATCGCTA TTGAAATTGG ATGATTCCTT AATCGAATTG TTAGATGCGC CTGCCGATAC GATTGCGTTG AAAAAATAA
|
Protein sequence | MMKKLINNPN QVVNDMLEGM VAAYRDRLRR LPGTNVIVRN DSPVKGKVGI VSGGGSGHEP AHAGYVGKGM LDAAVCGEVF TSPTPDQVLE AIKAVDSGKG VLLIIKNYTG DVMNFEMAAE LAEAEGIRVA KVIVNDDVAV EDSTFTTGRR GIAGTVFVHK IAGALAERGA SLEEVEAVAK KVVQNVRSMG MALTPCTVPA AGKPGFELGE NEIEVGIGIH GEPGIEKTTI KPADEIAATL LVKIFDDMKL EKGDRVAVMI NGLGATPLME LYIVNKKVSE MLKEKQIHVH ETFVGEYMTS LEMAGCSISL LKLDDSLIEL LDAPADTIAL KK
|
| |