Gene GWCH70_1342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1342 
Symbol 
ID7978145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1410225 
End bp1411223 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content48% 
IMG OID644798279 
Productdihydroxyacetone kinase, DhaK subunit 
Protein accessionYP_002949452 
Protein GI239826828 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2376] Dihydroxyacetone kinase 
TIGRFAM ID[TIGR02363] dihydroxyacetone kinase, DhaK subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00155435 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAA AACTAATCAA CAACCCAAAT CAAGTTGTCA ATGATATGTT GGAAGGAATG 
GTTGCCGCCT ATCGTGATCG GTTGAGAAGG CTGCCGGGTA CGAATGTGAT TGTGAGAAAC
GATTCCCCAG TAAAAGGAAA AGTCGGAATC GTGAGCGGTG GCGGCAGCGG ACATGAGCCG
GCGCATGCGG GCTATGTCGG AAAAGGAATG TTAGATGCGG CGGTATGCGG GGAAGTGTTT
ACTTCTCCGA CGCCTGACCA GGTGCTTGAG GCGATCAAAG CGGTCGATAG CGGAAAAGGC
GTATTGCTTA TTATCAAAAA TTATACGGGA GACGTCATGA ATTTTGAAAT GGCCGCAGAG
TTGGCGGAAG CAGAAGGAAT TCGTGTCGCG AAAGTCATTG TAAATGACGA CGTAGCGGTG
GAAGATAGCA CGTTTACAAC AGGACGGCGC GGCATTGCGG GAACGGTGTT TGTTCATAAA
ATCGCAGGGG CGCTGGCGGA GCGCGGCGCA TCGCTTGAAG AAGTAGAAGC GGTAGCGAAG
AAGGTGGTGC AAAACGTCCG TTCCATGGGA ATGGCACTTA CTCCGTGCAC CGTGCCGGCA
GCGGGGAAAC CAGGCTTTGA ACTTGGCGAA AATGAAATTG AAGTCGGCAT CGGCATTCAC
GGAGAACCGG GAATTGAAAA AACAACGATC AAACCAGCAG ACGAAATTGC GGCAACGCTG
CTTGTCAAAA TTTTCGATGA TATGAAACTA GAAAAAGGCG ATCGCGTCGC AGTGATGATT
AACGGACTTG GCGCGACACC GTTAATGGAG CTATATATTG TGAATAAAAA AGTATCAGAA
ATGTTGAAGG AAAAACAAAT TCACGTCCAT GAAACATTTG TTGGAGAATA TATGACCTCG
CTAGAAATGG CGGGATGCTC GATATCGCTA TTGAAATTGG ATGATTCCTT AATCGAATTG
TTAGATGCGC CTGCCGATAC GATTGCGTTG AAAAAATAA
 
Protein sequence
MMKKLINNPN QVVNDMLEGM VAAYRDRLRR LPGTNVIVRN DSPVKGKVGI VSGGGSGHEP 
AHAGYVGKGM LDAAVCGEVF TSPTPDQVLE AIKAVDSGKG VLLIIKNYTG DVMNFEMAAE
LAEAEGIRVA KVIVNDDVAV EDSTFTTGRR GIAGTVFVHK IAGALAERGA SLEEVEAVAK
KVVQNVRSMG MALTPCTVPA AGKPGFELGE NEIEVGIGIH GEPGIEKTTI KPADEIAATL
LVKIFDDMKL EKGDRVAVMI NGLGATPLME LYIVNKKVSE MLKEKQIHVH ETFVGEYMTS
LEMAGCSISL LKLDDSLIEL LDAPADTIAL KK