Gene GWCH70_2085 details


Gene Information

Locus tag: GWCH70_2085
Symbol:
ID: 7977329
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 2157864
End bp: 2159048
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 44%
IMG OID: 644798903
Product: galactokinase
Protein accession: YP_002950063
Protein GI: 239827439
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0153] Galactokinase
TIGRFAM ID: [TIGR00131] galactokinase
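
The start, end, and length fields in this record are mutually consistent, which can be confirmed with simple arithmetic. The sketch below is only an illustration: the function names are hypothetical (not part of any database API), and it assumes that coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(2157864, 2159048)
print(length)                     # 1185
print(protein_length_aa(length))  # 394
```

Both values match the Gene Length and Protein Length fields reported for GWCH70_2085.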


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.11245
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCACCA ATTTAATAAA AACGTTTACT GAATTATTTG GAGACGGAAA TGAAGAAATT 
CGTATTTTCT TCGCGCCTGG CCGCGTGAAT TTAATTGGCG AGCATACGGA CTATAACGGC
GGGCATGTGC TGCCGTGCGC TTTGGAAATT GGGACGTATG CGCTTGTGCG AAAAACAGCA
CATCCGTTTA TCCGCTTTTA CTCAAAAAAT TTTCCGGAAA CAGGGATCAT TACCGTATCT
TATGACGACC TATCCTACCA AGACAAGCAC GGATGGGCAA ATTATCCAAA GGGAGTTATT
GCTGCGTTTC AATCGTTTTA TCCGATCGAG ACGGGACTTG ATATTTTGTA TTATGGAACG
ATCCCGAACG GTGCCGGGTT ATCGTCTTCC GCTTCGATTG AATTAGTTAC GGCGGTGATG
ATGAATGAGT TATTCGAGCA GCATATCGAT ATGCTCGAAC TTGTGAAAAT GAGCCAAAAA
GTAGAAAATG AATATGTCGG CGTCAACTGC GGCATTATGG ATCAATTTGC CGTCGGAATG
GGAAAGCGAA ATCATGCCAT TCTCTTAAAT TGCCAAACGC TGGCATACCG CTATATTCCT
GTGGCGTTCA ATCATTGTTC GATTGTCATC GCAAATACGA ATAAAAAGCG CGGTTTGGCC
GATTCAGCGT ATAACGAACG AAGATCGACG TGTGAAGCTG CGCTTTTGAA ATTAAAGGAG
CATCTAAATA TCGAATCGCT TGGCGAGCTG ACAAGCGAGC AATTAGAACA GTATGATCAC
CTTCTTTCTC CAATCGAACA AAAGCGAGCA CGCCATGCTG TGACGGAAAA TGAACGGACG
ATTCAAGCGG CGGACGCATT AGAAAAAGGA GATTTGGCGC GCTTTGGCGA GTTAATGAAA
CAATCCCACA TTTCGCTGCG CGATGATTAT GAAGTGACAG GATTAGAGCT TGATACGCTT
GTTGAAGCGG CGTGGAACCA CGAAGGGACG ATCGGCGCCC GTATGACTGG AGCCGGTTTT
GGCGGCTGTA CCGTAAATAT TGTAAAAGAT GAGTTCATTC CTTCTTTTAT TGAGCAAGTG
GGGAATGAAT ATGCGAAAAA AATTGGCTAT GAAGCTAGTT TTTATGTTGT GAAAATTGGT
GATGGAGCGA AAGAAATAAC GGGAGAAAAG GAGATGAGCG TATGA
 
Protein sequence
MITNLIKTFT ELFGDGNEEI RIFFAPGRVN LIGEHTDYNG GHVLPCALEI GTYALVRKTA 
HPFIRFYSKN FPETGIITVS YDDLSYQDKH GWANYPKGVI AAFQSFYPIE TGLDILYYGT
IPNGAGLSSS ASIELVTAVM MNELFEQHID MLELVKMSQK VENEYVGVNC GIMDQFAVGM
GKRNHAILLN CQTLAYRYIP VAFNHCSIVI ANTNKKRGLA DSAYNERRST CEAALLKLKE
HLNIESLGEL TSEQLEQYDH LLSPIEQKRA RHAVTENERT IQAADALEKG DLARFGELMK
QSHISLRDDY EVTGLELDTL VEAAWNHEGT IGARMTGAGF GGCTVNIVKD EFIPSFIEQV
GNEYAKKIGY EASFYVVKIG DGAKEITGEK EMSV
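
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first line of the gene sequence. The following is a minimal sketch, not the database's own method: the helper names are hypothetical, and it assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables are understood to differ only in permitted start codons). The sequence is grouped in blocks of ten, so whitespace is removed first.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGATCACCA ATTTAATAAA AACGTTTACT GAATTATTTG GAGACGGAAA TGAAGAAATT"
print(translate(first_line))  # MITNLIKTFTELFGDGNEEI
```

The output corresponds to the first two blocks of the protein sequence (MITNLIKTFT ELFGDGNEEI).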