Gene GWCH70_1954 details

Gene Information

Locus tag: GWCH70_1954
Symbol:
ID: 7979465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 2014171
End bp: 2015700
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 52%
IMG OID: 644798782
Product: carbohydrate kinase, YjeF related protein
Protein accession: YP_002949952
Protein GI: 239827328
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
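
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2014171
end_bp = 2015700

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon
# that is not counted in the protein length.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp)     # 1530
print(protein_length_aa)  # 509
```

Under these assumptions the computed values agree with the Gene Length (1530 bp) and Protein Length (509 aa) fields.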


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0698225
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGCATA TCGTTACAGC CGCAGAAATG TACGAAATCG ACCGTGATAC AATTGAACAA 
ATCGGCATCA GCGCCGATTC CTTGATGGAA AATGCCGGGC AGGCGCTGTT TCATGTGCTG
CGCGAACGCA TTCCGCATTC GGCGCTTGTG GCGGTGCTAG CGGGAACAGG CAACAACGGC
GGTGACGGAT TTGTTGTTGC AAGAATGTTG AAAAGCTGTG GTTATAACGT GGATTTATGG
CTTATTCCGC CGAAAGAAAA GATCAAAGGT GCAGCAAAGA CAGCGCTAAC TACTTATGAA
CGTTCGGGAT ATGACATCAA AGAATATATC GGAAATGAAC AATACTTTGC TGAACAGGTT
CGTCATTATG ATGTAATCAT TGATGCGCTT CTTGGGATCG GCATTCAGGG GGCCGTCCGT
TCTCCGTACA AAGAGATTAT CGAACTTGTC AATCGTTCTA ACGCAATCGT CTATGCCGTT
GATATTCCGA GTGGAACTCC AGCGGATGGG GGAGAGGTGG AAACAGCGGT TCGCGCTGAT
ATGACGATCA CCATTCAATG CCCAAAACTT GGGGCGTACA CGTTCCCGAC GGCTGATTAT
TACGGGGAGC TTCTCGTTGT CGACATTGGC ATTCCGCCTC TTGCCGTGGT GCGGAATGCC
GCGGTCCGCT CGACATGGGA GGAAGATGAT GTCGTACGGA CTTTGCCGAG ACGAAAACAG
TCGTCCCATA AAGGAACATA CGGAAAATTG CTTGTTGTTG GCGGTTCCCG GCCAATGACA
GGCGCGATTA CGTTAACGGC AAAAGCCGCG CTGCGAAGCG GAGCGGGATT ATTGACGATG
GCGGTGCCGG ATGACATTTA TTCCGTCGTC GCCAACCGCG TTCCAGAAGC GATGTACTAT
CCGTGCCCAT CGCATGACGG TTCGTTTTCC GGCGTGATCG ATGTATCGAG GTTGGATATC
GATGCGATCG CGATTGGCCC GGGGATGGGA AGAACGGATG GCGCACGGCA GCTTGTCCAC
ACTTTGTTGC AGCAGCCTGT GCCGATGGTG ATGGATGCGG ATGCGCTGTT TTTCTGGAAC
GAGTATGCTT CACTCGTTCG CGAACGGAAG GATGCGACCG TTGTTACTCC GCACCCTGGA
GAAATGGCGC GCATGCTTGA TCTGTCTATC GATGAAGTTG AACGCGACCG GTTTGGCATT
TCGAAGCAGC TGGCAACGGA GTATGGCATC TATGTGGTGT TGAAAGGGCC TTATACGATT
GTCACAGCAC CAGACGGTTC GCAATACGTC AACACGACAG GAAATCCTGC TTTGGCGAAA
GGCGGAAGCG GCGACGTGCT GACAGGAATG ATTGCCGCGT TTCTCATGCA GCATCACTCC
GCACAAGCAG CCATTAGCAA CGCCGTCTGG GTTCACGGAA AAGCTGCGGA TATGCTTGTG
GAAAACGGAC ATTCTCAATG GGACGTGCTC GCTGGAGATT TGATTGATGG GATTTCGTCT
GTGCTTTCTC ATCTACAGAA ACAACAATAA
 
Protein sequence
MMHIVTAAEM YEIDRDTIEQ IGISADSLME NAGQALFHVL RERIPHSALV AVLAGTGNNG 
GDGFVVARML KSCGYNVDLW LIPPKEKIKG AAKTALTTYE RSGYDIKEYI GNEQYFAEQV
RHYDVIIDAL LGIGIQGAVR SPYKEIIELV NRSNAIVYAV DIPSGTPADG GEVETAVRAD
MTITIQCPKL GAYTFPTADY YGELLVVDIG IPPLAVVRNA AVRSTWEEDD VVRTLPRRKQ
SSHKGTYGKL LVVGGSRPMT GAITLTAKAA LRSGAGLLTM AVPDDIYSVV ANRVPEAMYY
PCPSHDGSFS GVIDVSRLDI DAIAIGPGMG RTDGARQLVH TLLQQPVPMV MDADALFFWN
EYASLVRERK DATVVTPHPG EMARMLDLSI DEVERDRFGI SKQLATEYGI YVVLKGPYTI
VTAPDGSQYV NTTGNPALAK GGSGDVLTGM IAAFLMQHHS AQAAISNAVW VHGKAADMLV
ENGHSQWDVL AGDLIDGISS VLSHLQKQQ
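
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch in plain Python, not the tool that produced this record. It assumes the sequence is given on the coding strand in frame, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two tables differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for 10-base grouping."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
# Both lines start on a codon boundary (60 bases per full line).
first_line = "ATGATGCATA TCGTTACAGC CGCAGAAATG TACGAAATCG ACCGTGATAC AATTGAACAA"
last_line = "GTGCTTTCTC ATCTACAGAA ACAACAATAA"

print(translate(first_line))  # MMHIVTAAEMYEIDRDTIEQ
print(translate(last_line))   # VLSHLQKQQ
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.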