Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1954 |
Symbol | |
ID | 7979465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2014171 |
End bp | 2015700 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644798782 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_002949952 |
Protein GI | 239827328 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0698225 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCATA TCGTTACAGC CGCAGAAATG TACGAAATCG ACCGTGATAC AATTGAACAA ATCGGCATCA GCGCCGATTC CTTGATGGAA AATGCCGGGC AGGCGCTGTT TCATGTGCTG CGCGAACGCA TTCCGCATTC GGCGCTTGTG GCGGTGCTAG CGGGAACAGG CAACAACGGC GGTGACGGAT TTGTTGTTGC AAGAATGTTG AAAAGCTGTG GTTATAACGT GGATTTATGG CTTATTCCGC CGAAAGAAAA GATCAAAGGT GCAGCAAAGA CAGCGCTAAC TACTTATGAA CGTTCGGGAT ATGACATCAA AGAATATATC GGAAATGAAC AATACTTTGC TGAACAGGTT CGTCATTATG ATGTAATCAT TGATGCGCTT CTTGGGATCG GCATTCAGGG GGCCGTCCGT TCTCCGTACA AAGAGATTAT CGAACTTGTC AATCGTTCTA ACGCAATCGT CTATGCCGTT GATATTCCGA GTGGAACTCC AGCGGATGGG GGAGAGGTGG AAACAGCGGT TCGCGCTGAT ATGACGATCA CCATTCAATG CCCAAAACTT GGGGCGTACA CGTTCCCGAC GGCTGATTAT TACGGGGAGC TTCTCGTTGT CGACATTGGC ATTCCGCCTC TTGCCGTGGT GCGGAATGCC GCGGTCCGCT CGACATGGGA GGAAGATGAT GTCGTACGGA CTTTGCCGAG ACGAAAACAG TCGTCCCATA AAGGAACATA CGGAAAATTG CTTGTTGTTG GCGGTTCCCG GCCAATGACA GGCGCGATTA CGTTAACGGC AAAAGCCGCG CTGCGAAGCG GAGCGGGATT ATTGACGATG GCGGTGCCGG ATGACATTTA TTCCGTCGTC GCCAACCGCG TTCCAGAAGC GATGTACTAT CCGTGCCCAT CGCATGACGG TTCGTTTTCC GGCGTGATCG ATGTATCGAG GTTGGATATC GATGCGATCG CGATTGGCCC GGGGATGGGA AGAACGGATG GCGCACGGCA GCTTGTCCAC ACTTTGTTGC AGCAGCCTGT GCCGATGGTG ATGGATGCGG ATGCGCTGTT TTTCTGGAAC GAGTATGCTT CACTCGTTCG CGAACGGAAG GATGCGACCG TTGTTACTCC GCACCCTGGA GAAATGGCGC GCATGCTTGA TCTGTCTATC GATGAAGTTG AACGCGACCG GTTTGGCATT TCGAAGCAGC TGGCAACGGA GTATGGCATC TATGTGGTGT TGAAAGGGCC TTATACGATT GTCACAGCAC CAGACGGTTC GCAATACGTC AACACGACAG GAAATCCTGC TTTGGCGAAA GGCGGAAGCG GCGACGTGCT GACAGGAATG ATTGCCGCGT TTCTCATGCA GCATCACTCC GCACAAGCAG CCATTAGCAA CGCCGTCTGG GTTCACGGAA AAGCTGCGGA TATGCTTGTG GAAAACGGAC ATTCTCAATG GGACGTGCTC GCTGGAGATT TGATTGATGG GATTTCGTCT GTGCTTTCTC ATCTACAGAA ACAACAATAA
|
Protein sequence | MMHIVTAAEM YEIDRDTIEQ IGISADSLME NAGQALFHVL RERIPHSALV AVLAGTGNNG GDGFVVARML KSCGYNVDLW LIPPKEKIKG AAKTALTTYE RSGYDIKEYI GNEQYFAEQV RHYDVIIDAL LGIGIQGAVR SPYKEIIELV NRSNAIVYAV DIPSGTPADG GEVETAVRAD MTITIQCPKL GAYTFPTADY YGELLVVDIG IPPLAVVRNA AVRSTWEEDD VVRTLPRRKQ SSHKGTYGKL LVVGGSRPMT GAITLTAKAA LRSGAGLLTM AVPDDIYSVV ANRVPEAMYY PCPSHDGSFS GVIDVSRLDI DAIAIGPGMG RTDGARQLVH TLLQQPVPMV MDADALFFWN EYASLVRERK DATVVTPHPG EMARMLDLSI DEVERDRFGI SKQLATEYGI YVVLKGPYTI VTAPDGSQYV NTTGNPALAK GGSGDVLTGM IAAFLMQHHS AQAAISNAVW VHGKAADMLV ENGHSQWDVL AGDLIDGISS VLSHLQKQQ
|
| |