Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2570 |
Symbol | |
ID | 7874009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2775247 |
End bp | 2776788 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643699492 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_002889549 |
Protein GI | 237653235 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0989158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAGCC CCCTGCGTCC GATCTACCCC GTCGCCGGCA TCCGCGAGAT CGAGGACAAA CTCATCCCGA ACGCCCGCCC GCCGCTGATG GAGCGCGCCG GGCGCGCCGC CGCGCAGGAT GCGGTACGGC TGATCATGGA TCGCCCCGGG CCGATCCTGA TCGCCTGCGG CCCTGGCAAC AATGGCGGCG ACGGCTTCGT GATGGCGCGC CAGCTCGCCC AGGCCGGGCG CGAGGTGGTG GTCGCGTTCT GCAGCGCGGC CGAGCGTCTG CCGGCCGAGG CGGCCAAGGC GCACGCCGAC TACCTCGCCG CCGGCGGCAG CATCGTCTCC GACCTCCCCG CCGCGCCGGC CAACGGCTGG GCCCTTGTGG TCGACGCGCT CTTCGGGATC GGCCTCGGAC GCCCGATCGA AGACCGCTAC GCGAGCTGGA TCCACACCCT CAACGCCCAA CCCTGCCCGC GCATGGCGCT CGACGTCCCG AGCGGACTCG ACGCCGACAC CGGCAGCCCG CTCGGCGCCA CCTTCCGCGC CACCCACACC ACCACCTTCA TCGCCCTCAA ACCCGGCCTG CTCACCAACG ACGGCCCCGA CCACTGCGGC GAGATCAGCG TGCAGCGCAT CGAGATCGAC GCCCCTGCCT GGCTGCCGGC GCGCGGTTAC GCGATCGCAT CCTCGCTGTT CCGCCATCTG CTGCAGCCGC GCCCGCGCAA CACCCACAAG GGTCTCTACG GCGACGCCGC CATCCTCGGC GGCAACGCCG GCATGGTCGG CGCCGCGCTA CTCGCCGGCC GCGCCGCGCT CTGGCTGGGG ACGGGAAGGG TCTATGTCGG CCTGCTCGAC CCCGCCGGCC CCGCGGTCGA CTTCGCCCAC CCCGAGCTCA TGCTGCGTCG CGCCGAGACC CTGCCCGAGC GCCTCAGCGC GCTCGCCATC GGCCCCGGCC TGGGCACCCA GGGCGCGTCC GCCAACGTGC TCGCCGACGC CCTCGCGCGC CCGATCCCGC TGCTCCTCGA CGCCGACGCG CTCAACCTCC TCTCCGCCGA TGCGGCGCTG CGCCGGGTGT TGTGCACGCG CGGCGCAGCC ACCGTGCTCA CCCCCCACCC CGCAGAGGCC GCCCGCCTGC TCGGCAGCGA CACCGCAAGC GTGCAGGCCG ACCGGCTGCG TGCCGCGCTC GAACTCGCCC GCCGCTACCG TGCCTTGGTC GTGCTCAAGG GCTGTGGCAG CATCGTCGCC ACACCCGACG AGCGCTGGTT CATCAACGGC AGCGGCCACT CCGGGATGGC AAGCGCGGGC ATGGGAGACG TGCTGAGCGG CCTCGTCACC GGCCTGCTCG CGCAGGGCTG GCCGCCCGAG TCCGCGCTCA TCGCCGGCGT ACACCTGCAC GGTGCCGCAG CCGACCGACT CGCACGCGAA GGCATCGGTC CGGTCGGGTT GAGCGCGAGC GAGACCATCG ACGCCGCCCG TGGCGTCTTC AACGGCTGGC TGATCGAGGC GCAGCGCGAG CCGCGCAACA CCGGCCCCCA TTCCCCGCGG ACCGGGCGCT GA
|
Protein sequence | MFSPLRPIYP VAGIREIEDK LIPNARPPLM ERAGRAAAQD AVRLIMDRPG PILIACGPGN NGGDGFVMAR QLAQAGREVV VAFCSAAERL PAEAAKAHAD YLAAGGSIVS DLPAAPANGW ALVVDALFGI GLGRPIEDRY ASWIHTLNAQ PCPRMALDVP SGLDADTGSP LGATFRATHT TTFIALKPGL LTNDGPDHCG EISVQRIEID APAWLPARGY AIASSLFRHL LQPRPRNTHK GLYGDAAILG GNAGMVGAAL LAGRAALWLG TGRVYVGLLD PAGPAVDFAH PELMLRRAET LPERLSALAI GPGLGTQGAS ANVLADALAR PIPLLLDADA LNLLSADAAL RRVLCTRGAA TVLTPHPAEA ARLLGSDTAS VQADRLRAAL ELARRYRALV VLKGCGSIVA TPDERWFING SGHSGMASAG MGDVLSGLVT GLLAQGWPPE SALIAGVHLH GAAADRLARE GIGPVGLSAS ETIDAARGVF NGWLIEAQRE PRNTGPHSPR TGR
|
| |