Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2047 |
Symbol | glk |
ID | 7083807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2315987 |
End bp | 2317006 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643699074 |
Product | glucokinase |
Protein accession | YP_002355691 |
Protein GI | 217970457 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0837] Glucokinase |
TIGRFAM ID | [TIGR00749] glucokinase, proteobacterial type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00612333 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCCA CCCCTGCTCC CTATCCCCGC CTGGTCGCCG ACATCGGCGG CACCAACGCG CGCTTCGCCC TCGTCGAGGC GCCCGGAGCG GCACCGACGC ATCTGCGAGC GCTGCGCTGC GCGGAACACA GCGGCCCGGA GGCCGCGTTG CGCGCCTGGT TGGCGGACAC CGGCGCCCGC CTGCCCGCTT ACGCCGCCTT CGGCATCGCC ACGCCGATCG ACGGCGACGG CGTCGCGATG ACCAATCATC CCTGGCGCTT TTCGATCGGC GCGCTGTGCG GCGCGCTCGG CCTGCGCCGG CTGACCGTGG TGAACGATTT CACCGCGCTC GCGCTCGCGC TGCCCGCGCT CGGCGACGGC GACCTGGTCC GCGTCGGCGG CGGCGAGCCG CGTGCGGGCG CCGCGCGGGC GTTGATCGGC GCCGGCACCG GGCTCGGCGT TTCGGGCCTG CTGCCCGTGC CGGGCGGCTG GGTCCCGCTG CAAGGCGAGG GCGGGCACGT GACGCTGCCG GCCTCTTGCA CACGCGAGGC CGCGGTGGTC GCCTGGCTCG CCGCCCGCCA TGGCCATGTC TCGGCCGAGC GCGTGCTCTC GGGTCCCGGT CTGGTCGTGC TCCACGACAC CTTGCGCGCG CTCGACGGCG AAGCGCGTGT CGAGCGCACG CCGGCGGAGA TCAGCGAACG GGCGCTGGCC GGCGGCTGCC GCCACTGCGT CGAGGCACTC GAGCTCTTCT GCGCGCTGCT CGGCACGGTG GCGGGCGACG TCGCGCTCAC CCTGGGCGCG CGCGGCGGGC TGTATATCGG CGGTGGCATC GTGCCGCGGC TGGGGGATTT CTTCCTGCGC TCCGCGTTCC GCGAACGTTT CGTCGCCAAG GGCCGCTTTC GCCCCTGGCT CGAACGCATC CCGATCTGGG TGGTCGTCGC CCCCCACGCC GCCCTCACCG GCGCCTCGGC GGCACTCGAC AGCGCCATCG AGCTCGGCTT CACCACGCTC GCCGAGCCGC GCGGGCACGT CTCCCCGTAG
|
Protein sequence | MNATPAPYPR LVADIGGTNA RFALVEAPGA APTHLRALRC AEHSGPEAAL RAWLADTGAR LPAYAAFGIA TPIDGDGVAM TNHPWRFSIG ALCGALGLRR LTVVNDFTAL ALALPALGDG DLVRVGGGEP RAGAARALIG AGTGLGVSGL LPVPGGWVPL QGEGGHVTLP ASCTREAAVV AWLAARHGHV SAERVLSGPG LVVLHDTLRA LDGEARVERT PAEISERALA GGCRHCVEAL ELFCALLGTV AGDVALTLGA RGGLYIGGGI VPRLGDFFLR SAFRERFVAK GRFRPWLERI PIWVVVAPHA ALTGASAALD SAIELGFTTL AEPRGHVSP
|
| |