Gene Tmz1t_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2047 
Symbolglk 
ID7083807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2315987 
End bp2317006 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content75% 
IMG OID643699074 
Productglucokinase 
Protein accessionYP_002355691 
Protein GI217970457 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0837] Glucokinase 
TIGRFAM ID[TIGR00749] glucokinase, proteobacterial type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00612333 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCCA CCCCTGCTCC CTATCCCCGC CTGGTCGCCG ACATCGGCGG CACCAACGCG 
CGCTTCGCCC TCGTCGAGGC GCCCGGAGCG GCACCGACGC ATCTGCGAGC GCTGCGCTGC
GCGGAACACA GCGGCCCGGA GGCCGCGTTG CGCGCCTGGT TGGCGGACAC CGGCGCCCGC
CTGCCCGCTT ACGCCGCCTT CGGCATCGCC ACGCCGATCG ACGGCGACGG CGTCGCGATG
ACCAATCATC CCTGGCGCTT TTCGATCGGC GCGCTGTGCG GCGCGCTCGG CCTGCGCCGG
CTGACCGTGG TGAACGATTT CACCGCGCTC GCGCTCGCGC TGCCCGCGCT CGGCGACGGC
GACCTGGTCC GCGTCGGCGG CGGCGAGCCG CGTGCGGGCG CCGCGCGGGC GTTGATCGGC
GCCGGCACCG GGCTCGGCGT TTCGGGCCTG CTGCCCGTGC CGGGCGGCTG GGTCCCGCTG
CAAGGCGAGG GCGGGCACGT GACGCTGCCG GCCTCTTGCA CACGCGAGGC CGCGGTGGTC
GCCTGGCTCG CCGCCCGCCA TGGCCATGTC TCGGCCGAGC GCGTGCTCTC GGGTCCCGGT
CTGGTCGTGC TCCACGACAC CTTGCGCGCG CTCGACGGCG AAGCGCGTGT CGAGCGCACG
CCGGCGGAGA TCAGCGAACG GGCGCTGGCC GGCGGCTGCC GCCACTGCGT CGAGGCACTC
GAGCTCTTCT GCGCGCTGCT CGGCACGGTG GCGGGCGACG TCGCGCTCAC CCTGGGCGCG
CGCGGCGGGC TGTATATCGG CGGTGGCATC GTGCCGCGGC TGGGGGATTT CTTCCTGCGC
TCCGCGTTCC GCGAACGTTT CGTCGCCAAG GGCCGCTTTC GCCCCTGGCT CGAACGCATC
CCGATCTGGG TGGTCGTCGC CCCCCACGCC GCCCTCACCG GCGCCTCGGC GGCACTCGAC
AGCGCCATCG AGCTCGGCTT CACCACGCTC GCCGAGCCGC GCGGGCACGT CTCCCCGTAG
 
Protein sequence
MNATPAPYPR LVADIGGTNA RFALVEAPGA APTHLRALRC AEHSGPEAAL RAWLADTGAR 
LPAYAAFGIA TPIDGDGVAM TNHPWRFSIG ALCGALGLRR LTVVNDFTAL ALALPALGDG
DLVRVGGGEP RAGAARALIG AGTGLGVSGL LPVPGGWVPL QGEGGHVTLP ASCTREAAVV
AWLAARHGHV SAERVLSGPG LVVLHDTLRA LDGEARVERT PAEISERALA GGCRHCVEAL
ELFCALLGTV AGDVALTLGA RGGLYIGGGI VPRLGDFFLR SAFRERFVAK GRFRPWLERI
PIWVVVAPHA ALTGASAALD SAIELGFTTL AEPRGHVSP