Gene Moth_0832 details

Gene Information

Locus tag: Moth_0832
Symbol:
ID: 3831529
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 862831
End bp: 863778
Gene Length: 948 bp
Protein Length: 315 aa
Translation table: 11
GC content: 58%
IMG OID: 637828762
Product: glucokinase
Protein accession: YP_429692
Protein GI: 83589683
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID: [TIGR00744] ROK family protein (putative glucokinase)
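
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(862831, 863778)
print(length)                     # 948
print(protein_length_aa(length))  # 315
```

Both values agree with the Gene Length and Protein Length fields reported in the record.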


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.618047
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTGCCTTT TGGGTATCGA TCTCGGCGGT ACATCAATTA AGGCCGGGCT GGTGGATATA 
AACGGAAAGA TTCTGAAAAA AGGACAGGTG CCTACGGGGG CCGGTGAAGG TACAACCGCA
GTTTTAAAGC GCATCAAGAA CCTGGCCCGG GACCTGGCCG GTGAACAGGG TCTGGCTCTG
GGTGAACTCG AAGGGATCGG TATCGGTATT CCCGGATCGG TTGATGTGGC CAGGGGGTTG
GTCCACCTGG CTCCCAATCT CTTCTGGCGC GATTTCTCCC TCCGGGATGA ACTGGCAGCC
TTACTGGATC TACCTGTGGC TATTGAAAAT GACGCCCACG TTGCCGCCCT GGGGGAGATG
TGGCAGGGAG CCGGCCGGGG ATATACCTCC CTCCTCATGG TAACCATTGG CACGGGTATC
GGCTCAGGAC TGATTATTGA CGGTCGCGTC CATCACGGCC TTTTCGGTTA TGGAGCGGAG
ATGGGGCATA TAAAAATGGT CTGTGACGGC CGGCAGTGTC ACTGTGGCGG CCATGGCTGC
CTGGAGACCC TGGCTTCGGC CACAGCTATG GTAAAGTCTT TCCGGGAGTA CCTGGCAGCG
GGTTACCCAT CCATGTTATC GGATAGGCCC GAACTTGGGG CGAAGGAGAT CCTGGCAGCC
GCCGCCGGGG GCGACGAACT GGCTGGCCGC GTCATCGACG AGGCCGCCTG CTATTTGGGA
ACAGCCCTGG CCAATGCCGT CCTCCTGGTG GGGCCGGAAG CAATAATTAT CGGGGGGGGA
CCGGCCCAGG CCGGGGAGGT AATCTTAGAT CCTATCCGCA AACACCTGGC TGCAGCCATG
GGAACCTGGC AGCTTAAGCA AGTACCGGTC CTACAGGCTG CCCTGGGTAA TGATGCAGGT
ATTATCGGCG CCGCTTATCT GGCTATGAAG ACGAGCTCAC AATATTGA
 
Protein sequence
MCLLGIDLGG TSIKAGLVDI NGKILKKGQV PTGAGEGTTA VLKRIKNLAR DLAGEQGLAL 
GELEGIGIGI PGSVDVARGL VHLAPNLFWR DFSLRDELAA LLDLPVAIEN DAHVAALGEM
WQGAGRGYTS LLMVTIGTGI GSGLIIDGRV HHGLFGYGAE MGHIKMVCDG RQCHCGGHGC
LETLASATAM VKSFREYLAA GYPSMLSDRP ELGAKEILAA AAGGDELAGR VIDEAACYLG
TALANAVLLV GPEAIIIGGG PAQAGEVILD PIRKHLAAAM GTWQLKQVPV LQAALGNDAG
IIGAAYLAMK TSSQY
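
The protein sequence follows from the gene sequence under translation table 11. The gene begins with TTG, which normally encodes leucine but is read as methionine when it serves as the start codon. The sketch below is an illustration under stated assumptions, not the method used by the source database: the codon table is the standard one in NCBI ordering, the set of start codons is the one I understand NCBI to list for table 11, and all names are hypothetical.

```python
BASES = "TCAG"
# Standard amino acid assignments in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed initiation codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an alternative start codon is read as methionine
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate_cds("TTGTGCCTTT TGGGTATCGA TCTCGGCGGT"))  # MCLLGIDLGG
```

The output matches the first ten residues of the protein sequence above.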