Gene Moth_0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0221 
Symbol 
ID3831372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp217500 
End bp218915 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content61% 
IMG OID637828157 
Productphosphoglucomutase 
Protein accessionYP_429099 
Protein GI83589090 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1109] Phosphomannomutase 
TIGRFAM ID[TIGR01132] phosphoglucomutase, alpha-D-glucose phosphate-specific 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATTA AATTTGGGAC CGACGGCTGG CGCGCCGTCA TTGCCGACGA GTTCACCTTC 
GCCAATGTGC GCTTGGTCAC CCAGGCCACG GCCAATTACC TGCTCCGGGA GGCCGGCAGC
GGGAAGATCA TTATCGGTTA CGATAACCGT TTCCTGGCCC CGGAGTTTGC CCGGGCTGTG
GCCGAGGTCC TGACTGCCAG CGGTTTTACC GTCTACCTGC CGTCCCGGGC GGTGCCCACA
CCGGTAACGG CCTGGGCCAT TAAGCATTAC CAGGCCATGG GTGCGTTAAT GCTTACCGCC
AGTCATAATC CTCCGGAATA CTGCGGCTTG AAGTTTATCC CCGAATATGC CGGGCCAGCG
GTGCCTGCCA TTACCTCCGC CATTGAAAAA GAAATTGCTG CCGTCATAAA TGGGGGAGAA
GTGAAGACCC TTAACCTGGA TGAAGCCCGG GGCCGGGGCC TGGTCCGGGA ATTGGAACCG
GAGGCCGATT ACCGGGAGTA CCTGCACGGG CTCATTGACG TTGAGGCCCT CCGGAAGGCC
GGTTTGAAGG TCGTAGTCGA CCCCCTCTAT GGAGCTGGTA TAGGCTACCT GGAGGATTTC
CTGCGAGGGG CCGGCTGCCA GGTGCAGGCC ATACATAATT ACCGGGATCC CCTGTTTGGT
GGCGACTTAC CGGATCCCAG CGCCCGGGGC CTGGAGGAAC TCAGCCGGCG GGTCCGGGAG
ACAGGTGCCC ATCTGGGGCT GGCCCTGGAT GGCGATGCCG ACCGCTTCGG GGTAGTAGAC
GGGGACGGTA CCTACCTGAC GGCCAACCAG GTCCTCTACC TGGTCCTGGC CCATTTAATC
ATGGACCGCC ATTACCGTGG TCCGGTAGCC AGGACGGTAG CCACCACCCA TAACCTGGAC
CGCCTGGCCA GGGCCCACGA CCTGGAGATA ATTGAAACCC CGGTCGGCTT CAAGTATATA
GGAGAGGCCC TGCGGGAAAA GGGCTGTATC CTGGGGGGAG AAGAGAGCGG GGGCTTAAGC
ATCCGGGGGC ATATTCCGGA GAAGGACGGC ATCCTGGCGA CGGCCCTGGT GGCTGAACTG
CGGGCGGTCC GGGGCCGGAG CCTGGGGGAG ATTTTGGCGG ATTTGCATTC CAGTTATGGC
CATCTGGTCA ACCAGCGCCT GGATATCAAG GTAGACCCGG CTACTAAAGA AAGGGTGCTG
CAGGAATTAC CGGATTTTGC CCCCGCCAAG GTAGCGGGCA TACCGGTAAC CGGGCGTTTG
ACGGTAGACG GGGTAAAATT GACCCTGGCC GACGGCAGCT GGGTTTTACT TAGACCTTCC
GGTACGGAAC CCCTCCTGCG TTTATATGCG GAGGCGCCCG ACGCCGGGCG TCTCCGCCTG
TTGCAAAAAG AAATAACCAC TGCCCTCAGG ATTTAA
 
Protein sequence
MAIKFGTDGW RAVIADEFTF ANVRLVTQAT ANYLLREAGS GKIIIGYDNR FLAPEFARAV 
AEVLTASGFT VYLPSRAVPT PVTAWAIKHY QAMGALMLTA SHNPPEYCGL KFIPEYAGPA
VPAITSAIEK EIAAVINGGE VKTLNLDEAR GRGLVRELEP EADYREYLHG LIDVEALRKA
GLKVVVDPLY GAGIGYLEDF LRGAGCQVQA IHNYRDPLFG GDLPDPSARG LEELSRRVRE
TGAHLGLALD GDADRFGVVD GDGTYLTANQ VLYLVLAHLI MDRHYRGPVA RTVATTHNLD
RLARAHDLEI IETPVGFKYI GEALREKGCI LGGEESGGLS IRGHIPEKDG ILATALVAEL
RAVRGRSLGE ILADLHSSYG HLVNQRLDIK VDPATKERVL QELPDFAPAK VAGIPVTGRL
TVDGVKLTLA DGSWVLLRPS GTEPLLRLYA EAPDAGRLRL LQKEITTALR I