Gene Mthe_1091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1091 
Symbol 
ID4463123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1180769 
End bp1182157 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content59% 
IMG OID639700108 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_843514 
Protein GI116754396 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.214114 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTATA TTTCAGCTGA GGAGATGGGC GCGATAGATG CGAACTGCGC GTATCTCGGT 
ATATCCACGC TCCAGCTCAT GGAGAATGCG GGAGCAGCGC TTGCGAATGA GATCAGGGCA
ATCGGTGGAA GGAGAATCGC CATAATCGCC GGCAGGGGGA ACAACGGCGG CGACGCTTTT
GTGGCTGCAC GCCACCTCGA CGATCTCGAT GTGACTGTAT TCCTCATAGG GCGCGCGAGG
GATATATCCA CAGAGGAGGC CAGGAGAAAC TGGGATGTGC TCAGGCGCCT CGAGTTCGAT
CTGAGGGAGA TCAAGGATGT GAGTGAGATA GATCTCAGCG GATACGATGT TGTCGTCGAC
GCCCTCTTCG GCACAGGTGT TCGCGGCCCG ATAAAAGGCC TTGAGGGGGA TATCATAGAT
CTCATTAATT CCTGCGGAAA GCATATCGTA TCGGTCGACG TGCCGAGCGG AATGGGCACC
GGAAAAGAGG TCTCCCCGGA TATCACGGTG ACATTCCACC GCCCGAAGAT CGGGATGAGG
GGCGATTTCA GGGTCGTGAG TATCGGCATA CCGAGAATGG CCGAGTTCCT CGTCGGCCCC
GGTGATCTGA AGCTTCTGGG GAGAAGGGGG CCTGAGAGCC ACAAGGGCGA CAGCGGAAGG
ATACTGGTGA TAGGCGGCGG GCCGTACACG GGCGCCCCAG CGCTCTCTGC GATGGCCGCC
CTCCGCGCGG GCGCGGACAT AGTCACAGTC GCGGCTCCGA AGAGCGCCGC TGATACGATC
TCCTCGTTCT CGCCGAACAT GATCGTCAGG CCTCTGACAT CAGACCGATT GTGCATGGCT
GACATTGATA TCCTGAAGGG CCTGATACCG AGACATGATG TGGTCGTCAT CGGCATGGGT
CTGGGAAGGG ACGAGGAGAC GCTGAAAGCG GTATCGCAGA TACTCCCGCT TTGCGATAGG
GTCGTGATAG ATGCCGACGC GCTACAGCCG GATATGCCGC TGAAGGGGAT AGTGACGCCG
CATGCCGGAG AGTTCAGGCG CATAAGCGGG CTGGATCTCC CGAAGGGGAA GGAGCGGATT
GAGATTGTGA AGAGGTTCGC CCGGGAGATG GGGCTTGTCG TTCTTCTCAA GGGAAGGATG
GATATAATCA CGGATGGGGA GATCGTGAGG GGGAACACCA CAGGAAACCC GGGTATGACT
GTCGGAGGGA CGGGGGATGT TCTTGCAGGC ATAACAGGGG CGTTTTATGC GCGTGCGGAT
GCCTTGAGGG CGGCTGCAGC TGCTGCATTC GTCAACGGCA GGGCCGGGGA TCTCGTCTAC
CGCGAGAGGG ATTTTGGGAT GGTGGCCACG GATCTGATCG ACAAGATTCC CGAGGCGATG
ATGGTCTGA
 
Protein sequence
MRYISAEEMG AIDANCAYLG ISTLQLMENA GAALANEIRA IGGRRIAIIA GRGNNGGDAF 
VAARHLDDLD VTVFLIGRAR DISTEEARRN WDVLRRLEFD LREIKDVSEI DLSGYDVVVD
ALFGTGVRGP IKGLEGDIID LINSCGKHIV SVDVPSGMGT GKEVSPDITV TFHRPKIGMR
GDFRVVSIGI PRMAEFLVGP GDLKLLGRRG PESHKGDSGR ILVIGGGPYT GAPALSAMAA
LRAGADIVTV AAPKSAADTI SSFSPNMIVR PLTSDRLCMA DIDILKGLIP RHDVVVIGMG
LGRDEETLKA VSQILPLCDR VVIDADALQP DMPLKGIVTP HAGEFRRISG LDLPKGKERI
EIVKRFAREM GLVVLLKGRM DIITDGEIVR GNTTGNPGMT VGGTGDVLAG ITGAFYARAD
ALRAAAAAAF VNGRAGDLVY RERDFGMVAT DLIDKIPEAM MV