Gene Mthe_0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0221 
SymbolcofG 
ID4461994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp219187 
End bp220167 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content57% 
IMG OID639699228 
ProductFO synthase subunit 1 
Protein accessionYP_842659 
Protein GI116753541 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR03550] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofG subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.906705 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTGCGCGG GGATACAAGC TGAAGCTCGT GGAGCTGAGG CGCTGAGGGC AACATTCTCG 
AGGAATGTAT TCATACCGGT CACCGACCTC TGCAGAAATG CCTGCGGGTA CTGCTCTTTC
AGGCGTGATC CGGACCGCGC CAGGGTGATA TCAAGAAGTG AGGCTCAGAG GTTGATGGAG
CGCGCGCAGA GCGCAGGCTG CTCAGAGGCA CTCTTCTCGA TGGGAGACAG GCCCTGGGAG
GTGCGTGGCG ACCGGTTGGA ACTCCTGGAG TATCTCGTTG AGCTCTGCGA ACTCGCCCTG
GAGATGGGGC TTCTTCCTCA TACCAACGCC GGCATCCTCA CGCGCGAGGA GCTGGAGCTT
CTCGCCCCAT ACAACGCATC AATGGGCCTG ATGCTTGAGA GCACAGCGCA TCTCAAAATC
CATGAGAGAA GCCCGGGAAA GAGACCGGAG GTGCGAATAA GAACAATATC AGACGCAGGC
GCACTCAGGA TACCGTTCAC CACTGGCATA CTGGTTGGAA TAGGCGAAAG CTCTGAGGAT
AGGATCAGAT CGCTTGAGGT TATCGCAGAA CTTCACAGAA GGTATGGCCA CATCCAGGAG
GTCATAATCC AGCCGCTGGA TCCCAAGCCC GGGACCGAGT CGGAGGGAAT GCATCCTCCT
GCCATGACTG ATATGGTGAA GCTCGTCTCA ATTGCCAGGA GGATCCTGCC GCTGGAGATA
TCCATTCAGG TGCCCCCAAA CCTGATGGAT CCAGTTCCGC TGTTGAGAGC CGGAGCGGAT
GATCTAGGTG GAATAAGCCC CGTGACACCG GACTGGATCA ATCCGGAGAG GAGATGGCCT
GAGATCGATG AGCTGAGGGG AGTTGTGCTG GTGGAGAGAT TGCCGGTTTA CCCCAGATAC
GTGAAGCTCG GCTGGTACGG CAGCAGAACA AAAGATCTCA TAAAACGGCT CGCGGATGAG
AGAGGGCTGA GAAGGACCTG A
 
Protein sequence
MCAGIQAEAR GAEALRATFS RNVFIPVTDL CRNACGYCSF RRDPDRARVI SRSEAQRLME 
RAQSAGCSEA LFSMGDRPWE VRGDRLELLE YLVELCELAL EMGLLPHTNA GILTREELEL
LAPYNASMGL MLESTAHLKI HERSPGKRPE VRIRTISDAG ALRIPFTTGI LVGIGESSED
RIRSLEVIAE LHRRYGHIQE VIIQPLDPKP GTESEGMHPP AMTDMVKLVS IARRILPLEI
SIQVPPNLMD PVPLLRAGAD DLGGISPVTP DWINPERRWP EIDELRGVVL VERLPVYPRY
VKLGWYGSRT KDLIKRLADE RGLRRT