Gene Mthe_0284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0284 
Symbol 
ID4463263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp280524 
End bp281600 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content55% 
IMG OID639699290 
Productradical SAM domain-containing protein 
Protein accessionYP_842721 
Protein GI116753603 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily
[TIGR03551] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofH subunit 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGCGC TGAGTGATTC GACCTGTGAA GACATGCTGA ACGATTCCGG GGATCGTGCG 
GAGCTGAATC GCGATCTCAT GCGGTTCCTC TGGCGAAACC CGATGAGGCT ATTCTCCCTC
TCCCGCAGGC TCCAGGAGAG CTGCTGCGGG AGGGAGGTGA CATACGTCAT AAACAGGAAC
ATCAACTTCA CCAATATCTG TGTAGGAAGA TGCAGGTTCT GCGCGTTCAG GAAGAGCGAT
GGTTACATCC TGACACATGA GGAGATCGTT CAGAGGGCGA AGGAGGCCGA GCAGCTGGGC
GCCACCGAGA TATGCCTCCA GGGCGGTCTG GCGCCAGGGC TCACTGTGGA AGACTACTGT
GAAATTCTTG AAGTACTCAA ATCAAACTTC CCCCGGATGC ATCTCCACGC CTACTCTCCA
ATGGAGGTGA TGCACATGTC GAGATGCTCC GGGGTAGATG TCCATGAGGC CCTTCGCTCC
TTGAGGGATT CGGGTCTCGA TTCGATGCCC GGGACGGCAG CAGAGCTGCT GGTCGATTCT
GTGAGAAAAG TGATATGTCC GGAGAAGCTC AGCACCTCAG AATGGTCTTT CATCATAAAA
GCTGCGCACG CGATGGGCAT ACCAACGACC TCGACGATGC TCTACGGTCA CATCGAGAGC
TTCGAGGACA GGCTGAGTCA TTTAGAGGTC ATCAGATCGA TACAGCTGGA GACCGGAGGG
TTCACGGAGT TTGTGCTTCT CCCATTCGTC CCCGGGAACA CAGAGCTCGG GAGGATCTCA
TCGCCTCCAG ACATCCTTGA GAACCTGAAG ATGCACGCGC TCGCAAGAGT TGCGCTTCAC
CCGTACATAA CAAACATCCA GGCGAGCTGG GTGAAGCTCG GAAGGGAGGT TGCAGGCGCA
GCTATAGAAT GGGGCGCAAA CGATCTCGGC GGGACGCTCA TGGAGGAGAA CATATCGAGA
TGCGCAGGCT CAAAAGAGGG ACAGTACATG TCTCCGGATG AGTTTCAGGA TCTCATAAAG
AAACACGGGA GAGTACCGAC GCAGAGGGAC ACGCTGTACA GAAGGATACT CCGATAG
 
Protein sequence
MKALSDSTCE DMLNDSGDRA ELNRDLMRFL WRNPMRLFSL SRRLQESCCG REVTYVINRN 
INFTNICVGR CRFCAFRKSD GYILTHEEIV QRAKEAEQLG ATEICLQGGL APGLTVEDYC
EILEVLKSNF PRMHLHAYSP MEVMHMSRCS GVDVHEALRS LRDSGLDSMP GTAAELLVDS
VRKVICPEKL STSEWSFIIK AAHAMGIPTT STMLYGHIES FEDRLSHLEV IRSIQLETGG
FTEFVLLPFV PGNTELGRIS SPPDILENLK MHALARVALH PYITNIQASW VKLGREVAGA
AIEWGANDLG GTLMEENISR CAGSKEGQYM SPDEFQDLIK KHGRVPTQRD TLYRRILR