Gene Mthe_0251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0251 
Symbol 
ID4462074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp247127 
End bp248245 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content54% 
IMG OID639699257 
Productradical SAM domain-containing protein 
Protein accessionYP_842688 
Protein GI116753570 
COG category[R] General function prediction only 
COG ID[COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.996674 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACGGT CGATCCTGGC ACGTGGTGCG AGCTTCGATC TCGAGGACGT TGATATTAAG 
GTTCAGGCGC TCAGCAGAAG CACCAGATAC GACAGGTGCT GCCACAGACG GAATGATGAC
TCGCTGATCT ACAGTGCCTC AGGACGCAAT GGCTGCCATG TGCGCCTCTT CAAGACTCTC
TTCACCAACG AGTGCTATCA TCAATGTGGC TACTGCCCCA ATGCAGGGAC ATCAAATGGA
TGCTCCTACA CTCCAGAGGA GCTTTCCAAT ATAGTGAGCA GTCTGAGAAG AGAAGGGCTC
ATTGACGGCC TCTTCCTGAG CTCCGGCGCG GGCAGGGATG AGGATTCCAC GATGGAGGAG
ATGCTCGAGA CTGTGCGGAT TCTCAGGGAG AGGCACGGGT TCTCAGGATA CATCCACCTC
AAGATTCTCC CTGGGACATC CAGGCACCTG ATAGAAGAGG CTGTGGAGCT CGCTGACAGG
GTGAGCATCA ACATCGAGGC ACCATCGATG GATGTGATGC ATGAGCTCAG CCCGACGAAG
GATTACGAAA GGGACATACT GGACAGGCAG ATGTACATAC GTGACATCCT GGCGAGACGC
TCCAGAGGCT CCCAGACGAC ACAGCTTGTG GTCGGCGCAG CAGGTGAGAC AGACCTCGAA
ATATTCCAGA GGGTTGTAAA GGAGTACAGG GAGATTGGGG TGAGCAGGGT CTATTACAGC
GCATTTGTCC CTATCAAAGG GACGATCTTT GAAGGAAAAC AACCGCAGCT GAGATGGCGT
GAGAGCAGGC TATACCAGCT CGACTGGCTT TATAGGGTCT ACAGACTCTC CCCCGAGCAG
ATCAAAAATG TCTTCGACGA TTATGGATTT CTCATCAATC AGGATCCAAA GGTCATTCTG
GCCGGAGAAT CGCTCGACCT GCCACTCGAT GTGAATGAGG CCGATTTTCA GAGCCTGATA
CGAGTGCCTG GCATAGGGCC GGAGAGCGCA CGCAGGATCA TCTCATACAG GAGGAGGGAG
AGGATAGAGA GTCCATCAGA TCTCATCAGG CTCGGCATCA AGAGGAAGGC GATACCGTAC
CTGAAGATAA ACGGATGGGT GCAGAAGAGG CTCTTATGA
 
Protein sequence
MERSILARGA SFDLEDVDIK VQALSRSTRY DRCCHRRNDD SLIYSASGRN GCHVRLFKTL 
FTNECYHQCG YCPNAGTSNG CSYTPEELSN IVSSLRREGL IDGLFLSSGA GRDEDSTMEE
MLETVRILRE RHGFSGYIHL KILPGTSRHL IEEAVELADR VSINIEAPSM DVMHELSPTK
DYERDILDRQ MYIRDILARR SRGSQTTQLV VGAAGETDLE IFQRVVKEYR EIGVSRVYYS
AFVPIKGTIF EGKQPQLRWR ESRLYQLDWL YRVYRLSPEQ IKNVFDDYGF LINQDPKVIL
AGESLDLPLD VNEADFQSLI RVPGIGPESA RRIISYRRRE RIESPSDLIR LGIKRKAIPY
LKINGWVQKR LL