Gene Mthe_0467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0467 
Symbol 
ID4462624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp483198 
End bp484349 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content54% 
IMG OID639699469 
Productradical SAM domain-containing protein 
Protein accessionYP_842898 
Protein GI116753780 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.103489 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGAGGA AAAGGCCTTT TCACGTTATG ATCATTCCCA CTCTGGGATG TCCTTCCAAG 
TGCAGCTACT GCTGGAGCTC TGAGGAGGGC TCTCCGGTGA TGAGCATTGA TACTGTAAAG
GAGATCGTCG AGTGGCTGAA GCTCTTTCGT GATGATCCCG TGACGTTCAC CTTCCACGGC
GGTGAGCCGC TGCTGGCGGG TGTGGAGTTT TACAGGAAGG CACTGCCGTT GCTTGCAGAT
GGCCTTTCAC ATCTCACTCC ATCATTTGCA CTGCAGACGA ACCTCTGGAG GCTAACGCCT
GAGCTGGCTG AAGTCCTCAA GGAGTATGAT GTGCCGATCG GCTCAAGCCT GGACGGCCCG
AAGGAGATAA ACGATCTTCA GAGGTCAGAG GGTTACTACG ATCGAACCAT GCGCGGCTAC
GGCATTGCCC GCGATCACGG CCTGAGTGTG CAATTCATAT GCACATTCAC CTCGCACTCC
ATAAAGTACA AACAGGAGAT CTTCGATTTC TTCATGAGCA ACGGATTGAC CCTGAAGCTC
CATCCAGCGC TTCCATCGCT TCGCAGCGAC GAGCCGGAGC GGTGGGCCCT CGATCCATCT
GAGTACGGGG AGCTTCTAGT TTATCTCCTC GACAGATACC TCGAGAACAT GGACAGGATC
GAGGTGAGGA ACATCAACGA TCTCTGCAGA TGCGTCTTCA GCGGTCGGGG AACTGTGTGC
ACATTCGTGG ACTGCATGGA TAACACGTTC GCTGTGGGCC CGGATGGGAG CATATATCCG
TGCTACAGGT TTGTCGGGAT GCCCGATTAT GTCATGGGTG ATGTGAGAGA TCATCCATCA
ATGGACGATC TGAAGCGATC TGAAGCATGG AGGCGGATGA ACCGCTTCAG GGAGTGCGTG
GAGGTGCACT GCAGGAAATG CAGGCACCTC AGATACTGCA GGGGCGGGTG TCCTTACAAT
GCGATATCCC ACACAGATGG GGAGATAAGA GGCGTGGATC CCTACTGCAT CGCTTACAAA
AGAATCTTCG ACGAGATCAC AGAGAGGTTC AACAGAGAGA TGCTCAGCTC CTTCGGATTG
CAGAGCAGCA AGCCTGGAAT AATCGCGCTC ATCCGCAAGA TCGCATCCAA GGAGGAGCCA
AAAGGGCTGT GA
 
Protein sequence
MQRKRPFHVM IIPTLGCPSK CSYCWSSEEG SPVMSIDTVK EIVEWLKLFR DDPVTFTFHG 
GEPLLAGVEF YRKALPLLAD GLSHLTPSFA LQTNLWRLTP ELAEVLKEYD VPIGSSLDGP
KEINDLQRSE GYYDRTMRGY GIARDHGLSV QFICTFTSHS IKYKQEIFDF FMSNGLTLKL
HPALPSLRSD EPERWALDPS EYGELLVYLL DRYLENMDRI EVRNINDLCR CVFSGRGTVC
TFVDCMDNTF AVGPDGSIYP CYRFVGMPDY VMGDVRDHPS MDDLKRSEAW RRMNRFRECV
EVHCRKCRHL RYCRGGCPYN AISHTDGEIR GVDPYCIAYK RIFDEITERF NREMLSSFGL
QSSKPGIIAL IRKIASKEEP KGL