Gene Mthe_0644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0644 
Symbol 
ID4462284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp678262 
End bp679476 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content55% 
IMG OID639699652 
Producthypothetical protein 
Protein accessionYP_843074 
Protein GI116753956 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00156729 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCGA TGCTCTGTCT TATCAGCGAG CAGCATGTTC CGAACCTGCT GGGCGTTCAC 
GAGCTGCGGC CGGATCTCCT CGTGCTGCTT GAGACCGAGG GGATGAAAAG GAGGGAGGCT
GCAAACAGAT TCCTGAAAGC CCTTGCGATC GGAGGTCAGG ATTACCTAAC AAGAAATGAG
ATCGTGCCGC TGGAGGATGG TGACTCAATA GAGGAGACTG AGAGGGCGCT GAAAGGGGTC
TATGAGAGAT ACAGAGATGC GGAGTGGATC GTGAACATCA CAGGCGGCAC GAAGCCGATG
AGCATAGGAG CATACGGGTT TTTCAGGCAA AAGAAGAATG CCAGGATAAT CTATGTCTCC
GCGTCTGACC AGTCGAGGGC GCTGGACTTC TCGGGTGGAG CGGACATACC TCTGAGCCAC
AGGATATCTG TGGCTGAGTT CCTCGCAGGC TATGGGTTTG ATGTGCTCCA TTACGACAAG
GTCCAGGAGA ACGAGGAGCG GAGCAGGAGG TGGCTTGGTC TTGCAGCAGA GATCGCGGCG
AGGAGCCAGA ATGGCGCCAT TCTCGGGCTT CTCGCGAATT TATCGAGGAT ATCGAAAGAG
CGGAGGGGCA GGTACAGGGG ACTCAAGATC TCAGAATCAG ATGGTCTATT TCTGAACGAT
GGTCATCTGC GTGAGATGAT CGCTTCGAGC TTTGGTCTGG CATGTGATGG TGGGCACTTC
ACAGGCGCCC TGGATAAATA CGCTGTCAGG TTCCTCACAG GCGGCTGGCT TGAGGTCTTC
ACATGGGGGT TGCTGAGGGG GCTTGATCGT GTCTGGGATG TGCATCTCGG TTTGCAGATT
GGAATGAAGA ACGAGAAGCT CCAGAACGAT CTGGATGTTG TGTTCATGAC AGATCAGTCC
CTCAGGATCG TGGAGTGCAA GAGCGGCGGG CAGGAGCACG ACAGGGAGGG GAGTGATACG
CTGTACAAGA TTGAGGCGAT ACGGAAGCAG CTCGGAGCAC TTCGTGTTCG ATCCTATCTT
GTCACGACCT CTGATAACGT GATCGATTCC GAGACCGGTA ATATCAAGGA GCATCTGGAG
GACAGATCGA GGCTCTATGA GTGCAACATT GTGAAGCCTG AAGATGTTCG CAGTCTTGCG
CAGATGTACC TCGCAGGTGA CGTGCGGCTG AACGCGAGGG TTGCGCAGGT CTTCAACATA
CGGCAGGCGG TTTGA
 
Protein sequence
MKAMLCLISE QHVPNLLGVH ELRPDLLVLL ETEGMKRREA ANRFLKALAI GGQDYLTRNE 
IVPLEDGDSI EETERALKGV YERYRDAEWI VNITGGTKPM SIGAYGFFRQ KKNARIIYVS
ASDQSRALDF SGGADIPLSH RISVAEFLAG YGFDVLHYDK VQENEERSRR WLGLAAEIAA
RSQNGAILGL LANLSRISKE RRGRYRGLKI SESDGLFLND GHLREMIASS FGLACDGGHF
TGALDKYAVR FLTGGWLEVF TWGLLRGLDR VWDVHLGLQI GMKNEKLQND LDVVFMTDQS
LRIVECKSGG QEHDREGSDT LYKIEAIRKQ LGALRVRSYL VTTSDNVIDS ETGNIKEHLE
DRSRLYECNI VKPEDVRSLA QMYLAGDVRL NARVAQVFNI RQAV