Gene Mthe_1046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1046 
Symbol 
ID4463114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1130872 
End bp1132908 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content55% 
IMG OID639700064 
ProductDNA mismatch repair protein MutS domain-containing protein 
Protein accessionYP_843470 
Protein GI116754352 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0558977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGGCCCC GCCTGAGGGA GCGCTTACTC TCATATTTCG AGAGCGAGGA GATGGCCCTC 
AGGGCGATCG CAGATGAGGA TATGAATGAT CTGAGAGCGG CGATCGGCGA GCGCCATGCC
ATAGCCACTG TCAGAGCAGC CAGAGGCCTG AGGTACGGTG TCTCCCCCGA GTCATTCCTG
GCCACTGATG AGGCCGAGCG GATATACAGA ACGATCCTGA ACAGAATAGC GGAGCATGCG
AATACGGCGT ATGCCATCAT GCGGATTTCC ACCCTTTTCC CCTCAGGATC TCCAGAGCTC
ATAAAAGAGA TGCGCTCGGT CTCGCTGCGC GCCATGGATC TTCGCAGGAG GCTCGGAGAT
GTAAGAGATC TCCTGCGCAG GATAAAGCCG CTCAGGCGGA GGGGCGCTCA GAGAATAAAA
GGGAGGGCTA TCGCCGCTAG ATCCCCAGAG GAGCTCGCAA TGGCGAGGTC GATGGGCTTT
GACCGCCTCC TCGACATACA TCTGGCAGAG AGCCCAGGAG AGCTACGGGA TATCGCCAGC
GGCTACGATC ATGTGATCGT TCTCAGCGAC CCTGGAGTAC CCCTCCCAGG TGTGGAGATC
GCGGAGCAGC TTGACATCTG GTACATTGCC CCCGAGGCAG TGCTGAGCTT CTATACAGAG
AACAGAGACG CTCTGGAGGC CGCCATGGAG CTCGCAGCAG TGCTTGAGGA GAGAGGCATC
GAGCACTTTG AGGATCTGAG TAAGCTCAGG AATGCGCTCA GAAGGTTGTT CGATGATGAA
GATCAATCAG ACATATCACG CATTGATGAT CTTCTTAAAA GATTACCTGC GGCGGTTGAT
TCAGCGCTCG CCCAGGCGAA CACAGAGCTC CGCAAGCGCA TAGAGACGAG CTCTGTGACC
CTTGGAGGCC CGGATCTCCT CAGAGCGCTC GGGCGCGGGG ATATGATCAG GGATGTGTTT
GAGACGCAGA TGCATGGGAT CTTCAAATCT GTGATATCAG AGGCGAGGGC GAGGGTCGCT
GCTGATCTGA ACCTTAAAGG AGAGGCAATC TGGCTGGAGG AGATCATACC CGAGGAGATA
AAATACCCTC TGGAGATCAA CCACAGAGCT CTGCGCCAGC TTGAGATTGA GCTGAGGAGG
AGGAGGGAGG CAGAAGGGCT GAGGAGGAAG AGAGAGATCG CCGGATCCCT GGCGAACATG
GATGGTCTCA CAGCGAATCT CATAAAAAAA CTCATACAGC TCGATTTTCT TTACGCCATC
GGAGACTTCG CCATCTCATG CGGTCTCACA ATGCCTGAGC TCATCCACGA TCCTGGGATA
GGATTCCGCG ATGGCAGGCA TCTATTCATA CAGAACCCGG AGCCTGTCAG CTACAGTCTG
GGTGCATGTG GGATCCGGGA GTACACTGAA AAGGCAGCGA TACTGAGCGG TGTGAACTCC
GGCGGCAAGA CCTCGCTCCT TGAGCTGATG GCGCAGATAG CGATACTCGC ACACATGGGC
CTGCCTGTTC CTGCCAGTGA GTGCAGGATA TCGATATTCG ATGAGCTTTA CTTCTTTGCA
AAGAGCAGTG GCACTCTCAG TGCGGGGGCG TTCGAGAGCA CAATGAGAAA GCTCTCAGCT
CTCGCAACCG AGAAAAGAAA GCTCGTGCTC GCGGACGAGC TGGAAGCGAT AACAGAGCCA
GGGGCATCTG CCAGGATAAT CGCATCGATA CTTGATATGG TCCAGGAGAA CGGCTCTGTG
GCTCTTTTCG TGAGCCATCT CGCAGATGAG ATACGCAGAT TTTCAAGGAC AGCTGTCAGG
GTGGATGGAA TAGAAGCAGA GGGCCTGGAT GAGAGGAACA ACCTCATACT CAGCAGGAGT
CCGAGGTACA ACCATCTGGC CAGATCAACG CCAGAGCTCA TACTCGACAG GCTTGTCAGA
ACAACAGAAG GGAAGGAGAG GACGTTCTAC GAGGCGCTGC TGACGAGGTT CCGTAACACA
CAGTCGCGAT CCGAGAAGAT TACAGGAAAT TGTGCACGTC TCCAGATCGA CTCATGA
 
Protein sequence
MGPRLRERLL SYFESEEMAL RAIADEDMND LRAAIGERHA IATVRAARGL RYGVSPESFL 
ATDEAERIYR TILNRIAEHA NTAYAIMRIS TLFPSGSPEL IKEMRSVSLR AMDLRRRLGD
VRDLLRRIKP LRRRGAQRIK GRAIAARSPE ELAMARSMGF DRLLDIHLAE SPGELRDIAS
GYDHVIVLSD PGVPLPGVEI AEQLDIWYIA PEAVLSFYTE NRDALEAAME LAAVLEERGI
EHFEDLSKLR NALRRLFDDE DQSDISRIDD LLKRLPAAVD SALAQANTEL RKRIETSSVT
LGGPDLLRAL GRGDMIRDVF ETQMHGIFKS VISEARARVA ADLNLKGEAI WLEEIIPEEI
KYPLEINHRA LRQLEIELRR RREAEGLRRK REIAGSLANM DGLTANLIKK LIQLDFLYAI
GDFAISCGLT MPELIHDPGI GFRDGRHLFI QNPEPVSYSL GACGIREYTE KAAILSGVNS
GGKTSLLELM AQIAILAHMG LPVPASECRI SIFDELYFFA KSSGTLSAGA FESTMRKLSA
LATEKRKLVL ADELEAITEP GASARIIASI LDMVQENGSV ALFVSHLADE IRRFSRTAVR
VDGIEAEGLD ERNNLILSRS PRYNHLARST PELILDRLVR TTEGKERTFY EALLTRFRNT
QSRSEKITGN CARLQIDS