Gene Mthe_1478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1478 
Symbol 
ID4461932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1591316 
End bp1592989 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content55% 
IMG OID639700497 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_843891 
Protein GI116754773 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.064699 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAGGA TACACATACT TGATGAAGAG ACTGTGAGCA GGATAGCTGC GGGGGAGGTG 
ATAGAGCGGC CGGCATCTGT CGTCAAGGAG CTGATCGAGA ACTCGATAGA CGCCGGAGCA
TCAAGAATAA TCATCGAGGT CGAGAACGGT GGCATCTCTC TCATAAAGCT CGTGGATGAT
GGATGCGGTA TCGAGCGCGA GGATCTCCCA CTTGCATTCC AGAGGCACGC CACAAGCAAG
ATCTCTACAG CAGATGACCT GTTCAGGCTT AAAACCCTGG GGTTCCGCGG CGAGGCTCTC
TCTGCGATCG CCAGTGTATC AAAATGCGTG GAGGTGCACA CTAGGACTAG ATACTCTCCA
GTGGGCACGT ATCTTCGACT GGAGAACGGC AGGGTTGCTG AGATAAAAGA TGATGGATGC
CCTTACGGAA CAAGCATAGA GGTGAGGGGG CTTTTTGAGA CGATCCCTGC AAGGCTCAAG
CATCTCTCCT CTCCATCACA GGAGCTTGCG AGGATCGCGG AGATTGTGAC ACAGATGGCC
ATAATCCACC ACAGGATATC GTTCGAGCTC TCCTCCGGCA GAAGGACTCT TTTCAGATCT
AACGCCTCTG AGACGTGGGA CGATGCACTG ATCAGGGCAT TTGGCCTCAG AACTGCGAAA
GGCATGATTT CCATAATAGC GGAAGGGGAT GGTTTTGACC TTCATGGCAT GATCTCCTCC
CATGATTCAT CACACCATGG CTCTGAGTTA ATACTAGTTT ACGTAAACAG CCGGCCTGTC
TACTCCAAAG TTGTCGTTCA GGCTCTGAGG GAAGCGTACC GTGGCTTCCT GCAGTCTGGT
AGGAGCCCTC TTGCTGTAAT CTCGATCGAG ATCGAGCCGT CACTGGTGGA TGTGAATGTC
CATCCAGCGA AGAGAGAGGT CAGGTTTCTT CGGGAGGATG AGGTATACGA TGCTGTGAGG
GATGCAGCGC TCAGCGCGCT GAGGTCCTCA GCAATACCAT CTCCACCACC GCCCGCGAGG
ATCGCAGAGC CGCAGCTCTG GAGCGCGAAG CCCCAGATCC AGAGGACGCT CCCTCTTGAG
GTGCAGGCTG AGCAAAGAAT CTCAGAGCCG CTCATCAGAA TCGTGGGCCA GGCTCTCGAT
CTCTATATCA TTGTGGAGGA CGATGATGGT ATAATGCTCG TGGATCAGCA TGCTGCAGCA
GAGCGGATCC GCTACGAGCA TCTCCTCGAG AAATGCAAGA GCGGAAGCAT CTCTCAGGAG
CTGATCCAGC CGGTCACCGT GGAGCTATCC CCTGGGGAGG TGGCACTGCT CGATTCGTTC
TCAGGAGAGC TGGGGGAGAT CGGCTTCGAG ATCGATCCCT TCGGAGGGAG GGCATACAGC
GTGAGGTCTG TGCCTGCAGC TGCTGGCCTC GAGAGCCCCG AGTCGATTCG CGATGTTCTC
AGAGAGATCC TGAATCTCGG CAGGGTGTGC AGAGCATCCT TCAGAGATGA GGCTCTGAAG
CATCTAGCAT GCAGGGGCTC GATAAAATCG GGGGAGAGGC TGAGCGAGAG CGCGATGCTC
AGGCTTCTCA CAGACCTCTT CGCATGTGAT AACCCCAGGA CATGCCCGCA CGGAAGGCCT
GTCGTTGTTC GGATATCCTC TGAGAGCCTG GAGAAGATGT TTGGGAGGAG ATGA
 
Protein sequence
MPRIHILDEE TVSRIAAGEV IERPASVVKE LIENSIDAGA SRIIIEVENG GISLIKLVDD 
GCGIEREDLP LAFQRHATSK ISTADDLFRL KTLGFRGEAL SAIASVSKCV EVHTRTRYSP
VGTYLRLENG RVAEIKDDGC PYGTSIEVRG LFETIPARLK HLSSPSQELA RIAEIVTQMA
IIHHRISFEL SSGRRTLFRS NASETWDDAL IRAFGLRTAK GMISIIAEGD GFDLHGMISS
HDSSHHGSEL ILVYVNSRPV YSKVVVQALR EAYRGFLQSG RSPLAVISIE IEPSLVDVNV
HPAKREVRFL REDEVYDAVR DAALSALRSS AIPSPPPPAR IAEPQLWSAK PQIQRTLPLE
VQAEQRISEP LIRIVGQALD LYIIVEDDDG IMLVDQHAAA ERIRYEHLLE KCKSGSISQE
LIQPVTVELS PGEVALLDSF SGELGEIGFE IDPFGGRAYS VRSVPAAAGL ESPESIRDVL
REILNLGRVC RASFRDEALK HLACRGSIKS GERLSESAML RLLTDLFACD NPRTCPHGRP
VVVRISSESL EKMFGRR