Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_1477 |
Symbol | |
ID | 4461931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 1588765 |
End bp | 1591338 |
Gene Length | 2574 bp |
Protein Length | 857 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639700496 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_843890 |
Protein GI | 116754772 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.480331 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGATGGTAA GGCTCTCGCC GCTGATGGAG CAGTATTCCC GCATAAAGCA GCAGTGCCCA GATGCCCTCC TCCTTTTCAG GGTGGGGGAT TTTTATGAGA CCTTCGGCGA GGACGCAATT ACCACTTCCA GAACGCTGAA CATAACACTC ACATCGAGGC AGAAGGACGA CGAGGGGAAC AGGATCCCGC TGGCGGGCAT ACCGTATCAC TCACTTGACG TTTACCTCTC CAGGCTTGTT AATGCAGGTC ATAAGGTAGC GATATGCGAG CAGGTCGAGG ATCCAAAGCA GGCAAAGGGG ATTGTGAGGA GAGAGATCAC AAGAATAGTT ACCCCTGGAA CCGTGATAGA GCCTGCGCTG CTCCAGGAGA GGAGCAACAA CTACCTTGCA GCCGTCCTTC TGGAGAACGG GCTGGCAGGG CTGTCTCTTC TCGATCTATC AACAGGCGAT TTCATAGCAT CAGAGGTTCC GGTGGAGCGG CTGGGATCGG AACTGGCCAG GTTCTCGCCG GCTGAGTGTC TGGCCCCTCC GGGTGTATCT CTGGAGGGCA TGAAATTTCA GAACCTCGAT GCCGAACTCT TCTCCTTGAA CGAGGCGTCT GAATCTGGTG GAATTCTTGG AGATCTCCAC AAGAGAAGCC CGCTTTGCGC CCGCGCTGCC TGCGCGATAC TCTCCTACCT GCGGGAGACC AGAATCCAAT CCGTGGATCA CATCAAGCCG ATCCGGATGA TATCCTCCTC TGAGTACATG CTTCTCGATG AGATCACGCT GAGAAACCTG GAGATATTCA GGAACCTCAG GGATGGGTCC AGAAGCGGCA CGCTCATGGA GATCCTGGAC GAGACCGTAA CGCCGATGGG ATCCAGGACC CTGGCTAGGT GGCTCCAGAT GCCGTCGATG TCTCTGGAAG TCATTCGCAG AAGACAGGAT GCGATTGAGG AGATGGTGAG AAGAGCTGTG ATCAGAGAGG AGATCTCGGA GCTTCTCGAT GGACTCAGCG ACCTGGAGAG GATCATCGGG CGTGTCTCGC TCGGAAACGC GGGTCCGAAG GATCTCGTGG CGCTGCGTTC ATCGCTCCGC AGGATCCCTG AGATCGCACA TGCGATGAGC GCTCTTGAGT CTGAGTATCT TGTTGAAATC AGAAAGAGGC TGGATGCAAG CGAGCTTGAT GATCTCGTGG TGCTGCTCGA AAGGGCACTC TCAGATGACC CACCAACATC GCCCAGGGAT GGTGGCGTGA TACGGGATGG CTACAGCCCT GAGCTGGATG AGATCCGTTC AGCTCTCAGA AACGGGAGGA GCTGGATCGC GGAGCTGGAA TCATCTGAGA GAAAGAGGAC GGGGATAAAG TCGCTGAAGG TCGGATACAA CAACGTCTTC GGTTATTACA TAGAGGTCAC AAAGCCGAAC CTCTCGATGG TGCCAGACGA TTATATAAGA AAGCAGACCC TCTCAAATGC GGAGCGCTTC GTGACCCGGG AGCTGAAGGA GGTGGAGAGC AGAGTGCTCT CAGCTCAGGA GAGATCGTCA GCGCTCGAAT ACGAGGTCTT CCTCGATCTG AGGGGACAGG TCGCATCCAG GACTAGATCT GTGCAGGAGG TTGCAGCCGC GATAGGCGAG CTGGACACGA TTCTCGGATT GACGAGAGCT GCCCTCATGG GAGCGATGGT GAGGCCTGTT GTAGATGCTG GCAGGGAGGT GATACTCCGC GACTCTAGGC ATCCCGTTCT TGACAGAGTC ATGAAGGGCG GCTTCGTCCC GAATGACCTC ACGATGGATG AGAGCAGCTG GTTCATGATA CTGACCGGCC CCAACATGGC CGGGAAGTCG ACGTTCATGC GCCAGGTCGC TCTGATAGCG ATAATGGCGC AGATCGGCTC GTTTGTTCCC GCCTCTTATG CGAAGATAGG GCTCATCGAC AGGATCTTCA CAAGGGTCGG CGCCAGGGAC GATCTCGTCT CGGGGAGATC CACATTTATG GTGGAGATGA GCGAGCTCGC GAACATACTG GTATCAGCCA CAAAAGACAG CCTGATACTG CTCGATGAGA TAGGGAGGGG GACGAGCACG TTTGATGGGC TGAGCATAGC ATGGGCAGTC TCTGAGTACA TACATTCCAG GATTAAGGCG AAGACGATCT TCGCGACGCA TTACCACCAG CTTACGCAGC TCAATCTCCC TGGAATTGTG AACTACAGCA TGGCTGTGAA GGAGGAGGGG AGATCGATAA CATTTCTCAG AACTGTTGTG CCAGGTGCGA CAAACAAGAG CTATGGAATA CATGTCGCCA GGCTTGCTGG TGTCCCGGAG CATGTGATTC GACGGGCGGA GGAGCTTCTC GACATTATAG AGGAACAGGC AGCCATAGAG ATAAGGAAGT GCAGATCAAA GGAGAGACCG AAGAGGTACA CTCAGCTCAT ATTCTTCAAC CAGCCCGAGA GCATAGATAA CGACATACTC GAGGAGATAA AGAATCTTGA ACCTGAGAAG ATAACACCGC TGCAGGCTCT GAACCTGCTT GTGGAATACA GAAGAAGGCT CGGCTGTAAA GATGCCAAGG ATACACATAC TTGA
|
Protein sequence | MMVRLSPLME QYSRIKQQCP DALLLFRVGD FYETFGEDAI TTSRTLNITL TSRQKDDEGN RIPLAGIPYH SLDVYLSRLV NAGHKVAICE QVEDPKQAKG IVRREITRIV TPGTVIEPAL LQERSNNYLA AVLLENGLAG LSLLDLSTGD FIASEVPVER LGSELARFSP AECLAPPGVS LEGMKFQNLD AELFSLNEAS ESGGILGDLH KRSPLCARAA CAILSYLRET RIQSVDHIKP IRMISSSEYM LLDEITLRNL EIFRNLRDGS RSGTLMEILD ETVTPMGSRT LARWLQMPSM SLEVIRRRQD AIEEMVRRAV IREEISELLD GLSDLERIIG RVSLGNAGPK DLVALRSSLR RIPEIAHAMS ALESEYLVEI RKRLDASELD DLVVLLERAL SDDPPTSPRD GGVIRDGYSP ELDEIRSALR NGRSWIAELE SSERKRTGIK SLKVGYNNVF GYYIEVTKPN LSMVPDDYIR KQTLSNAERF VTRELKEVES RVLSAQERSS ALEYEVFLDL RGQVASRTRS VQEVAAAIGE LDTILGLTRA ALMGAMVRPV VDAGREVILR DSRHPVLDRV MKGGFVPNDL TMDESSWFMI LTGPNMAGKS TFMRQVALIA IMAQIGSFVP ASYAKIGLID RIFTRVGARD DLVSGRSTFM VEMSELANIL VSATKDSLIL LDEIGRGTST FDGLSIAWAV SEYIHSRIKA KTIFATHYHQ LTQLNLPGIV NYSMAVKEEG RSITFLRTVV PGATNKSYGI HVARLAGVPE HVIRRAEELL DIIEEQAAIE IRKCRSKERP KRYTQLIFFN QPESIDNDIL EEIKNLEPEK ITPLQALNLL VEYRRRLGCK DAKDTHT
|
| |