Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1113 |
Symbol | |
ID | 3833245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1139729 |
End bp | 1142320 |
Gene Length | 2592 bp |
Protein Length | 863 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637829041 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_429970 |
Protein GI | 83589961 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000302948 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAGGG AACAGGCCTT AACCCCCATG ATGGCGCAAT ACCGCCAGAT CAAGTCCCAG TACCCGGATT GCATCCTCTT CTTCCGCCTG GGAGATTTTT ATGAGATGTT TTATGAAGAC GCCGAGGTGG CAGCCCGGGA ATTGGACCTG GTTTTGACGA CCCGGGGCGG TAAAGAGGCA GCCCCCATGT GCGGGGTACC CTTTCACGCC GCCGACAGTT ACCTGGCCCG CCTGGTAGGT AAGGGCTACA AGGTGGCTAT TTGCGAACAG ATGGAGGACC CCCGCCAGGC CAAGGGCCTG GTACGCCGGG AGGTTATCCG GGTGGTTACC CCGGGTACGA TAACCGATGA GAAGGCCCTG ACCCCCGGCG GAAATAACTA CCTGGCGGCG ATAGTGCGCT ATAACGGCTG CTGGGGGTTG GCCTGGGCCG ATGCCTCCAC GGGGGAATTC TTGTTTACCA CCTGCCCGGA CCAGGAAACC CTGGTGGACG AGCTGGTACG CCTCATGCCG TCGGAATATT TGCTGCCCGG GGAACTGGCC CGGGATACGG CCCTAAAAAG GCTCCTGCAG ATTTACACCC GGGGCGTGAT TACCGGCTGG CAAGTAGCCA GTAATCCGGA GGCAGCCCGG CAGTCCCTGG AGGACCATTT TGGCCACGAG GCCCTGGCCG GGGTCGAATT ACCGGCAGCT GCGGGCCTGG CAGCGGCCAT GATTCTTTCT TTCCTGGTTG CTACCCAGCA CAACTCCCTG GCCCACCTGG AGGCCCCGGC TGCCGCAGCC ACAACCAGCC GGATGTTCCT GGATCAGGCT ACCAGGCGCA ACCTGGAGCT GGTGACTGCC GGCCGGGAGC AAAAACGGGA GGGTTCCCTC CTCTGGGTCC TGGATAAGAC CCTCACAGCC ATGGGTGCCC GGACCCTGAG GCGCTGGCTG GACCAGCCCT TGGTAGATGC CGGGGCCATC AAGGAACGCC AGGAAGCCGT CGCCGAACTG GTTGAGGGCT TCATCCTGCG CCAGGAATTG CGGGAGAGGC TGCAGGTGGT GCGGGACCTG GAACGCCTGG CCGGCAGGGT GGCCTATGGG ACGGCAGGCG GGCGGGAACT CCAGGCCCTG CGGGGTTCCC TGGCCGTAAT CCCCGGTATC CTGGAGCTGA TGAGCGAAGT ACATTCCAGG CTCCTGGCGC AGGTGAGGGC CCAACTGGAC CCCCTTGACG ACCTGGTCGA TCTCCTGGGC CGGGCCCTGG TGGACGATCC CCCGGCCAGC ATCACCGAGG GCGGCATTAT CCGTACCGGC TATAATGGAG AGGTCGATAA GCTGCGCGAG GCGGCCACCC ACGGCCGGGA TTGGATTGCC AGCCTGGAAG CCGAGGAGCG GGAACGCACA GGGATTAAAT CCCTCAAGGT CGGTTACAAC CGTGTCTTCG GTTACTACAT AGAGGTCACC CGTCCCAACC TCCCCCAGGT GCCGGCCGAC TACGAGCGCA AACAGACCCT GGCCAATGCC GAGCGTTTTG TTACCCGGCG CCTGCAGGAA CTGGAACGTC AGGTCCTGGG GGCAGAGGAG AGACTGGTCC AACTGGAGTA TGAACTCTTT CAGGGCCTGC GGGAAGAGGT GCTGGGCGTC CTGCCCCGTA TCCAGGCCAC GGCCCGGGCC CTGGGAGTAC TGGACGCCCT TATTTCCCTG GCTACGGTGG CCGTAGATAA CAACTATACC TGTCCACGGG TCGATGACGG TACGGTGATT GAAATCGAGC AGGGGCGCCA CCCGGTGGTG GAACTGGTCG GTTCTCCGGG GACCTTTGTC CCCAACGACA CCTACCTGGA CCAGGAACAG TATATCCAGA TCATTACCGG GCCCAATATG GCCGGTAAAT CCACCTATAT CCGCCAGGTA GCTCTGATTG TACTGCTGGC CCAGATTGGC AGCTTTGTAC CGGCCCGGCG AGCCCATATC GGCCTGGTGG ACAGGATTTT TACCCGGGTC GGTGCTGCCG ATGATATCTT TGCCGGCCAG AGCACCTTTA TGGTTGAGAT GCAGGAAGTG GCCGGTATCT TGAAACACGC CACCAGGCGG AGCCTGGTTA TCCTTGATGA AGTGGGCCGG GGTACAAGCA CTGCTGACGG TCTGAGTATT GCCCGGGCAG TAACGGAGTA TATTCACAAT GTCATTGGTG CCCGCTGCCT CTTTGCCACC CACTACCACG AATTGGTTAG CCTGGCGGAG GAATTGTCCG GGGTCCGCAA TTACTGCGTT GCCGTCCTGG AAGAGGGAGA GGACATTACC TTCCTCCGGA CGATAGTGCC AGGTAGCACC GACAAGAGCT ACGGAATCCA TGTGGCCCGC CTGGCGGGCC TGCCGGAGCA GGTCCTGGAA CGGGCGCGAG AAATTCTGGA ACAGCAACCC AGGGCCCAGG TGCGGGTTTT AACGAAACCG GTAAAAAAGC CGACCCTTAC TCCCGGCGAG GTACAGGTAC TGGAGGAACT GGCCGGTTAT AACCTGATGG CGGCCACGCC CCTGGAAGCC ATGGAGCAGA TCTTCCGCTG GCAGAAGATG CTTCACAAAG AGCTGGATAT TGTTAAAAGG AGCAGGGGAT GA
|
Protein sequence | MAREQALTPM MAQYRQIKSQ YPDCILFFRL GDFYEMFYED AEVAARELDL VLTTRGGKEA APMCGVPFHA ADSYLARLVG KGYKVAICEQ MEDPRQAKGL VRREVIRVVT PGTITDEKAL TPGGNNYLAA IVRYNGCWGL AWADASTGEF LFTTCPDQET LVDELVRLMP SEYLLPGELA RDTALKRLLQ IYTRGVITGW QVASNPEAAR QSLEDHFGHE ALAGVELPAA AGLAAAMILS FLVATQHNSL AHLEAPAAAA TTSRMFLDQA TRRNLELVTA GREQKREGSL LWVLDKTLTA MGARTLRRWL DQPLVDAGAI KERQEAVAEL VEGFILRQEL RERLQVVRDL ERLAGRVAYG TAGGRELQAL RGSLAVIPGI LELMSEVHSR LLAQVRAQLD PLDDLVDLLG RALVDDPPAS ITEGGIIRTG YNGEVDKLRE AATHGRDWIA SLEAEERERT GIKSLKVGYN RVFGYYIEVT RPNLPQVPAD YERKQTLANA ERFVTRRLQE LERQVLGAEE RLVQLEYELF QGLREEVLGV LPRIQATARA LGVLDALISL ATVAVDNNYT CPRVDDGTVI EIEQGRHPVV ELVGSPGTFV PNDTYLDQEQ YIQIITGPNM AGKSTYIRQV ALIVLLAQIG SFVPARRAHI GLVDRIFTRV GAADDIFAGQ STFMVEMQEV AGILKHATRR SLVILDEVGR GTSTADGLSI ARAVTEYIHN VIGARCLFAT HYHELVSLAE ELSGVRNYCV AVLEEGEDIT FLRTIVPGST DKSYGIHVAR LAGLPEQVLE RAREILEQQP RAQVRVLTKP VKKPTLTPGE VQVLEELAGY NLMAATPLEA MEQIFRWQKM LHKELDIVKR SRG
|
| |