Gene Information |
Locus tag | Mlg_1485 |
Symbol | |
ID | 4269958 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1696849 |
End bp | 1699485 |
Gene Length | 2637 bp |
Protein Length | 878 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638126243 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_742324 |
Protein GI | 114320641 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
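
The coordinates and lengths listed above agree with each other, as a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so), and that the unspliced CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(1696849, 1699485)
print(length)                     # 2637
print(protein_length_aa(length))  # 878
```

Both results match the Gene Length and Protein Length rows, which is consistent with the gene being neither spliced nor a pseudogene.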
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCAGT CCAGTCCCAA GGCATCGTCG CGGCATACGC CCATGATGCG CCAGTTCCTG CGCATCAAGG CCGAACACCC GGACATCCTG CTCTTCTACC GGATGGGCGA TTTCTACGAA CTATTCTACG AGGATGCCGA ACGGGCCGCG AAGCTGCTGG ACATCACCCT GACCACCCGC GGGCAGTCCG CCGGCGAACC CATCCCCATG GCCGGCGTGC CGGTGCACGC GGTGGAGAGC TATCTGGCGC GGCTGGTGCG CCAGGGCGAA TCGGTGGCCA TCTGCGAACA GATCGGCGAC CCGGACAACA GCAAGGGCCC GGTGGAGCGG CAGGTGGTGC GTATCGTCAC CCCCGGCACC CTGACTGACG AGGCCCTGCT CGAGGAACGT CAAAGCAACA TCCTGGCGGC CCTGAGCTGC CATCAATCGC GCTGGGGGCT GGCCAGCCTG GAGCTCTCCA GCGGCCGGTT CAGCCTCACC GAACCTGCCG ACGAGCAGGC CCTTGCCGCC GACCTGGAGC GGCTGAACCC CGCCGAGTTG CTGGTGGACG AGGCGCTCAC CCTGCCCACG GGCCTGGCCG TGGGGCCGGG GCTCACCCGC CGGCCACCCT GGCACTTCGA GCTCGATACC GCAACCGACC TACTCACAGA GCAGTTCGGC ACCCGCGATC TGGCCGGCTT CGGCGCCCAG GACCACAGCG CAGGGCTCGC CGCCGCCGGC GCCCTGCTCC AGTACGTCCG TGAGACCCAG CGCTCCGCCC TGCCCCATAT CCGCAGGCTG CAGGTGGAAC ACGGCGATCA GGCCATCGTC ATCGACGCCG CCAGCCGCCG GAACCTTGAG CTGGAACGCA ACCTCTCCGG GGGCACCGAA CACACCCTTG CCTCGGTGCT GGACAGTACC GTCAACGCAA TGGGCAGCCG CCTGCTGCGC CGCTGGCTCA ACCGCCCGCT GCGCGACCGC ACCACCCTGC AGGCCCGCCA CCAGGCCGTG GAGATCCTGA TGGCCGAGTC CCTGACCGAG GCGCTTCGCC GCCAGCTCCG GGGAATCAGC GACGTGGAGC GCATCCTTGC CCGGGTGGCA TTGGGCAGCG CCCGGCCCCG GGATCTCACT GGCCTGCGGG AGACGCTCGC CCGGCTCCCG GACATCCAGG CCACCCTCAC CGGTGCCGGC GCTCCTCGGC TGGTGGACCT GGCGGCGCAG TGCGGGGAAC ACCCGCAGAC CCTGGACCAC TTGCGCCGCG CCCTGGTGGA CCAGCCACCG GTGGTGATCC GCGACGGCGG GGTCATCGCC GAAGGCTATG ATGCGACCCT GGATGAGTTG CGGACGCTTT CAGAAAACGC CGATAACTAC TTGTTGGAAC TCGAACAACG CGAGCGGGAA CGCACGGGGA TCAGCACCCT GAAGGTGGGT TATAACCGGG TACACGGCTA CTACATCGAG GTCACCCGCG CCCAGGCTGA TGCGGTGCCC GCCGAGTATG TCCGACGCCA GACCCTCAAA GGGGTGGAGC GCTATATCCT CCCTGAACTC AAGGCCTTCG AGGACAAAGT GCTGTCGGCA CGTGAAAAGG CCCTGGCCCG CGAGAAGGTC CTCTATGAGC AATTGCTCGC CAGCCTGGCC AGCGACCTGG CGCCGCTTCA GGACACCGCC GCGGCACTGG CGGAGCTGGA CACCCTGGCC GCCTTCGCGG AACGCGCGCA GGCGCTGGAC TACTCTCGCC CGGAACTGCG CGATGGGGCG GGACTGCGCA TTGAGGCCGG CCGGCACCCG GTGGTGGAAT ACAGCCTCGA CGGCCCCTTC 
GTCCCCAACG ACCTGACACT GGACGACCGC AGGCGGATGC TGATCATCAC CGGGCCCAAC ATGGGGGGCA AGTCCACCTA TATGCGCCAG GTGGCGCTGA TCACCCTGAT GGCGCATATC GGCAGTTTTG TACCGGCGCG CGCCGCCAGC CTGGGCCCGG TGGATCGGAT CTTCACCCGG ATCGGCGCCT CCGACGACCT GGCCGGTGGC CGCTCCACCT TCATGGTGGA GATGACCGAG ACGGCGAACA TCCTGCACAA CGCCACCGCG CAAAGCCTGG TGTTAATGGA CGAAATCGGG CGAGGCACCA GCACCTTCGA CGGCCTGGCC CTGGCCTGGG CGACCGCCGA ACGGCTGGCC CGCGACCAGC GCGCCTACAC CCTTTTCGCC ACCCACTATT TCGAGATGAC AGCGCTGCCC GAGCAATGTC CCGGTGCCAG CAACGTCCAC CTGGATGCCG TGGAGCACGG CGAGCGCATC GTCTTTCTGC ATGCGGTCAA ACCCGGCCCG GCGAGCCAGA GCTACGGCCT CCAGGTGGCC GCGTTGGCGG GCGTCCCCGG GCCGGTGCTG GAGGCCGCCC GCGAGAAGCT GCGGGCGCTC GAAGAGGAGA GTTCCCGCCA GAGGGCCGAG CCGGATCAGC TCTCACTCTT CGCGGAACCG GCGCCGCCCC CACCCCTGCC CAGTGCCGCC GAGCAGGCGC TGTCGGAGGT GGACCCGGAC GAACTCTCAC CCCGCCAGGC CCTGGACCTG CTCTACCGTC TGAAGGCGTT GACCAGTGGC GAGGAAGGGG CGGACAAAAA GGCGCGGGGC GATGCAGTCG ATGCCCGGAG TCGCTAG
|
Protein sequence | MAQSSPKASS RHTPMMRQFL RIKAEHPDIL LFYRMGDFYE LFYEDAERAA KLLDITLTTR GQSAGEPIPM AGVPVHAVES YLARLVRQGE SVAICEQIGD PDNSKGPVER QVVRIVTPGT LTDEALLEER QSNILAALSC HQSRWGLASL ELSSGRFSLT EPADEQALAA DLERLNPAEL LVDEALTLPT GLAVGPGLTR RPPWHFELDT ATDLLTEQFG TRDLAGFGAQ DHSAGLAAAG ALLQYVRETQ RSALPHIRRL QVEHGDQAIV IDAASRRNLE LERNLSGGTE HTLASVLDST VNAMGSRLLR RWLNRPLRDR TTLQARHQAV EILMAESLTE ALRRQLRGIS DVERILARVA LGSARPRDLT GLRETLARLP DIQATLTGAG APRLVDLAAQ CGEHPQTLDH LRRALVDQPP VVIRDGGVIA EGYDATLDEL RTLSENADNY LLELEQRERE RTGISTLKVG YNRVHGYYIE VTRAQADAVP AEYVRRQTLK GVERYILPEL KAFEDKVLSA REKALAREKV LYEQLLASLA SDLAPLQDTA AALAELDTLA AFAERAQALD YSRPELRDGA GLRIEAGRHP VVEYSLDGPF VPNDLTLDDR RRMLIITGPN MGGKSTYMRQ VALITLMAHI GSFVPARAAS LGPVDRIFTR IGASDDLAGG RSTFMVEMTE TANILHNATA QSLVLMDEIG RGTSTFDGLA LAWATAERLA RDQRAYTLFA THYFEMTALP EQCPGASNVH LDAVEHGERI VFLHAVKPGP ASQSYGLQVA ALAGVPGPVL EAAREKLRAL EEESSRQRAE PDQLSLFAEP APPPPLPSAA EQALSEVDPD ELSPRQALDL LYRLKALTSG EEGADKKARG DAVDARSR
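
The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be stripped before any computation. The following is a minimal sketch of how the GC content row could be recomputed and how the stop codon could be checked. The helper names are hypothetical, and the stop codon set is the one defined for translation table 11. Only the first two blocks of the gene sequence are used in the example; the full sequence would be pasted in the same way.

```python
# Stop codons defined for translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two display blocks of the gene sequence in this record.
prefix = clean_sequence("ATGGCCCAGT CCAGTCCCAA")
print(prefix)               # ATGGCCCAGTCCAGTCCCAA
print(gc_fraction(prefix))  # 0.6
```

Applying `gc_fraction` to the complete cleaned gene sequence and rounding to a whole percentage is one way to compare against the GC content row. How the database itself rounds that value is not stated here.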