Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_4649 |
Symbol | |
ID | 7115217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | - |
Start bp | 4930283 |
End bp | 4933033 |
Gene Length | 2751 bp |
Protein Length | 916 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643527347 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_002423351 |
Protein GI | 218532535 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.427089 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGAC CCCAGCGTTC CCAACCCCTC GAACGGCGTG CCCAGGCGGC GGATACGGTG ACTTCCTCGC CCCTCGACAT CCCCGGCGCC ACGCCGATGA TGGCGCAGTA CATCGAGATC AAGGCCGCCA ATCCGGACTG CCTGCTGTTC TACCGGATGG GGGACTTCTA CGAGCTGTTC TTCGAGGATG CGGAGATCGC CTCTCGCGCG CTCGGCATCG TCCTGACCAA ACGCGGCAAG CACGGCGGCG CCGACATCCC GATGTGCGGC GTGCCCCTGG AGCGGGCCGA CGACTACCTG CATCGGCTGA TCGCGCTAGG CCACCGCGTC GCCGTCTGCC AGCAGACCGA GGACGCCGCG GAAGCCAAGA AGCGGGGCCC GAAATCGGTG GTGCGGCGCG AGGTGATCCG CCTCGTCACG CCGGGCACGC TGACCGAGGA CCGACTGCTC GACCCGACGC GGGCCAACCT GCTGCTCGCC ATCGGCCGGC GTAAGGTCTC CGACACGAGC GAGGCCTACG GGCTCGCCGC CCTCGACATC TCAACCGGCC GGTTCTCGCT GAGTGAGGTG GAGAGTTCGG AACTCGCCGC CGCGATCGCC CGACACGAAC CGCGGGAGAT CGTGCTCTCG GAGGTGATCC ACGCCGATCC AGGCCTAGCC AAGCTGTGGC GCGAGACCAA GGCGGCGGTC GTACCGCTGG CCCAGGGCGA GTTGGAACCC GCTTCGGCCG AGCGGCGCAT CCGTGAGCAG TTCGGCGTCA AGACCCTCGA CGGGTTCGGG AATTTCAGCC GGGCGGAGAT CGCCGCGGCC GGGGCGGCTT TGCTCTATCT CGAGCGCACC CAGCTCGGCA CGCGGGTGCC GCTCTCGGCG CCGAGCCGCG AGGCGACGGG GGCGACGCTC GCCATCGATG CGGCCACCCG CGCCAACCTC GAACTCACCC GCACCCTCTC GGGGGAGCGC AAGGGCAGCC TGCTCGACGC CATCGACCGC ACGGTCTCGG CCGGCGGTGC CCGGCTGCTC GCCGAGCATC TGGCCGGGCC CCTGACGGAC CTGACCAAGA TCGGACGGCG GCACGACGCC GTCGCGTTCC TGGCTGATGA CGGCGCGCTG CGGGCGCATC TGCGCGACGC GCTGCACGCC GCCCCCGACA TCGCCCGCGC GCTGTCTCGC ATCGGCCTGA ACCGGGCCGG CCCGCGCGAC CTCGCCGCCC TGCGCGACGG GCTCGACGCC GCCGCGACGA TCGCTGAGCA GCTTCGCGCG GAAGAAAATT TGCCGGACGG GCTGACCCGC CTCGCCCGCC GCCTCGACAA GGCCGACCGG GCGCTGGCCG AGGAACTGGC TCGGGCGCTC GCCGACGACC TGCCGCTCAA CCGCCGCGAC GGCAATTTCG TGCGCGCGGG CTACCACGCC GAGATCGACG AGGCTCGGCT GCTCGGCCAG GACTCGCGAA AGGTCATCGC CGCGCTCCAG GCGCGCTATG CCGAGGCCAG CGGCTGCCGG ACGCTCCGGA TCAAGCACAA CAACGTGCTC GGCTACTACA TCGAGGTGCC GCAGGCGGTG GGCGAGGCCT GCCTCAAGGG CCTGATGCAG GACTTCGTGC ATCGCCAGAC CATGGTCGAT GCGATGCGCT TCACCAGCGT CGAACTCGGC GAGCTGGAAT CGAAGATCGC GGGCGCCTCC GACCGGGTGC TGGCGCTCGA ATCCGCGGTG TTCGATACCT TGAGCGCGCG GGTGAGCGAG GGGGCGGAGG CGATCGCCGA CATCGCCGAG GCGCTGTCGG GCCTCGACGT GGCGGCGAGC CACGCCGAAC TCGCGGTCGA ACTCGCCTGG ACCCGGCCGG TCATCGACGA CAGCCTCGCC TTCGCGATCG AGGGGGGACG CCATCCGGTG GTGGAGGCGG CGCTGACCAA GGCCGGCGAG GCCTTCATCG CCAATGCCTG CGACCTGTCC AGCGACGAGG CCGGGCGCAT CCGCCTCGTC ACCGGCCCGA ACATGGGCGG CAAATCGACC TTCCTGCGCC AGAACGCGCT GATCGCCGTG CTGGCGCAGA TGGGCGCCTT CGTGCCGGCA GCCTCCGCCC GGCTCGGGGT GGTGGACCGG CTGTTCTCCC GCGTCGGCGC GGCCGACGAC CTCGCGCGCG GCCACTCGAC CTTCATGGTC GAGATGGTCG AGACCGCCGC GATCCTCAAT CAGGCGACCC GCCGGTCGCT CGTGGTGCTC GACGAGATCG GCCGCGGCAC CGCGACCTTC GACGGCCTGT CCATCGCCTG GGCCTGCCTG GAGCACCTGC ACGAGAAGAA CGGCTGCCGG GCTTTGTTCG CGACCCACTT CCACGAGCTG ACGGCGCTGA GCCAGCGCCT GCCGCGCCTC GACAACGCGA CGCTGAAGGT GGCCGAGGAC CGCGGCGACG TGGTGTTCCT GCACGAGGTG GTGCCGGGCG TGGCCGAGCG CTCCTACGGC CTCCAGGTCG CCCGCCTCGC CGGCCTGCCG CCGAGCGTCG TGGCGCGGGC CGGGGCGATC CTGAAAGGGC TGGAAAGCTC CGAGCGCGAG CGCCCCGCCC GCCGCAAGAT CGACGACCTG CCGCTCTTCG CCAGCCTCGC CGCCGCACCG CCCCCGCCAC CGGAGCCGGT GAAATCCGAG CCGGACGACC GGCTCGGCCA GTTGATCGAC CGCCTCGATC CCGATGCGCT GACGCCGCGC GAGGCGCTCG ACGCGCTTTA CCGGCTGAAG AAGGAGCGGG CTGCGGGGTA A
|
Protein sequence | MSRPQRSQPL ERRAQAADTV TSSPLDIPGA TPMMAQYIEI KAANPDCLLF YRMGDFYELF FEDAEIASRA LGIVLTKRGK HGGADIPMCG VPLERADDYL HRLIALGHRV AVCQQTEDAA EAKKRGPKSV VRREVIRLVT PGTLTEDRLL DPTRANLLLA IGRRKVSDTS EAYGLAALDI STGRFSLSEV ESSELAAAIA RHEPREIVLS EVIHADPGLA KLWRETKAAV VPLAQGELEP ASAERRIREQ FGVKTLDGFG NFSRAEIAAA GAALLYLERT QLGTRVPLSA PSREATGATL AIDAATRANL ELTRTLSGER KGSLLDAIDR TVSAGGARLL AEHLAGPLTD LTKIGRRHDA VAFLADDGAL RAHLRDALHA APDIARALSR IGLNRAGPRD LAALRDGLDA AATIAEQLRA EENLPDGLTR LARRLDKADR ALAEELARAL ADDLPLNRRD GNFVRAGYHA EIDEARLLGQ DSRKVIAALQ ARYAEASGCR TLRIKHNNVL GYYIEVPQAV GEACLKGLMQ DFVHRQTMVD AMRFTSVELG ELESKIAGAS DRVLALESAV FDTLSARVSE GAEAIADIAE ALSGLDVAAS HAELAVELAW TRPVIDDSLA FAIEGGRHPV VEAALTKAGE AFIANACDLS SDEAGRIRLV TGPNMGGKST FLRQNALIAV LAQMGAFVPA ASARLGVVDR LFSRVGAADD LARGHSTFMV EMVETAAILN QATRRSLVVL DEIGRGTATF DGLSIAWACL EHLHEKNGCR ALFATHFHEL TALSQRLPRL DNATLKVAED RGDVVFLHEV VPGVAERSYG LQVARLAGLP PSVVARAGAI LKGLESSERE RPARRKIDDL PLFASLAAAP PPPPEPVKSE PDDRLGQLID RLDPDALTPR EALDALYRLK KERAAG
|
| |