Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mnod_7101 |
Symbol | |
ID | 7302999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium nodulans ORS 2060 |
Kingdom | Bacteria |
Replicon accession | NC_011894 |
Strand | - |
Start bp | 7175154 |
End bp | 7177943 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643604653 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_002502144 |
Protein GI | 220926842 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.743761 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGATGG ACAGCGACCT CGGCCGGCGC CTGCTGCGCG ACGAGCCTCA CGAGCCGGCG GACGACGCTC CGCCTCCGCC ACGCGGACGG CGCGCCGCCG CGGCCGAGCC CGCCGCCTCG CCCATGATGG CGCAGTACAT CGAGATCAAG GCCGCCAATC CGGGCTTGCT GCTGTTCTAC CGGATGGGGG ATTTCTACGA GCTGTTCTTC GAGGATGCGG AGGTCGCCTC GCGGGCGCTC GGCATCGTGC TGACGAAGCG CGGCAAGCAT GGCGGCGCCG ACATCCCGAT GTGCGGCGTG CCGGTCGAGC GGGCGGACGA CTACCTCCAG CGCCTGATCG CGCTGGGGCA CCGGGTCGCC GTCTGCGAGC AGACCGAGGA TCCGGCCGAG GCCCGCAAGC GCGGCTCGAA ATCGGTGGTG CGCCGGGAGG TGGTGCGCCT CGTCACCCCC GGCACGATCA CGGAGGAGCG GCTCCTCGAT CCGGCCCGCG CCAACCTCCT CCTCGCCCTG GCGCGCCGCC GCGCCTCGGA GTCCGGCTGG ACCTACGGGC TCGCCGCGGT CGACATCTCG ACCGGGCGCT TCACCCTGAG CGAGATCGAT GGGCAGGGGC TCCCGGCCGA GATCGCCCGG CTGGAGCCGC GCGAGATCGT CATGGCCGAG GCGATTCACG CCGATCCGGA CCTCGCCCGG CTATGGCGCG ACACGAGTGC CGCGGTGACG CCGCTCGGGC GCGGCGAGGC CGACCCGGCC TCGGCCGAGC GGGCGCTGAA GGAGCAGTTC GGCGTCGCCA CCCTCGACGG GTTCGGCGCC TTCAGCCGCA CGGAGATCGC GGCGGCCGGG ACCGTCCTGC ACTACATCGC CCGCACGCAG CTCGGCGCCA GGGTGCCGCT GAGCCCGCCC GCCCGCCAGG GTGCCGGCGG CAGCCTGCTC ATCGATGCGG CGACGCGGAC CAATCTCGAA CTCACCCGCA CCCTGTCGGG GGAGCGGGCC GGGAGCCTGC TCGCGGCCAT CGACCGCACC GTCGGGGCGG CCGGCGCGCG GCTCCTGGCG GAGCGGCTCG CCGGCCCCTC CACCGACCTC GCGCTGATCC GCCGCCGCCA CGACGCGGTG GCCTTCCTGG TCGCCGAGGG GGCCTTGCGG GCGGAGCTCC GCGCCGACCT CGCGCGGGCG CCCGACATGG CCCGGGCGCT CTCGCGCATC GGGGTCGGGC GGGCCGGCCC GCGGGATCTC GCGGCTTTGC GCGACGGCCT CGACGCGGCC CGCAGCATCG CGACGCGGCT CGCGGGGGCC GGTGCGCTCC CGGGGGAGAT CGGCAAGGCC GCGCGGCTCC TCGCCACCGT GGGCGACGGG CTCGTCGAGA CCCTCGCCGC CGCGCTCGCC GACGAGCTGC CGCTCGTCAG GCGCGACGGC AACTTCGTGC GCGAGGGCTA CCGGGCCGAG CTCGACGAGG CGCGTGCGCT CCAGAGCGAT TCCCGCCGCT TCGTCGCAGG GCTCCAGACC CGCTACGCCG CCGAGACCGG CTGCCGGAGC TTGCGCATCA AGCACAACAA CCTGCTCGGC TTCTACATCG AGGTGCCGCA GGCGGTCGGC GAGACCCTGC TGAAGGATCC CTGGCGCGAG ACCTTCGTGC ACCGCCAGAC CATGGTGGAC GCGATGCGCT TCACCAGCGT GGAGCTGGGC GAGCTCGAAT CGCGCATCGC CAACGCGGCC GGCCGGGCGC TCGCCCTCGA ACTCGAGATC TTCGAGGCCC TCGCCGCCGC CGTGATGGAC CAGGCCGCGG CGATCAACGC GGCGGCGACG GCGCTCGCGG CCCTCGACGT GGCGGCCTCC CACGCGGAGC TCGCGGTCGA GCTCGACTGG ACGCGGCCCG TCCTCGACGA GAGCCTGACC TTCCGGGTCG AGGGCGCCCG CCACCCGGTG GTGGAGGCCG CGCTCCGGCA GGCGGGCGAG CCCTTCATCG CCAATTCCTG CGACCTGTCG GGGAGCGAGA ACGAGGCGCG CAGCGGCCGG GAGGCCGGCC AGATCCTGAT CGTCACCGGC CCGAACATGG GCGGCAAGTC GACCTTCCTG CGCCAGAACG CGCTGATCGC GGTGCTGGCC CAGATGGGGG CCTTCGTGCC GGCCCGCTCC GCCCATCTCG GCCTCGTCGA CCGGCTGTTC TCGCGGGTCG GCGCCGCCGA CGACCTCGCG CGCGGCCACT CGACCTTCAT GGTCGAGATG GTGGAGACCG CCGCGATCCT GAACCAGGCG ACGCGCCGCT CCCTCGTCGT CCTCGACGAG ATCGGGCGCG GCACCGCGAC CTTCGACGGG CTCTCGATCG CCTGGGCCTG CCTGGAGCAT CTCCACGAGG TCACGGGCTG CCGGGCGCTG TTCGCGACCC ATTTCCACGA GCTCACCGGG CTCGCGCGGC GGCTCGAGCG CCTCTCGAAC GCCACCCTGA AGGTGACCGA GTGGAAGGGC GACGTGGTGT TCCTGCACGA GGTGGTGCCG GGAGCGGCGG ACCGCTCCTA CGGCCTCCAG GTGGCCCGGC TCGCCGGCCT CCCGGCCTCG GTGATCGCCC GCGCCAAGGT GATCCTGGCC GATCTGGAGA AGGGCGATGG CGGGCGGGGC CGCCGCGCGC CGGTTGCCGA GCTGCCGCTC TTCGCCGCCC TGCCGCCGGC GCCCGAACCG CCGCCCGCAC CGAAGCCGGA CGCCCTGCGC GACCTCCTCG GCGGCCTCGA TCCGGACGGC CTCACGCCGC GCGAGGCCCT CGATGCGCTC TACCGGCTGA AGGCCGCCCG GGACGCGTGA
|
Protein sequence | MTMDSDLGRR LLRDEPHEPA DDAPPPPRGR RAAAAEPAAS PMMAQYIEIK AANPGLLLFY RMGDFYELFF EDAEVASRAL GIVLTKRGKH GGADIPMCGV PVERADDYLQ RLIALGHRVA VCEQTEDPAE ARKRGSKSVV RREVVRLVTP GTITEERLLD PARANLLLAL ARRRASESGW TYGLAAVDIS TGRFTLSEID GQGLPAEIAR LEPREIVMAE AIHADPDLAR LWRDTSAAVT PLGRGEADPA SAERALKEQF GVATLDGFGA FSRTEIAAAG TVLHYIARTQ LGARVPLSPP ARQGAGGSLL IDAATRTNLE LTRTLSGERA GSLLAAIDRT VGAAGARLLA ERLAGPSTDL ALIRRRHDAV AFLVAEGALR AELRADLARA PDMARALSRI GVGRAGPRDL AALRDGLDAA RSIATRLAGA GALPGEIGKA ARLLATVGDG LVETLAAALA DELPLVRRDG NFVREGYRAE LDEARALQSD SRRFVAGLQT RYAAETGCRS LRIKHNNLLG FYIEVPQAVG ETLLKDPWRE TFVHRQTMVD AMRFTSVELG ELESRIANAA GRALALELEI FEALAAAVMD QAAAINAAAT ALAALDVAAS HAELAVELDW TRPVLDESLT FRVEGARHPV VEAALRQAGE PFIANSCDLS GSENEARSGR EAGQILIVTG PNMGGKSTFL RQNALIAVLA QMGAFVPARS AHLGLVDRLF SRVGAADDLA RGHSTFMVEM VETAAILNQA TRRSLVVLDE IGRGTATFDG LSIAWACLEH LHEVTGCRAL FATHFHELTG LARRLERLSN ATLKVTEWKG DVVFLHEVVP GAADRSYGLQ VARLAGLPAS VIARAKVILA DLEKGDGGRG RRAPVAELPL FAALPPAPEP PPAPKPDALR DLLGGLDPDG LTPREALDAL YRLKAARDA
|
| |