Gene Mnod_7101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_7101 
Symbol 
ID7302999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp7175154 
End bp7177943 
Gene Length2790 bp 
Protein Length929 aa 
Translation table11 
GC content75% 
IMG OID643604653 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_002502144 
Protein GI220926842 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.743761 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATGG ACAGCGACCT CGGCCGGCGC CTGCTGCGCG ACGAGCCTCA CGAGCCGGCG 
GACGACGCTC CGCCTCCGCC ACGCGGACGG CGCGCCGCCG CGGCCGAGCC CGCCGCCTCG
CCCATGATGG CGCAGTACAT CGAGATCAAG GCCGCCAATC CGGGCTTGCT GCTGTTCTAC
CGGATGGGGG ATTTCTACGA GCTGTTCTTC GAGGATGCGG AGGTCGCCTC GCGGGCGCTC
GGCATCGTGC TGACGAAGCG CGGCAAGCAT GGCGGCGCCG ACATCCCGAT GTGCGGCGTG
CCGGTCGAGC GGGCGGACGA CTACCTCCAG CGCCTGATCG CGCTGGGGCA CCGGGTCGCC
GTCTGCGAGC AGACCGAGGA TCCGGCCGAG GCCCGCAAGC GCGGCTCGAA ATCGGTGGTG
CGCCGGGAGG TGGTGCGCCT CGTCACCCCC GGCACGATCA CGGAGGAGCG GCTCCTCGAT
CCGGCCCGCG CCAACCTCCT CCTCGCCCTG GCGCGCCGCC GCGCCTCGGA GTCCGGCTGG
ACCTACGGGC TCGCCGCGGT CGACATCTCG ACCGGGCGCT TCACCCTGAG CGAGATCGAT
GGGCAGGGGC TCCCGGCCGA GATCGCCCGG CTGGAGCCGC GCGAGATCGT CATGGCCGAG
GCGATTCACG CCGATCCGGA CCTCGCCCGG CTATGGCGCG ACACGAGTGC CGCGGTGACG
CCGCTCGGGC GCGGCGAGGC CGACCCGGCC TCGGCCGAGC GGGCGCTGAA GGAGCAGTTC
GGCGTCGCCA CCCTCGACGG GTTCGGCGCC TTCAGCCGCA CGGAGATCGC GGCGGCCGGG
ACCGTCCTGC ACTACATCGC CCGCACGCAG CTCGGCGCCA GGGTGCCGCT GAGCCCGCCC
GCCCGCCAGG GTGCCGGCGG CAGCCTGCTC ATCGATGCGG CGACGCGGAC CAATCTCGAA
CTCACCCGCA CCCTGTCGGG GGAGCGGGCC GGGAGCCTGC TCGCGGCCAT CGACCGCACC
GTCGGGGCGG CCGGCGCGCG GCTCCTGGCG GAGCGGCTCG CCGGCCCCTC CACCGACCTC
GCGCTGATCC GCCGCCGCCA CGACGCGGTG GCCTTCCTGG TCGCCGAGGG GGCCTTGCGG
GCGGAGCTCC GCGCCGACCT CGCGCGGGCG CCCGACATGG CCCGGGCGCT CTCGCGCATC
GGGGTCGGGC GGGCCGGCCC GCGGGATCTC GCGGCTTTGC GCGACGGCCT CGACGCGGCC
CGCAGCATCG CGACGCGGCT CGCGGGGGCC GGTGCGCTCC CGGGGGAGAT CGGCAAGGCC
GCGCGGCTCC TCGCCACCGT GGGCGACGGG CTCGTCGAGA CCCTCGCCGC CGCGCTCGCC
GACGAGCTGC CGCTCGTCAG GCGCGACGGC AACTTCGTGC GCGAGGGCTA CCGGGCCGAG
CTCGACGAGG CGCGTGCGCT CCAGAGCGAT TCCCGCCGCT TCGTCGCAGG GCTCCAGACC
CGCTACGCCG CCGAGACCGG CTGCCGGAGC TTGCGCATCA AGCACAACAA CCTGCTCGGC
TTCTACATCG AGGTGCCGCA GGCGGTCGGC GAGACCCTGC TGAAGGATCC CTGGCGCGAG
ACCTTCGTGC ACCGCCAGAC CATGGTGGAC GCGATGCGCT TCACCAGCGT GGAGCTGGGC
GAGCTCGAAT CGCGCATCGC CAACGCGGCC GGCCGGGCGC TCGCCCTCGA ACTCGAGATC
TTCGAGGCCC TCGCCGCCGC CGTGATGGAC CAGGCCGCGG CGATCAACGC GGCGGCGACG
GCGCTCGCGG CCCTCGACGT GGCGGCCTCC CACGCGGAGC TCGCGGTCGA GCTCGACTGG
ACGCGGCCCG TCCTCGACGA GAGCCTGACC TTCCGGGTCG AGGGCGCCCG CCACCCGGTG
GTGGAGGCCG CGCTCCGGCA GGCGGGCGAG CCCTTCATCG CCAATTCCTG CGACCTGTCG
GGGAGCGAGA ACGAGGCGCG CAGCGGCCGG GAGGCCGGCC AGATCCTGAT CGTCACCGGC
CCGAACATGG GCGGCAAGTC GACCTTCCTG CGCCAGAACG CGCTGATCGC GGTGCTGGCC
CAGATGGGGG CCTTCGTGCC GGCCCGCTCC GCCCATCTCG GCCTCGTCGA CCGGCTGTTC
TCGCGGGTCG GCGCCGCCGA CGACCTCGCG CGCGGCCACT CGACCTTCAT GGTCGAGATG
GTGGAGACCG CCGCGATCCT GAACCAGGCG ACGCGCCGCT CCCTCGTCGT CCTCGACGAG
ATCGGGCGCG GCACCGCGAC CTTCGACGGG CTCTCGATCG CCTGGGCCTG CCTGGAGCAT
CTCCACGAGG TCACGGGCTG CCGGGCGCTG TTCGCGACCC ATTTCCACGA GCTCACCGGG
CTCGCGCGGC GGCTCGAGCG CCTCTCGAAC GCCACCCTGA AGGTGACCGA GTGGAAGGGC
GACGTGGTGT TCCTGCACGA GGTGGTGCCG GGAGCGGCGG ACCGCTCCTA CGGCCTCCAG
GTGGCCCGGC TCGCCGGCCT CCCGGCCTCG GTGATCGCCC GCGCCAAGGT GATCCTGGCC
GATCTGGAGA AGGGCGATGG CGGGCGGGGC CGCCGCGCGC CGGTTGCCGA GCTGCCGCTC
TTCGCCGCCC TGCCGCCGGC GCCCGAACCG CCGCCCGCAC CGAAGCCGGA CGCCCTGCGC
GACCTCCTCG GCGGCCTCGA TCCGGACGGC CTCACGCCGC GCGAGGCCCT CGATGCGCTC
TACCGGCTGA AGGCCGCCCG GGACGCGTGA
 
Protein sequence
MTMDSDLGRR LLRDEPHEPA DDAPPPPRGR RAAAAEPAAS PMMAQYIEIK AANPGLLLFY 
RMGDFYELFF EDAEVASRAL GIVLTKRGKH GGADIPMCGV PVERADDYLQ RLIALGHRVA
VCEQTEDPAE ARKRGSKSVV RREVVRLVTP GTITEERLLD PARANLLLAL ARRRASESGW
TYGLAAVDIS TGRFTLSEID GQGLPAEIAR LEPREIVMAE AIHADPDLAR LWRDTSAAVT
PLGRGEADPA SAERALKEQF GVATLDGFGA FSRTEIAAAG TVLHYIARTQ LGARVPLSPP
ARQGAGGSLL IDAATRTNLE LTRTLSGERA GSLLAAIDRT VGAAGARLLA ERLAGPSTDL
ALIRRRHDAV AFLVAEGALR AELRADLARA PDMARALSRI GVGRAGPRDL AALRDGLDAA
RSIATRLAGA GALPGEIGKA ARLLATVGDG LVETLAAALA DELPLVRRDG NFVREGYRAE
LDEARALQSD SRRFVAGLQT RYAAETGCRS LRIKHNNLLG FYIEVPQAVG ETLLKDPWRE
TFVHRQTMVD AMRFTSVELG ELESRIANAA GRALALELEI FEALAAAVMD QAAAINAAAT
ALAALDVAAS HAELAVELDW TRPVLDESLT FRVEGARHPV VEAALRQAGE PFIANSCDLS
GSENEARSGR EAGQILIVTG PNMGGKSTFL RQNALIAVLA QMGAFVPARS AHLGLVDRLF
SRVGAADDLA RGHSTFMVEM VETAAILNQA TRRSLVVLDE IGRGTATFDG LSIAWACLEH
LHEVTGCRAL FATHFHELTG LARRLERLSN ATLKVTEWKG DVVFLHEVVP GAADRSYGLQ
VARLAGLPAS VIARAKVILA DLEKGDGGRG RRAPVAELPL FAALPPAPEP PPAPKPDALR
DLLGGLDPDG LTPREALDAL YRLKAARDA