Gene Information |
Locus tag | Rmet_1075 |
Symbol | |
ID | 4037872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007973 |
Strand | - |
Start bp | 1166857 |
End bp | 1169607 |
Gene Length | 2751 bp |
Protein Length | 916 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637976456 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_583230 |
Protein GI | 94310020 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
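
The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends with a single stop codon. Neither assumption is stated in the record itself. The helper names `span_length` and `coding_codons` are hypothetical.

```python
# Values copied from the Gene Information table for Rmet_1075.
START_BP = 1166857
END_BP = 1169607
GENE_LENGTH_BP = 2751
PROTEIN_LENGTH_AA = 916


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print("span matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("codons match protein length:",
          coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span from Start bp to End bp equals the listed gene length, and the gene length minus one stop codon gives the listed protein length.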
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.835373 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.410772 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGAAG CCCTATCCGT ACCTGCCGCT GAAGGCGAGA ACACCGTGAC CGCGAGTGAA TCGCCCGACC TTGCCGCGAC TTCCGCCAGG GCCGAGAAAG TCGGCAAGCA GGAAAAGCCC GAGAAAGCCG AGAAGCAGTC GCCCATGATG CTGCAGTACC ACCGCATCAA GGCCGACCAC CCGGACACGC TCCTTTTCTA CCGGATGGGT GACTTCTATG AGCTATTTCA TGACGATGCC GAGAAGGCCG CGCGATTGCT CGACATCACG CTGACCGCGC GCGGCCAGTC TGGCGGTGTG CCAATCCGTA TGGCCGGGAT CCCATTCCAT TCGGCTGACC AGTATCTGGC GCGCCTGGTA AAGCTTGGCG AATCCGTGGC GATCTGCGAA CAGATTGGTG ACCCAGCGAC CAGCAAGGGC CCGGTCGAGC GCAAAGTGGT GCGCATCGTT ACGCCCGGCA CGCTGACCGA TGCCGCCCTG CTGCCAGACA AGTCCGACAC GTTCCTGTTG GCCGTGCACC AGCAGACCAC ACGCCGCGGC GTCAGCAAGA CGGGCCTGGC ATGGCTGAAC CTGGCCAGTG GCGAACTGCG ACTCATGGAG TGCGACGCCG CGCAGTTGTC CCGCGAACTC GAGCGTATCC GTCCTGCCGA GGTGCTGTAC ACCGACGGCA TGGACCTTCC GACGCTGACA TGCGCGCGCA CCCGGTTGCC GGAATGGCAC TTTGACCAGG AAGCCGGTAC TCGCCGCCTG CGCGAACAGA TCGGCGTGGC CAGCCTGGAG CCGTTCGGCT GCTCGGGCCT GGGTGCCGCC CTAGGCGCTG CCGGCGCGCT GCTCAACTAT GCAGCCACCA CGCAAGGCCA GTCTCTGCGG CACGTGCAGG GCGTCACTGT CGAGCGGGAA TCCGAATTTG TCGGCCTCGA TTCGGCAACG CGCCGCAACC TCGAACTGAC CGAAACGCTG CGTGGCACCG AATCGCCAAC GCTCTTCTCG CTGCTCGACA CATGCGCGAC GACGATGGGC AGCCGCGCGC TGCGCCACTG GCTGCATCAT CCGTTGCGCG ACCCGGCCCT GCCGCGAGCG CGTCAGCAGG CCATTGGGGC GCTGATCACA CAAGGGCCGG ACGGCTTGCG TGCCGTGCTG CGCAAGCTTG CCGACGTCGA GCGGATCACC GCGCGACTCG CGTTGCTGTC GGCACGTCCG CGCGATCTGT CCTCGCTGCG CGACACCCTG CGTGCCCTGC CCGATGTGCA GGCCGCCACG GTCAGCTCAG AGGATGCGCC GCTGCTGGCG CAGACCCTTG AGGAAATCGA CATTCCGCAG GACTGCCTGG AACTGTTGAT CCGCGCCGTG GCGGATGAAC CCTCAACGGT GATTCGCGAC GGCGGCGTCA TCGCGCGCGG CTACGACAGT GAGCTCGACG AACTGCGCGA TATCTCGGAA AACTGCGGTC AATTCCTGAT CGACCTGGAA ACCCGGGAGC GCGAGCGCAC AGGCATCACG AACCTGCGCG TCGAATACAA CCGTGTTCAT GGCTTCTATA TCGAAGTCAC CAACGGGCAG GCCGACAAGG TGCCCGACGA CTATCGACGC CGGCAGACCC TCAAGAACGC CGAACGCTAC ATCACACCCG AGTTGAAAGC GTTCGAGGAC AAGGCGCTCT CGGCACAGGA TCGCGCGCTG GCTCGCGAGA AGCAGCTCTA CGACGGTCTG CTGCAGGCCC TGCTGCCGCA TATCGGCAGT CTGCGCCGCG TGGCCGGCGC GCTCGCCCGC CTCGACGTGC TTGCCACACT GGCCGAACGT 
GCAAAAACAC TCGACTGGGT CCAGCCAGAA CGCGTCCAGG AGAACGTCAT CGACATCTCG CAGGGCCGCC ATCCCGTCGT GGAAGGTCAG CTGGCCGCGG AGTCCGTCGC GTTCATCGCA AACGACTGTC AGTTGAACGA GGCCCGCAAG CTGCTGCTGA TCACCGGCCC GAACATGGGT GGTAAGTCGA CGTTCATGCG CCAGACCGCA TTGATCGTTC TGCTGGCCTG CGTCGGCGCC TGGGTGCCGG CACGCCGTGC CGTGATTGGC CCGGTGGACC GCATCTTCAC GCGTATTGGG GCCGCCGACG ACCTGGCGGG CGGTCGTTCG ACCTTCATGG TCGAAATGAC CGAAGCCGCA GCCATCCTGC ACAACGCCAC GCCATCCAGC CTGGTGCTGA TGGATGAGAT CGGCCGCGGT ACTTCGACCT TCGACGGTCT CGCCCTCGCG TGGGCCATCG CGCGTCATCT GCTATCCCAC AACCGCAGCC ACACACTGTT TGCCACCCAT TATTTCGAGC TGACGCAACT GCCAGTCGAG TTCCCGCAGG CTGCCAACGT GCACCTGTCA GCCGTGGAGC ATGGCGATGG CATCGTATTC CTGCATGCGG TGCAGGACGG TCCGGCCAGC CAGAGCTACG GGCTGCAGGT GGCCCAACTG GCGGGTGTTC CGCAACCGGT CATTCGCGCA GCGCGCAAGC ATCTCGCGTG GCTGGAAGAA CAATCGGCGG ACGCCACGCC CACGCCGCAG ATGGATCTGT TCTCGGCGCA GTCTTCACCG TCCGCAGACG ATGAAGATGA CAAATCGGCA GGCCAGTCCG CCGTGCCGCC CGCGCAGGCT GCCACGCTGG AAGCGCTGGC CGATATCGAC CCGGACAGCC TCTCGCCGCG AGAAGCGCTC GAGGCGCTGT ACCGACTCAA GTCCATCTCG GAGACCGCGC GAACGGTATG A
|
Protein sequence | MSEALSVPAA EGENTVTASE SPDLAATSAR AEKVGKQEKP EKAEKQSPMM LQYHRIKADH PDTLLFYRMG DFYELFHDDA EKAARLLDIT LTARGQSGGV PIRMAGIPFH SADQYLARLV KLGESVAICE QIGDPATSKG PVERKVVRIV TPGTLTDAAL LPDKSDTFLL AVHQQTTRRG VSKTGLAWLN LASGELRLME CDAAQLSREL ERIRPAEVLY TDGMDLPTLT CARTRLPEWH FDQEAGTRRL REQIGVASLE PFGCSGLGAA LGAAGALLNY AATTQGQSLR HVQGVTVERE SEFVGLDSAT RRNLELTETL RGTESPTLFS LLDTCATTMG SRALRHWLHH PLRDPALPRA RQQAIGALIT QGPDGLRAVL RKLADVERIT ARLALLSARP RDLSSLRDTL RALPDVQAAT VSSEDAPLLA QTLEEIDIPQ DCLELLIRAV ADEPSTVIRD GGVIARGYDS ELDELRDISE NCGQFLIDLE TRERERTGIT NLRVEYNRVH GFYIEVTNGQ ADKVPDDYRR RQTLKNAERY ITPELKAFED KALSAQDRAL AREKQLYDGL LQALLPHIGS LRRVAGALAR LDVLATLAER AKTLDWVQPE RVQENVIDIS QGRHPVVEGQ LAAESVAFIA NDCQLNEARK LLLITGPNMG GKSTFMRQTA LIVLLACVGA WVPARRAVIG PVDRIFTRIG AADDLAGGRS TFMVEMTEAA AILHNATPSS LVLMDEIGRG TSTFDGLALA WAIARHLLSH NRSHTLFATH YFELTQLPVE FPQAANVHLS AVEHGDGIVF LHAVQDGPAS QSYGLQVAQL AGVPQPVIRA ARKHLAWLEE QSADATPTPQ MDLFSAQSSP SADDEDDKSA GQSAVPPAQA ATLEALADID PDSLSPREAL EALYRLKSIS ETARTV