Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2495 |
Symbol | |
ID | 8137836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2918840 |
End bp | 2921449 |
Gene Length | 2610 bp |
Protein Length | 869 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644870104 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_003022295 |
Protein GI | 253701106 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 87 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAA TGACGCCCAT GATGCGGCAA TTCCTTGAGA TCAAGGCCGA ACACCCCGAC GCGATCCTTT TCTTCAGATG CGGCGACTTC TACGAGATGT TCCTGGACGA CGCGGTCAAA GCCTCCCGCA TCCTCGGCAT CACCCTCACC TCGCGCAACA AGAACGCCGA CGGCTCCGAG GTTCCTCTCT GCGGCATCCC CTACCACTCC TGCGCCCCAT ACATCGCGAA ACTGGTCGAG GCAGGGGAGA AGGTCGCCAT CTGCGAACAG GCCGAGGACC CGAAGCAGGC CAAGGGAATC GTCAAGCGCG AGGTGGTGAA GGTGATCACC CCAGGGCTCG TTATCGAGGA CGCCTCGCTC TCCCCCAAGG AAAACAATTA CCTGCTGGCG CTTTGCTGTG ACGGCGAGTG CTACGGGCTC TCCTACCTCG ACCTCTCCAC CGGAGAATTC AGAGTGACCG AGCTGGATGG GCTCCAAGCC GCCCTGGCAG AGGTGACCTG CATAGGTCCC CGCGAAATCA TCCTCCCGGT CTGCTTCCGT GAGGAACCGA AGCGGAAAGA AGTCGCCCAC GTCATCGTGG ACCGCAGCAT CACCTATTTC GAGGAATGGG TCTACGACCC GGACTACTGC AAGCGACTGG TAGCGAACCA GTTCAAGGGG GCGACTGCCG AGTCGCTTGG ATGCGACCGC CTCCCCACCG CCCTTCTTGC CGCAGGCGCG GTATTGCACT ACCTGGTGGA CACGCAAAAG GGGCACGCCC CCCATGTAAC CTGCATCACT CCGTACAACG AAAGCGAGCA TCTCCTCCTC GACGAATCGA CCAGGAGGAA CCTGGAACTC ACCGCTACCC TCTCGGAAGG AAAGCGCAAG GGGTCGCTTC TGGGGCTTAT GGACCGCACC GTCACCGCCA TGGGAGGCCG CAAGCTCAAG CAGTGGATCA ACTACCCGCT CATGGATCTG AAGAAGATCT GGCTGCGGCA GGACGCGATC CAGGAACTGA TGGAGGCGCC CGGCACCCGT GAAGCGATGA AGTCGCTCCT GGCCGGGGTC TACGACCTCG AACGCCTGAA CGGCCGCATC AGTCTCGCCT CCGCCTCCGC CAAGGATCTC TCCGCCCTCA GGTCGTCGCT CAGCCGTCTT CCCGCCATCA AGGAGCAGGT AGCCGCCTGC GGCGCAGGGC TGCTCAAGGA GTTGGATGCC GGCATCGACC CGCTCGACGA ACTGAGCGAC CTGATCTCCA GCGCCATCGT TGACGACCCT CCTTTTGTAT TGCGCGACGG CGGCATCATC GCGGACGGCT ACAACCAGGA ACTCGACGAA TTACGGGCCA TCAGCCGCGA GGGAAAGGGG TTCATCGCCA GGCTGGAGGC ACAGGAGAAG GGGCGGACCG GGATCTCGTC GCTCAAGATC CGCTACAACA AGGTCTTCGG CTACTACATC GAGGTGACCA AGGCGAACGT CTCGGCCATC CCCGACGACT ACATCAGAAG ACAGACCCTC GCCAACGCCG AGCGGTACAT CACCCCCGAA CTGAAGGAGT ACGAGGAGAA GGTGCTCGGG GCCGAGGACC GGATCAAGGA CCTGGAATTC TCGCTCTTCC AGGAAGTCCG AGAGGCAGCC GCGGCTCAAG GGGAGCGGAT CGCCCGCAGC GCCGACCGGC TCGCCTGCCT GGACGTCCTG GTAAGCCTCT CCGAACTGGC CCACGACAAG GGGTACTGCC GACCCGAGGT GCACGAGGGG AGCGAACTCA GCATCACCGA GGGGAGGCAC CCGGTCATCG AGGACATGCA CTCCGCCGAG CGCTTCGTCC CCAACGACAC GCTGCTGGAC AACGGGGAGA ACCAGCTCAT CATCATCACC GGCCCCAACA TGGCCGGTAA ATCGACCTTC ATGCGCCAGG TGGCCCTCAT CTCGCTGATG GCCCAGATGG GGAGCTTCGT GCCGGCGGAC AAGGCGCTCA TTCCGCTTGT TGACCGCATC TTCACCAGGG TGGGCGCATC CGACAACCTG GCCCGCGGCC ACTCCACCTT CATGGTCGAG ATGATGGAGA GCGCCGCCAT CCTGAGGGGA GCCACCGCCA GGAGCCTGGT CATCCTGGAC GAGATCGGGC GCGGCACCTC CACCTTCGAC GGCGTCTCCA TCGCCTGGGC GGTCGCGGAA TTCCTGCACG ACAACAAGAT ACACGCGGCC AAGACCCTCT TCGCCACCCA CTACCACGAA CTCACCGAAC TCGCGGTCAC CCGCCCCGGC ATCAAGAACT TCAACATAGC CGTCAGGGAA TGGAACGAGC GCATCATCTT CTTGAGGAAG ATCGTTCCCG GTGGGGCCTC ACACTCCTAC GGCATCCAGG TAGCCCGCTT GGCCGGTCTC CCCCAGGCGG TGATCGACCG GGCCAAGGAG ATCTTGATCA ACCTGGAAAA GGGGGAGTAC GGCGAAGGAG GGGTACCGCG CCTGGCGCGC GGCAAGAAGA CCCCTCCCCC GTCACCGCAG CTCTCGCTCT TTGGCGCGGG CGACGACCAG ATCAGGGAGA GGCTCAAGGA AATCGAAGTG GCGCTCCTGA CCCCCCTGGA GGCGCTGAAC CTCGTGGACG AACTGAAGAG GATGATCTAG
|
Protein sequence | MSEMTPMMRQ FLEIKAEHPD AILFFRCGDF YEMFLDDAVK ASRILGITLT SRNKNADGSE VPLCGIPYHS CAPYIAKLVE AGEKVAICEQ AEDPKQAKGI VKREVVKVIT PGLVIEDASL SPKENNYLLA LCCDGECYGL SYLDLSTGEF RVTELDGLQA ALAEVTCIGP REIILPVCFR EEPKRKEVAH VIVDRSITYF EEWVYDPDYC KRLVANQFKG ATAESLGCDR LPTALLAAGA VLHYLVDTQK GHAPHVTCIT PYNESEHLLL DESTRRNLEL TATLSEGKRK GSLLGLMDRT VTAMGGRKLK QWINYPLMDL KKIWLRQDAI QELMEAPGTR EAMKSLLAGV YDLERLNGRI SLASASAKDL SALRSSLSRL PAIKEQVAAC GAGLLKELDA GIDPLDELSD LISSAIVDDP PFVLRDGGII ADGYNQELDE LRAISREGKG FIARLEAQEK GRTGISSLKI RYNKVFGYYI EVTKANVSAI PDDYIRRQTL ANAERYITPE LKEYEEKVLG AEDRIKDLEF SLFQEVREAA AAQGERIARS ADRLACLDVL VSLSELAHDK GYCRPEVHEG SELSITEGRH PVIEDMHSAE RFVPNDTLLD NGENQLIIIT GPNMAGKSTF MRQVALISLM AQMGSFVPAD KALIPLVDRI FTRVGASDNL ARGHSTFMVE MMESAAILRG ATARSLVILD EIGRGTSTFD GVSIAWAVAE FLHDNKIHAA KTLFATHYHE LTELAVTRPG IKNFNIAVRE WNERIIFLRK IVPGGASHSY GIQVARLAGL PQAVIDRAKE ILINLEKGEY GEGGVPRLAR GKKTPPPSPQ LSLFGAGDDQ IRERLKEIEV ALLTPLEALN LVDELKRMI
|
| |