Gene GM21_2495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2495 
Symbol 
ID8137836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2918840 
End bp2921449 
Gene Length2610 bp 
Protein Length869 aa 
Translation table11 
GC content63% 
IMG OID644870104 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_003022295 
Protein GI253701106 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones87 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAA TGACGCCCAT GATGCGGCAA TTCCTTGAGA TCAAGGCCGA ACACCCCGAC 
GCGATCCTTT TCTTCAGATG CGGCGACTTC TACGAGATGT TCCTGGACGA CGCGGTCAAA
GCCTCCCGCA TCCTCGGCAT CACCCTCACC TCGCGCAACA AGAACGCCGA CGGCTCCGAG
GTTCCTCTCT GCGGCATCCC CTACCACTCC TGCGCCCCAT ACATCGCGAA ACTGGTCGAG
GCAGGGGAGA AGGTCGCCAT CTGCGAACAG GCCGAGGACC CGAAGCAGGC CAAGGGAATC
GTCAAGCGCG AGGTGGTGAA GGTGATCACC CCAGGGCTCG TTATCGAGGA CGCCTCGCTC
TCCCCCAAGG AAAACAATTA CCTGCTGGCG CTTTGCTGTG ACGGCGAGTG CTACGGGCTC
TCCTACCTCG ACCTCTCCAC CGGAGAATTC AGAGTGACCG AGCTGGATGG GCTCCAAGCC
GCCCTGGCAG AGGTGACCTG CATAGGTCCC CGCGAAATCA TCCTCCCGGT CTGCTTCCGT
GAGGAACCGA AGCGGAAAGA AGTCGCCCAC GTCATCGTGG ACCGCAGCAT CACCTATTTC
GAGGAATGGG TCTACGACCC GGACTACTGC AAGCGACTGG TAGCGAACCA GTTCAAGGGG
GCGACTGCCG AGTCGCTTGG ATGCGACCGC CTCCCCACCG CCCTTCTTGC CGCAGGCGCG
GTATTGCACT ACCTGGTGGA CACGCAAAAG GGGCACGCCC CCCATGTAAC CTGCATCACT
CCGTACAACG AAAGCGAGCA TCTCCTCCTC GACGAATCGA CCAGGAGGAA CCTGGAACTC
ACCGCTACCC TCTCGGAAGG AAAGCGCAAG GGGTCGCTTC TGGGGCTTAT GGACCGCACC
GTCACCGCCA TGGGAGGCCG CAAGCTCAAG CAGTGGATCA ACTACCCGCT CATGGATCTG
AAGAAGATCT GGCTGCGGCA GGACGCGATC CAGGAACTGA TGGAGGCGCC CGGCACCCGT
GAAGCGATGA AGTCGCTCCT GGCCGGGGTC TACGACCTCG AACGCCTGAA CGGCCGCATC
AGTCTCGCCT CCGCCTCCGC CAAGGATCTC TCCGCCCTCA GGTCGTCGCT CAGCCGTCTT
CCCGCCATCA AGGAGCAGGT AGCCGCCTGC GGCGCAGGGC TGCTCAAGGA GTTGGATGCC
GGCATCGACC CGCTCGACGA ACTGAGCGAC CTGATCTCCA GCGCCATCGT TGACGACCCT
CCTTTTGTAT TGCGCGACGG CGGCATCATC GCGGACGGCT ACAACCAGGA ACTCGACGAA
TTACGGGCCA TCAGCCGCGA GGGAAAGGGG TTCATCGCCA GGCTGGAGGC ACAGGAGAAG
GGGCGGACCG GGATCTCGTC GCTCAAGATC CGCTACAACA AGGTCTTCGG CTACTACATC
GAGGTGACCA AGGCGAACGT CTCGGCCATC CCCGACGACT ACATCAGAAG ACAGACCCTC
GCCAACGCCG AGCGGTACAT CACCCCCGAA CTGAAGGAGT ACGAGGAGAA GGTGCTCGGG
GCCGAGGACC GGATCAAGGA CCTGGAATTC TCGCTCTTCC AGGAAGTCCG AGAGGCAGCC
GCGGCTCAAG GGGAGCGGAT CGCCCGCAGC GCCGACCGGC TCGCCTGCCT GGACGTCCTG
GTAAGCCTCT CCGAACTGGC CCACGACAAG GGGTACTGCC GACCCGAGGT GCACGAGGGG
AGCGAACTCA GCATCACCGA GGGGAGGCAC CCGGTCATCG AGGACATGCA CTCCGCCGAG
CGCTTCGTCC CCAACGACAC GCTGCTGGAC AACGGGGAGA ACCAGCTCAT CATCATCACC
GGCCCCAACA TGGCCGGTAA ATCGACCTTC ATGCGCCAGG TGGCCCTCAT CTCGCTGATG
GCCCAGATGG GGAGCTTCGT GCCGGCGGAC AAGGCGCTCA TTCCGCTTGT TGACCGCATC
TTCACCAGGG TGGGCGCATC CGACAACCTG GCCCGCGGCC ACTCCACCTT CATGGTCGAG
ATGATGGAGA GCGCCGCCAT CCTGAGGGGA GCCACCGCCA GGAGCCTGGT CATCCTGGAC
GAGATCGGGC GCGGCACCTC CACCTTCGAC GGCGTCTCCA TCGCCTGGGC GGTCGCGGAA
TTCCTGCACG ACAACAAGAT ACACGCGGCC AAGACCCTCT TCGCCACCCA CTACCACGAA
CTCACCGAAC TCGCGGTCAC CCGCCCCGGC ATCAAGAACT TCAACATAGC CGTCAGGGAA
TGGAACGAGC GCATCATCTT CTTGAGGAAG ATCGTTCCCG GTGGGGCCTC ACACTCCTAC
GGCATCCAGG TAGCCCGCTT GGCCGGTCTC CCCCAGGCGG TGATCGACCG GGCCAAGGAG
ATCTTGATCA ACCTGGAAAA GGGGGAGTAC GGCGAAGGAG GGGTACCGCG CCTGGCGCGC
GGCAAGAAGA CCCCTCCCCC GTCACCGCAG CTCTCGCTCT TTGGCGCGGG CGACGACCAG
ATCAGGGAGA GGCTCAAGGA AATCGAAGTG GCGCTCCTGA CCCCCCTGGA GGCGCTGAAC
CTCGTGGACG AACTGAAGAG GATGATCTAG
 
Protein sequence
MSEMTPMMRQ FLEIKAEHPD AILFFRCGDF YEMFLDDAVK ASRILGITLT SRNKNADGSE 
VPLCGIPYHS CAPYIAKLVE AGEKVAICEQ AEDPKQAKGI VKREVVKVIT PGLVIEDASL
SPKENNYLLA LCCDGECYGL SYLDLSTGEF RVTELDGLQA ALAEVTCIGP REIILPVCFR
EEPKRKEVAH VIVDRSITYF EEWVYDPDYC KRLVANQFKG ATAESLGCDR LPTALLAAGA
VLHYLVDTQK GHAPHVTCIT PYNESEHLLL DESTRRNLEL TATLSEGKRK GSLLGLMDRT
VTAMGGRKLK QWINYPLMDL KKIWLRQDAI QELMEAPGTR EAMKSLLAGV YDLERLNGRI
SLASASAKDL SALRSSLSRL PAIKEQVAAC GAGLLKELDA GIDPLDELSD LISSAIVDDP
PFVLRDGGII ADGYNQELDE LRAISREGKG FIARLEAQEK GRTGISSLKI RYNKVFGYYI
EVTKANVSAI PDDYIRRQTL ANAERYITPE LKEYEEKVLG AEDRIKDLEF SLFQEVREAA
AAQGERIARS ADRLACLDVL VSLSELAHDK GYCRPEVHEG SELSITEGRH PVIEDMHSAE
RFVPNDTLLD NGENQLIIIT GPNMAGKSTF MRQVALISLM AQMGSFVPAD KALIPLVDRI
FTRVGASDNL ARGHSTFMVE MMESAAILRG ATARSLVILD EIGRGTSTFD GVSIAWAVAE
FLHDNKIHAA KTLFATHYHE LTELAVTRPG IKNFNIAVRE WNERIIFLRK IVPGGASHSY
GIQVARLAGL PQAVIDRAKE ILINLEKGEY GEGGVPRLAR GKKTPPPSPQ LSLFGAGDDQ
IRERLKEIEV ALLTPLEALN LVDELKRMI