Gene Information |
Locus tag | GM21_1412 |
Symbol | |
ID | 8136740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1660477 |
End bp | 1662828 |
Gene Length | 2352 bp |
Protein Length | 783 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644869026 |
Product | MutS2 family protein |
Protein accession | YP_003021229 |
Protein GI | 253700040 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
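The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminating stop codon.
    return length_bp // 3 - 1


# Values taken from the Gene Information table of this record.
length = gene_length_bp(1660477, 1662828)
protein = expected_protein_length_aa(length)
print(length, protein)
```

Under these assumptions the computed values agree with the Gene Length (2352 bp) and Protein Length (783 aa) fields.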
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 2.29197e-17 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGATCAGCA CCGATACCCT CAAGCGGCTG GAGTTCGACA AGATCCTCGA CACCGTCGCT TCTTACGCAC ACTGCGACGC CTCTCACCTA GGGGCGCTAT CCATAACGCC CCTCTCGGCA CGGGACGAGA TCGAACTCCG CCTGGGCCTC GTCGAAGAGG TGCGCAAGCT AACCCGGTTC GGCATCGCCC TCAAGCTCTC GGAGTTCGAG GACATCACCC CCCAGGTCAG AGCGGTCCGC CCGACAGGCT CGGTGATCTC GCCGCTGGAG TTGCAGCGCT TCATCCCGAC ACTGAGGGTG ATGGGAGCCA TCTCCGCGCA GCTCGGTTTT CGCACCGACG TCCCCCTGCT CACCTCGCAG GCCGGTTCCA TCACCGGATT CCCGGACCTC TTGAATCCGC TGGAGCACAC GGTGAACGAG GAAGGGGAGA TCCTCGACAC CGCCTCCAGG CTCCTGGCCG ACATCCGCGG CCGCAAGAAG GGGCTCACCG CCCGCATCAA GAAAAGGCTG GAGGAGATCG TCCGCGAGCG GCACACCGCC ATCTTTTTGC AGGACGACTT CATCACCCAG AGGTCCGGAC GCTGGGTCAT CCCGGTACGC ATGGACTCGA AGGGGATGGT CCCCGGGGTC GTTCACGACG TCTCCAATTC GGGCGAGACC GCCTTCATGG AACCTTTGGA GATCATCGGG CTCGCCAACG AGCTGGAGAA CCTCGTCGCC GACGAGAGGG CCGAGGAGAT CAGGATCGTC AGGCAGATCT GCAACTGGAT CCGTGAGGAC GCGGAGCAGA TCCTGGAGCA GTTCGAGGCG CTGGTGCGCA TGGACATCCT CAACTGCATC GCCACCCTGA GCGACAAGCT CAGGAGCGAG ACGCCTGTCA TCTCGCCTTC CCCCGCCATA CTGCTTAAGT CGGCGCGCCA CCCGATCCTC ACCCTGATGG GAAAGGAGGT GGTCCCCCTC GACCTGGAAC TCGCCGCCGA CAACAGGGTC ATGGTGGTCA CGGGCCCCAA CACCGGAGGC AAGACCATCG CCATCAAGAG CGCGGGGCTC CTCTGCGTCA TGGCCCTTTG CGGCATGCCG GTCCCGGCGC TTCCCGGCAC CGTGCTCCCG CTGGTGGAGA GCATTCTGGT CGACATCGGG GACGAGCAGT CCATAGAGGA GAGCCTTTCG ACCTTCTCGG CCCACATCTC GAAGATCTCC AACATCATCG AGCAAGCAGG CCAAGGGGCG CTGGTGCTCC TGGACGAGTT GGGGACCGGC ACCGAGCCGG GCCAAGGCGC GGCCATCGCC TGCGCGGTCC TCAAGGAGTT GCAGGAGAAG GGGGCGCTGG TGGTCGCCAC CACGCACCTC ACCGAGATCA TCGGCTTCGT GCAGCGCGAG GAGGGAATGA TGAACGCGGC CATGGCGTTC GACCGCGACA GGCTGGCGCC GCTCTACCGG CTCGTGGTCG GGGAACCGGG GGAATCGCAC GCTCTCGAGA TAGCCAGCCG CTACGGCCTG CCGGATCGCG TGGTGCGCTT CGCCCGAGGG ATGATCGGCA CCATGGAGGC CGATTTCCAT GCCCTTTTGC GCGACCTCAA GGATAAGCGG GCCCAACTGG AGCGCGCCCT GGAGGATATG GCCGAGAGGG AGGAAAAGGT TTCCTTCGCC GAGCGGAACC TCGTGGACCG CCGGGATGAG GCGGCCCAAC TGGTAAAGGA CGCCAAGGAA AAGGGGTTAT TGGAAGCGCA GCAGATCATC TGGAAGGCCA AGCGAGAGGT GGCTGCCCTT CTGGAGGAGG CCAAGCGCGA GAAGACGAAG 
ACGCGGGAGG CGAAGGAAAA GCTCGACCAG GCGGCGAGCG AATTGGAGCA GGCGCTGGAA GAGTTGCACC CCGAGGAGAA CGTGGACCCG GAAAAGGTGG CGGCGGGAGA CGTTCTTTTC GTCAAGCCGC TCAACTGCGA CGCCACCGTC CTCGCCATCG ATACGCGCTC AGGCAAGGCG CGGGTCCGTG CCGGGAGCAT GGAAATGGAG GTGCAGGTCA ATTCGTTGCT AAAGCCCAAA GGCAGAGAGC CGAAGAAGGT ACAGAAGCGT CGCGAGAAGC AGCAGGCCCA AGAGCAGGAG CGCGCCGAGC CGGCCTCCAC CATCAACCTC CTGGGGATGC GAGTGGAAGA GGCGGTAGGC GTCCTGGAAC CGTTCCTGAA CCACGCGGCG CTGGACCGGA TCCAGGAAGT GCACATCGTG CACGGCAAGG GAACCGGCGC GCTGATGAAG GGGGTGCGGA GCTACCTGGC CGACCACCCG CTGGTTGCCT CCTTCCGCAC CGGCGAGCGG TACGAGGGGG GCGACGGGGT GACGGTGGTG ACCCTGCGCT GA
Protein sequence | MISTDTLKRL EFDKILDTVA SYAHCDASHL GALSITPLSA RDEIELRLGL VEEVRKLTRF GIALKLSEFE DITPQVRAVR PTGSVISPLE LQRFIPTLRV MGAISAQLGF RTDVPLLTSQ AGSITGFPDL LNPLEHTVNE EGEILDTASR LLADIRGRKK GLTARIKKRL EEIVRERHTA IFLQDDFITQ RSGRWVIPVR MDSKGMVPGV VHDVSNSGET AFMEPLEIIG LANELENLVA DERAEEIRIV RQICNWIRED AEQILEQFEA LVRMDILNCI ATLSDKLRSE TPVISPSPAI LLKSARHPIL TLMGKEVVPL DLELAADNRV MVVTGPNTGG KTIAIKSAGL LCVMALCGMP VPALPGTVLP LVESILVDIG DEQSIEESLS TFSAHISKIS NIIEQAGQGA LVLLDELGTG TEPGQGAAIA CAVLKELQEK GALVVATTHL TEIIGFVQRE EGMMNAAMAF DRDRLAPLYR LVVGEPGESH ALEIASRYGL PDRVVRFARG MIGTMEADFH ALLRDLKDKR AQLERALEDM AEREEKVSFA ERNLVDRRDE AAQLVKDAKE KGLLEAQQII WKAKREVAAL LEEAKREKTK TREAKEKLDQ AASELEQALE ELHPEENVDP EKVAAGDVLF VKPLNCDATV LAIDTRSGKA RVRAGSMEME VQVNSLLKPK GREPKKVQKR REKQQAQEQE RAEPASTINL LGMRVEEAVG VLEPFLNHAA LDRIQEVHIV HGKGTGALMK GVRSYLADHP LVASFRTGER YEGGDGVTVV TLR