Gene GM21_1412 details


Gene Information

Locus tag: GM21_1412
Symbol:
ID: 8136740
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1660477
End bp: 1662828
Gene Length: 2352 bp
Protein Length: 783 aa
Translation table: 11
GC content: 65%
IMG OID: 644869026
Product: MutS2 family protein
Protein accession: YP_003021229
Protein GI: 253700040
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
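
The start, end, gene length, and protein length fields are mutually consistent, which a little arithmetic confirms. The sketch below is a minimal check, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon (the record itself states neither). The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one uncounted terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1660477, 1662828)
print(length)                  # 2352
print(protein_length(length))  # 783
```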


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 2.29197e-17
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATCAGCA CCGATACCCT CAAGCGGCTG GAGTTCGACA AGATCCTCGA CACCGTCGCT 
TCTTACGCAC ACTGCGACGC CTCTCACCTA GGGGCGCTAT CCATAACGCC CCTCTCGGCA
CGGGACGAGA TCGAACTCCG CCTGGGCCTC GTCGAAGAGG TGCGCAAGCT AACCCGGTTC
GGCATCGCCC TCAAGCTCTC GGAGTTCGAG GACATCACCC CCCAGGTCAG AGCGGTCCGC
CCGACAGGCT CGGTGATCTC GCCGCTGGAG TTGCAGCGCT TCATCCCGAC ACTGAGGGTG
ATGGGAGCCA TCTCCGCGCA GCTCGGTTTT CGCACCGACG TCCCCCTGCT CACCTCGCAG
GCCGGTTCCA TCACCGGATT CCCGGACCTC TTGAATCCGC TGGAGCACAC GGTGAACGAG
GAAGGGGAGA TCCTCGACAC CGCCTCCAGG CTCCTGGCCG ACATCCGCGG CCGCAAGAAG
GGGCTCACCG CCCGCATCAA GAAAAGGCTG GAGGAGATCG TCCGCGAGCG GCACACCGCC
ATCTTTTTGC AGGACGACTT CATCACCCAG AGGTCCGGAC GCTGGGTCAT CCCGGTACGC
ATGGACTCGA AGGGGATGGT CCCCGGGGTC GTTCACGACG TCTCCAATTC GGGCGAGACC
GCCTTCATGG AACCTTTGGA GATCATCGGG CTCGCCAACG AGCTGGAGAA CCTCGTCGCC
GACGAGAGGG CCGAGGAGAT CAGGATCGTC AGGCAGATCT GCAACTGGAT CCGTGAGGAC
GCGGAGCAGA TCCTGGAGCA GTTCGAGGCG CTGGTGCGCA TGGACATCCT CAACTGCATC
GCCACCCTGA GCGACAAGCT CAGGAGCGAG ACGCCTGTCA TCTCGCCTTC CCCCGCCATA
CTGCTTAAGT CGGCGCGCCA CCCGATCCTC ACCCTGATGG GAAAGGAGGT GGTCCCCCTC
GACCTGGAAC TCGCCGCCGA CAACAGGGTC ATGGTGGTCA CGGGCCCCAA CACCGGAGGC
AAGACCATCG CCATCAAGAG CGCGGGGCTC CTCTGCGTCA TGGCCCTTTG CGGCATGCCG
GTCCCGGCGC TTCCCGGCAC CGTGCTCCCG CTGGTGGAGA GCATTCTGGT CGACATCGGG
GACGAGCAGT CCATAGAGGA GAGCCTTTCG ACCTTCTCGG CCCACATCTC GAAGATCTCC
AACATCATCG AGCAAGCAGG CCAAGGGGCG CTGGTGCTCC TGGACGAGTT GGGGACCGGC
ACCGAGCCGG GCCAAGGCGC GGCCATCGCC TGCGCGGTCC TCAAGGAGTT GCAGGAGAAG
GGGGCGCTGG TGGTCGCCAC CACGCACCTC ACCGAGATCA TCGGCTTCGT GCAGCGCGAG
GAGGGAATGA TGAACGCGGC CATGGCGTTC GACCGCGACA GGCTGGCGCC GCTCTACCGG
CTCGTGGTCG GGGAACCGGG GGAATCGCAC GCTCTCGAGA TAGCCAGCCG CTACGGCCTG
CCGGATCGCG TGGTGCGCTT CGCCCGAGGG ATGATCGGCA CCATGGAGGC CGATTTCCAT
GCCCTTTTGC GCGACCTCAA GGATAAGCGG GCCCAACTGG AGCGCGCCCT GGAGGATATG
GCCGAGAGGG AGGAAAAGGT TTCCTTCGCC GAGCGGAACC TCGTGGACCG CCGGGATGAG
GCGGCCCAAC TGGTAAAGGA CGCCAAGGAA AAGGGGTTAT TGGAAGCGCA GCAGATCATC
TGGAAGGCCA AGCGAGAGGT GGCTGCCCTT CTGGAGGAGG CCAAGCGCGA GAAGACGAAG
ACGCGGGAGG CGAAGGAAAA GCTCGACCAG GCGGCGAGCG AATTGGAGCA GGCGCTGGAA
GAGTTGCACC CCGAGGAGAA CGTGGACCCG GAAAAGGTGG CGGCGGGAGA CGTTCTTTTC
GTCAAGCCGC TCAACTGCGA CGCCACCGTC CTCGCCATCG ATACGCGCTC AGGCAAGGCG
CGGGTCCGTG CCGGGAGCAT GGAAATGGAG GTGCAGGTCA ATTCGTTGCT AAAGCCCAAA
GGCAGAGAGC CGAAGAAGGT ACAGAAGCGT CGCGAGAAGC AGCAGGCCCA AGAGCAGGAG
CGCGCCGAGC CGGCCTCCAC CATCAACCTC CTGGGGATGC GAGTGGAAGA GGCGGTAGGC
GTCCTGGAAC CGTTCCTGAA CCACGCGGCG CTGGACCGGA TCCAGGAAGT GCACATCGTG
CACGGCAAGG GAACCGGCGC GCTGATGAAG GGGGTGCGGA GCTACCTGGC CGACCACCCG
CTGGTTGCCT CCTTCCGCAC CGGCGAGCGG TACGAGGGGG GCGACGGGGT GACGGTGGTG
ACCCTGCGCT GA
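
The GC content reported for this gene can in principle be recomputed from the sequence above. The following is a minimal sketch, not the database's own method: it strips the spaces that group the bases in blocks of ten and counts G and C. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first row of the sequence is used as a short demonstration.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above, used only as a short demo input.
first_row = "ATGATCAGCA CCGATACCCT CAAGCGGCTG GAGTTCGACA AGATCCTCGA CACCGTCGCT"
seq = clean_sequence(first_row)
print(len(seq), f"{gc_content(seq):.0%}")
```

Passing the full gene sequence to `gc_content` instead of the first row gives a value that can be compared with the GC content field in the gene information.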
 
Protein sequence
MISTDTLKRL EFDKILDTVA SYAHCDASHL GALSITPLSA RDEIELRLGL VEEVRKLTRF 
GIALKLSEFE DITPQVRAVR PTGSVISPLE LQRFIPTLRV MGAISAQLGF RTDVPLLTSQ
AGSITGFPDL LNPLEHTVNE EGEILDTASR LLADIRGRKK GLTARIKKRL EEIVRERHTA
IFLQDDFITQ RSGRWVIPVR MDSKGMVPGV VHDVSNSGET AFMEPLEIIG LANELENLVA
DERAEEIRIV RQICNWIRED AEQILEQFEA LVRMDILNCI ATLSDKLRSE TPVISPSPAI
LLKSARHPIL TLMGKEVVPL DLELAADNRV MVVTGPNTGG KTIAIKSAGL LCVMALCGMP
VPALPGTVLP LVESILVDIG DEQSIEESLS TFSAHISKIS NIIEQAGQGA LVLLDELGTG
TEPGQGAAIA CAVLKELQEK GALVVATTHL TEIIGFVQRE EGMMNAAMAF DRDRLAPLYR
LVVGEPGESH ALEIASRYGL PDRVVRFARG MIGTMEADFH ALLRDLKDKR AQLERALEDM
AEREEKVSFA ERNLVDRRDE AAQLVKDAKE KGLLEAQQII WKAKREVAAL LEEAKREKTK
TREAKEKLDQ AASELEQALE ELHPEENVDP EKVAAGDVLF VKPLNCDATV LAIDTRSGKA
RVRAGSMEME VQVNSLLKPK GREPKKVQKR REKQQAQEQE RAEPASTINL LGMRVEEAVG
VLEPFLNHAA LDRIQEVHIV HGKGTGALMK GVRSYLADHP LVASFRTGER YEGGDGVTVV
TLR
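
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG). The name `translate` and the table construction are illustrative, not taken from the source record.

```python
from itertools import product

# Codon table built in the conventional TCAG order; amino acid assignments
# are those of the standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence listed above.
print(translate("ATGATCAGCA CCGATACCCT CAAGCGGCTG"))  # MISTDTLKRL
print(translate("ACCCTGCGCT GA"))                     # TLR
```

The two outputs match the first ten and last three residues of the protein sequence, and the final codon TGA is the stop codon that is not counted in the protein length.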