Gene Gmet_2974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGmet_2974 
Symbol 
ID3740885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter metallireducens GS-15 
KingdomBacteria 
Replicon accessionNC_007517 
Strand
Start bp3360327 
End bp3362684 
Gene Length2358 bp 
Protein Length785 aa 
Translation table11 
GC content68% 
IMG OID637780263 
ProductMutS 2 protein 
Protein accessionYP_385914 
Protein GI78224167 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACGAC ACGAAACCCT CCGCACCCTC GAATTCGACA AGGTCCTCGC GGCCGTCGGC 
GGCTACGGCC ACAGCACCGC CACCCAGGAT GAGATCGCCC TGATCCGTCC CCTGGACGAT
TGGTCGGCCA TCACCCGGCG CTTCGGCCAG GTGGACGAGA TCCGGCGCAT GACCCAGCAG
GGGATCGCCA TCCCCCTCTC CACGTTCGAC GACATCTCGG CCCTTCTGGA CGCGGTGCGC
CCCGAGGGTG CGGTCCTCGA CCCCACGGAA TTGGTGATAC TCTTTCCGGT GCTCCGGACC
ATGACCGCCA TCGCCAAACA GTTCGCCTAC CGGTCCGACA TCCCGCTCCT GAAGGAGTTG
GCTGGCCACG TGACCGGCTT CCCCGACATC CTGGACGAGC TGGAGGTGTC CATCGACAGC
GAGGGGGAGA TCCTCGACTC CGCCTCGCCG CTCCTCTTCG ACCTGCGCAA GAAAAAGCGC
GCCCTCACCG AGCGGATCCG GCGGCGGCTG GCCGAGATCG TCCGGGAGAC CGGCGTCACC
ACCTTCCTCC AGGACGACTT CATCACCCAG CGGGGTGGCC GGTGGGTGAT CCCGGTCCGG
ATGGACTCCA AGGGGATGGT CCCCGGCGTG GTCCACGACG TCTCCAACTC CGGCGAGACC
GCCTTCATGG AACCATTGGA AATCATCGGC CTCGCCAACG AGTTGGAAAA CCTGGTGGCC
GAGGAGAAGG CCGAGATGAT CCGCATCGTC CGGGCCATCT GCCGGATGCT TCGCCGCGAG
GCCGATCCCC TGGCGGAACA GTTCCGGACC CTGGTCCACC TGGACCTCCT GAACGCCGTG
GCCACCTTCG CCGATTCCCT CTCCGCCGAA AACCCCGAGA TCAACGACGC CCGCTTCATC
CGGGTAAACG AAGGGCGGCA TCCCCTCCTG GCCCTCATGG CCCGGGAGCG GGGTGCCGGC
AGGGTCGTGC CCCTGGACCT TTCCCTCGGG GAGACTGAGC AGGTCATGGT CATCACCGGC
CCCAACGCCG GCGGCAAAAC CATCTCCCTC AAGACCACCG GCCTCCTCCA CCTCATGGCC
CTGGCCGGGC TTCCGGTACC GGCCGCATCC ACCTCGTCGT TCCCCCTCAT CTCCGACCTC
CTGGTGGACA TCGGCGATGA GCAGTCCATC GAGCAGAGCC TCTCCACCTT CTCGGCCCAC
GTCTCCAACA TCGCGGGGAT CCTGGAACGG GCCGACGACC GGACCGTGGT GCTTCTGGAC
GAGCTGGGGA CCGGTACCGA GCCGGTCCAG GGGGCGGCCA TCTCCTGCGC CGTCCTGGCC
GATCTCCAGG AGAAGGGGGC ACGGGTCATC GCCACGACCC ACCTCACCGA CATCGTCGGC
TTCGTCCACA AGCGGGACGG GATGGTGAAC GCCTCCATGG AGTTCGACCG GGCGACCCTC
ACCCCCCTCT ACCGCCTCAA GAAGGGGGAG CCGGGGCAGT CCCACGCCCT GGAGATCGCC
CGCCGTTACG GCCTCCCGGA CCGGGTGGTG GAGTTCGCCA CCGGCATGCT CTCCCGCATG
GAGACCGAGT TCCACGAGCT TCTGGCCGAG CTGAAGGACC AGCGCCGGCG CCACGAGGAG
GCCCTGGCCG AGGCGGAGCG ACTGCGGCGG GATGCCGAGG AAAAGGCCCG CATCGTCCGT
GAGCGGCTGG CCGAGGCCGA GGCGAAGCGG CGGGAGGCGG TGGAAAAGGC GTTTCAGGAG
GCGAAGGAGA TTGTCCGAAG TGCCCGGCGG GAGGTGAACG CCATCATCGA GGAGGCCCGG
AAAGAGAAGA GCCGCGAGGC CCGGAAAAAG ATCGACGAGG CCGAGGCGCG GGTGGAGGAG
CAGCTCCAGG AGTTCCACCC CGAGGAGCGC GTTCCCCTGG AGGCCATCAG CGAGGGGGAC
ACGGTCCATG TGAAGCGTCT CGGCCACGAC GTGACCGTCC TCGCCGTGGA CCGGAAGGGG
GAGACCCTCA AGGTCCGGGC CGGCACCTTC GAGCTGGTGG TGGAGGCGGC CGACGTGGCC
CCGCCGAGGG AAAAGGGGGG GAAGAAACCC AAGGCCAGGG CCGCCGCCAA GATCGCCGCC
CCTTCCCGGG AGTCCACACC CCACGAACTG AACCTCATCG GCCTGCGGGT GGACGATGCC
CTGGGGCGGC TGGAGCCGTT CCTGAACCAT GCATCCCTGG AAGGGTACGG CGAGGTGCGG
ATCGTTCACG GCAAGGGGAC CGGCGCCCTC ATGCGGGGGG TGCGGGAGTA CCTGGACGGC
CATCCCCTGG TGCGGGAGTT CCGCCCCGGC GAGCCCTTCG AGGGGGGCGA GGGGGCCACG
GTGGTGCTGC TAAGGTAG
 
Protein sequence
MIRHETLRTL EFDKVLAAVG GYGHSTATQD EIALIRPLDD WSAITRRFGQ VDEIRRMTQQ 
GIAIPLSTFD DISALLDAVR PEGAVLDPTE LVILFPVLRT MTAIAKQFAY RSDIPLLKEL
AGHVTGFPDI LDELEVSIDS EGEILDSASP LLFDLRKKKR ALTERIRRRL AEIVRETGVT
TFLQDDFITQ RGGRWVIPVR MDSKGMVPGV VHDVSNSGET AFMEPLEIIG LANELENLVA
EEKAEMIRIV RAICRMLRRE ADPLAEQFRT LVHLDLLNAV ATFADSLSAE NPEINDARFI
RVNEGRHPLL ALMARERGAG RVVPLDLSLG ETEQVMVITG PNAGGKTISL KTTGLLHLMA
LAGLPVPAAS TSSFPLISDL LVDIGDEQSI EQSLSTFSAH VSNIAGILER ADDRTVVLLD
ELGTGTEPVQ GAAISCAVLA DLQEKGARVI ATTHLTDIVG FVHKRDGMVN ASMEFDRATL
TPLYRLKKGE PGQSHALEIA RRYGLPDRVV EFATGMLSRM ETEFHELLAE LKDQRRRHEE
ALAEAERLRR DAEEKARIVR ERLAEAEAKR REAVEKAFQE AKEIVRSARR EVNAIIEEAR
KEKSREARKK IDEAEARVEE QLQEFHPEER VPLEAISEGD TVHVKRLGHD VTVLAVDRKG
ETLKVRAGTF ELVVEAADVA PPREKGGKKP KARAAAKIAA PSRESTPHEL NLIGLRVDDA
LGRLEPFLNH ASLEGYGEVR IVHGKGTGAL MRGVREYLDG HPLVREFRPG EPFEGGEGAT
VVLLR