Gene GSU1822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1822 
Symbol 
ID2688694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1989919 
End bp1992534 
Gene Length2616 bp 
Protein Length871 aa 
Translation table11 
GC content60% 
IMG OID637126511 
ProductDNA mismatch repair protein MutS 
Protein accessionNP_952872 
Protein GI39996921 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0116284 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGAAC TTACTCCCAT GATGCGCCAA TATCTGGAGA TCAAGGCTGA TCATCCTGAC 
GCGATCCTCT TTTTTCGTCT GGGCGATTTC TACGAAATGT TCCTTGACGA TGCGGTCAAG
GCATCGCGTA TCCTTGACAT CACCCTGACT TCGCGCAACA AGGGGGGCGA CGGTGCCGAC
ATTCCACTCT GCGGTGTCCC TTTTCATTCG GCGGCACCGT ATATCGCCAA ACTGGTCGAG
GCGGGCGAGA AGGTCGCCAT CTGCGAACAG GTCGAGGACC CAAAAAGCGT CAAGGGGATC
GTCAAGCGGG AAGTGGTCAA GGTGGTTACC CCTGGCCTGG TGATTGACGC CGAAAGCCTT
TCTCCCAAGG AGAATAATTA TCTTCTGTCT CTTTTCCCCG GCCCGGACCG CTGGGGGGTT
GCCTATCTCG ATCTCTCCAC AGGTGATTTC CGAGTAACCG AAACTGATTC AGCCGATGCC
GCGTGGGCGG AGGTCGCCTG CGTCAACCCC CGTGAAATCC TGATGCCCCT TTCGTTCAGG
GACGGGGGGG CGGGGAGTGA ACGCCCGGAC CTGGCGGCGG GGCGGATGCT CAGCTATGTT
GATGAATGGG TGTACGACGC CGAGTATGCC GAACGCATGG TGCGTAATCA TTTTGGTGTC
GCCTCTGCGG AGGCGGCTGG GTGCGGCAGC ATGGACTGCG GCCTCAGGGC CGTTGCCGCG
GTGCTCCACT ATCTGCAGCA GACCCAAAAG GGGGACGTTC GCCATATCAG CTACCTTCAG
GCGTACCGTA CCCAGGAGTT CCTGGTGCTG GACGAGTCTT CCCGGCGCAA CCTGGAGCTG
AACGCCACCA TTGGCGATGG AAAGCGGCGC GGCTCTCTCC TTGGCTTGCT GGACCGGACC
GTGACGGCAA TGGGGGGGAG GAAGCTCAAA CAGTGGATTA ATTATCCGCT TGTATCCATA
GAAAAAATAA ACGAACGACT TGATGCGGTC GAGGAGCTGG TCGCTGATGC CGAGTTCAGG
CAGGGGGTTC GGGCGGCCCT TGACGGCGTC TATGACCTGG AGCGGCTTAA CGGCCGGATC
AGCCTCGCTT CGGCTTCGGC CAAAGATCTG GTCGCCCTGC GCGCTTCCCT GGTACGGTTG
CCGTCGCTCA TTGCCCTTCT GACGCCGGCA GCCTCAACGC TCCTGGCCAG ATTGCGCGAC
GGGATTGATC TCCTCGCGGA CGTGGAGGAG CTGATCGGCC GTGGCATCGT GCCAGATCCT
CCCTTTGTTC TCCGTGAAGG GGGGATCATC GCGCAAGGGT ACCACTCCGA GCTTGACGAA
CTACGCAGCA TCAGCCGGGA AGGGAAGGGG TTCATTGCCC GCCTTGAAGC CCAGGAAAAG
GCGCGCACCG GTATTTCGTC GCTCAAGGTT CGCTACAATA AGGTGTTTGG TTATTACATC
GAGGTGACCA AGTCGAACCT GTCAGCCATC CCGGACGACT ACATCCGGCG CCAGACCCTG
GCCAATGCCG AGCGTTTCAT CACGCCTGAG CTGAAAGAGT ACGAGGAAAA AGTGCTCGGG
GCCGAAGACA GGATCGTTGA GCTGGAATAC GCACTCTTCC AGGATATCCG CCAGCGGGTG
GCGGCCCAGG GGGAGCGGAT CGCCCGCACT GCGGACCGGC TTGCGACTCT GGACGTGCTG
GCGGCCCTGG CCGACGTGGC CCACGACCAT CGCTATGTCA GGCCGACGGT GGACGAAGGG
GACGCCATCG TCGTCACCGG CGGCCGCCAT CCCGTGGTGG AAGCCCTGAA CCGGTCCGAG
CGGTTTGTTG CCAATGATGT GCAGCTCGAC AACGGTGAGA ACCAGTTGGT CATCATCACT
GGTCCCAACA TGGCCGGTAA GTCGACCTTC ATGCGACAGG TGGCCCTGAT TGTTCTCATG
GCCCAGACGG GGTCCTTCGT GCCTGCCGAC GAAGCCAGCA TCGGCGTGGT CGACCGCATC
TTCACCCGGG TCGGCGCATC GGACAATCTG GCGCGGGGCC AGTCGACCTT CATGGTCGAG
ATGATGGAAA CCGCGGCGAT TCTGCGCAAT GCAACGCCGC GCAGCTTGGT CGTGCTCGAT
GAGATCGGCC GGGGAACATC CACCTTCGAT GGTGTTTCCA TTGCCTGGGC CGTGGCCGAG
TATCTGCACG ACACGGAGCG GTGTGCGGCA AAGACCCTCT TTGCCACCCA CTACCACGAA
CTGACCGAGC TTGCGGTGAC ACGCAACCGG GTCAAGAACT GTAATGTGGC GGTGAAGGAG
TGGAACGATC AGGTAATCTT TCTCCGCAAG ATCGTCGAGG GAGGGGCCTC CCACTCCTAC
GGTATTCAGG TGGCCCGCTT GGCCGGACTT CCGCAGGAGG TTATCGAGCG GGCCAAAGAG
ATTCTCCATA ACCTCGAAAA GGGAGAGTAT GCCGAAGGGG GCATACCGCG CATCGCCCGG
GGGAAAAGGG CCGGAGCGCC GAAGCCGTCG CCGCAGCTGT CCCTCTTCGA CCAGGGGGAC
GACCTGCTGC GTCGCCGGAT TGCTGGTTTG AATATTGCAG CTCTCACCCC CCTTGAAGCG
CTCAATATCC TGGACGAATT GAAAAGGATG GTCTGA
 
Protein sequence
MSELTPMMRQ YLEIKADHPD AILFFRLGDF YEMFLDDAVK ASRILDITLT SRNKGGDGAD 
IPLCGVPFHS AAPYIAKLVE AGEKVAICEQ VEDPKSVKGI VKREVVKVVT PGLVIDAESL
SPKENNYLLS LFPGPDRWGV AYLDLSTGDF RVTETDSADA AWAEVACVNP REILMPLSFR
DGGAGSERPD LAAGRMLSYV DEWVYDAEYA ERMVRNHFGV ASAEAAGCGS MDCGLRAVAA
VLHYLQQTQK GDVRHISYLQ AYRTQEFLVL DESSRRNLEL NATIGDGKRR GSLLGLLDRT
VTAMGGRKLK QWINYPLVSI EKINERLDAV EELVADAEFR QGVRAALDGV YDLERLNGRI
SLASASAKDL VALRASLVRL PSLIALLTPA ASTLLARLRD GIDLLADVEE LIGRGIVPDP
PFVLREGGII AQGYHSELDE LRSISREGKG FIARLEAQEK ARTGISSLKV RYNKVFGYYI
EVTKSNLSAI PDDYIRRQTL ANAERFITPE LKEYEEKVLG AEDRIVELEY ALFQDIRQRV
AAQGERIART ADRLATLDVL AALADVAHDH RYVRPTVDEG DAIVVTGGRH PVVEALNRSE
RFVANDVQLD NGENQLVIIT GPNMAGKSTF MRQVALIVLM AQTGSFVPAD EASIGVVDRI
FTRVGASDNL ARGQSTFMVE MMETAAILRN ATPRSLVVLD EIGRGTSTFD GVSIAWAVAE
YLHDTERCAA KTLFATHYHE LTELAVTRNR VKNCNVAVKE WNDQVIFLRK IVEGGASHSY
GIQVARLAGL PQEVIERAKE ILHNLEKGEY AEGGIPRIAR GKRAGAPKPS PQLSLFDQGD
DLLRRRIAGL NIAALTPLEA LNILDELKRM V