Gene Information |
Locus tag | PG0095 |
Symbol | mutS |
ID | 2552448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Porphyromonas gingivalis W83 |
Kingdom | Bacteria |
Replicon accession | NC_002950 |
Strand | - |
Start bp | 112178 |
End bp | 114853 |
Gene Length | 2676 bp |
Protein Length | 891 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637148905 |
Product | DNA mismatch repair protein MutS |
Protein accession | NP_904443 |
Protein GI | 34539964 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
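The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon. Both assumptions are inferred from the numbers in this record rather than stated by it, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(112178, 114853)
print(length)                           # 2676
print(expected_protein_length(length))  # 891
```

Under these assumptions the values agree with the Gene Length (2676 bp) and Protein Length (891 aa) fields.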
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0480084 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAAGC CTGTGGTAGA AACGCCTTTG ATGCGCCAAT ACTTTCAGAT CAAGCAAAAA CACCCTGATG CTATCCTGCT CTTCAGGGTG GGAGATTTCT ACGAAACATT TTCGGAGGAT GCCATCGTGG CTTCCGAGAT CCTGGGGATA ACGCTGACAC GGCGTGCCAA CGGAGCTGCC CAGTTTGTCG AATTGGCCGG CTTTCCTCAT CATGCGTTGG ATACCTATCT GCCCAAATTG GTACGAGCCG GCAAGCGCGT GGCCATATGC GACCAACTGG AAGATCCGAA AAAGACCAAA ACGCTGGTCA AACGCGGTAT TACCGAACTG GTCACTCCCG GCGTATCGAC CAACGACAAT GTACTCTCTC ATAAGGAGAA CAATTTCCTT GCAGCCGTCT CCTGCGGCAA AGAGGTCTTC GGTATTTCTC TTCTGGACAT ATCCACCGGC GAATTTATGG CCGGACAAGG CAATGCCGAC TATGTGGAAA AACTCCTGAC CAACTACCGC CCGAAGGAAA TCCTCGTGGA ACGGTCGGAG CGGTCTCGCT TCAACGACCT CTTCCACTGG AGCGGATTTA TCTTCGATAT GGAAGACTGG GCTTTCTCCT CGGAGAACAA TCGACTACGC GTACTCAAGC ACTTTGACCT GAAAAGCCTC AAAGGTTTCG GGCTGGAAGA GCTTTCGATG GCAGTAACGG CAGCCGGAGC CGTACTGAAC TATCTCGATC TGACACAGCA CCATCAGCTA CAACACATCA CATCGCTGAG CCGACTGGAT GAAAACCGGT ATGTACGCTT GGATAAGTTC ACCGTCCGAA GCCTTGAATT GCTTAGCCCG ATGAACGAGG GAGGCAAGAG TCTGCTCGAC ATCATCGACC ATACGATAAC GCCTATGGGA GCAAGGCGTA TACGACAGTG GATCGTATTC CCACTCAAGG ACCCTGCACG CATACAAGCC CGACAGCGAG TGGTGGAGTT TTTCTTCCGA CATCCTGAAG AGCGAGCCAT CATTGCCGAA CATCTGACAG AGATAGGCGA CTTGGAACGT TTGGTGACGA AAGGCGCCAT GGGACGCATT TCTCCGCGAG AAATGGTACA ACTACGTGTC GCCCTGCAAG CACTCGAACC GATCAAGGAA GTATGTACCC ATGCAGACGA AGAGAATCTG CGCACACTGG GGGGAAAGTT GGAGCTGTGC AAGGAACTAC GCGACAAGAT ATTGCGTGAG GTGATGCCCG ATGCTCCGGC CGCTCTCGGT CGCGGCCCCG TCATCGCACA TGGCGTCGAT GCCACGCTCG ACGAACTGCG TGCACTGGCA TACAGCGGCA AAGACTATCT GATCAAGCTA CAGCAGCAGG AGATAGAGCG AACGGGAATA CCCAGTCTCA AAGTAGCTTA TAACAATGTG TTCGGCTACT ATATCGAAGT CCGCAACACG CACAAGGACA AGGTACCGGC CGAGTGGATC CGCAAGCAAA CACTCGTCAG TGCCGAACGG TATATTACGG AAGAGCTGAA AGAGTACGAA GCAAAAATAC TCGGAGCCGA AGAGAAAATA GCAGCCCTCG AAGGACAGCT GTACGCCTTG CTTGTAGCCG AACTGCAGCG ATACGTGGCA CCTCTCCAAC AGGACAGTCA AGCGGTGGCT TCTCTGGATT GTCTCCTCTC CTTTGCCGAG TCGGCACGCC GCTACAGATT TATCTGCCCT GTAGTGGACG AGAGCTTTAC CATCGACATC AAAGCGGGCC GCCACCCCGT CATCGAACAG CAGCTACCGG CCGATGAACC TTATATCGCC 
AACGATATTT ATTTGGACAC AGACCGCCAG CAAGTCATCA TCGTCACCGG CCCCAATATG AGCGGCAAGT CCGCCCTGCT CAGGCAGACG GCCCTTATCT CTCTGATGGC ACAGATAGGC TCTTTCGTAC CGGCCGAAAG TGCACGTATC GGCATGGTGG ACAGCATTTT CACGCGGGTG GGAGCTTCGG ACAACATCTC CATGGGCGAA TCCACTTTCA TGGTCGAAAT GCAAGAAGCT TCCAATATTC TGAACAATCT CACGCCACGC AGTCTGGTAC TGTTCGACGA ATTAGGACGG GGTACCAGCA CCTATGACGG CATCTCCATA GCCTGGTCTA TCGTGGAGTA CATCCATGAC AATCCCAAGG CACACCCCCG GACTCTCTTC GCCACACATT ATCACGAACT GAACGAGCTG GAAGGGCAAC TCGATCGGGT ACACAACTTC AATGTATCGG CTCGCGAAGT GGACGGCAAG ATGCTCTTCC TTCGCAAATT GGAGCCGGGC GGCAGTGCAC ACAGCTTCGG TATTCAAGTA GCCCGTCTCG GCGGTATGCC ACACCATATC GTACAGCGAG CCACAGACAT CCTGCATCGT CTCGAACAAG AAAGGGAGAA AATAGAGGAA GAGGAGCCAA AAACCAAAGA CACCAAACGA GGCCCTTCCG AAAAGGTGAA AAATGCGTCG CCCACGCTTC CTCGGGACGA AAAAGGCAGG AGCATCGACG GTTATCAGCT TAGCTTCTTT CAGCTCGATG ATCCCGTCTT GTCCCAAATC AGAGAGGAGA TCCTCGATCT GAACATCGAC AACCTCACCC CCTTGGAGGC CCTCAATAAG CTGAACGACA TCAAACGAAT CCTCCGCGGA TACTGA
|
Protein sequence | MAKPVVETPL MRQYFQIKQK HPDAILLFRV GDFYETFSED AIVASEILGI TLTRRANGAA QFVELAGFPH HALDTYLPKL VRAGKRVAIC DQLEDPKKTK TLVKRGITEL VTPGVSTNDN VLSHKENNFL AAVSCGKEVF GISLLDISTG EFMAGQGNAD YVEKLLTNYR PKEILVERSE RSRFNDLFHW SGFIFDMEDW AFSSENNRLR VLKHFDLKSL KGFGLEELSM AVTAAGAVLN YLDLTQHHQL QHITSLSRLD ENRYVRLDKF TVRSLELLSP MNEGGKSLLD IIDHTITPMG ARRIRQWIVF PLKDPARIQA RQRVVEFFFR HPEERAIIAE HLTEIGDLER LVTKGAMGRI SPREMVQLRV ALQALEPIKE VCTHADEENL RTLGGKLELC KELRDKILRE VMPDAPAALG RGPVIAHGVD ATLDELRALA YSGKDYLIKL QQQEIERTGI PSLKVAYNNV FGYYIEVRNT HKDKVPAEWI RKQTLVSAER YITEELKEYE AKILGAEEKI AALEGQLYAL LVAELQRYVA PLQQDSQAVA SLDCLLSFAE SARRYRFICP VVDESFTIDI KAGRHPVIEQ QLPADEPYIA NDIYLDTDRQ QVIIVTGPNM SGKSALLRQT ALISLMAQIG SFVPAESARI GMVDSIFTRV GASDNISMGE STFMVEMQEA SNILNNLTPR SLVLFDELGR GTSTYDGISI AWSIVEYIHD NPKAHPRTLF ATHYHELNEL EGQLDRVHNF NVSAREVDGK MLFLRKLEPG GSAHSFGIQV ARLGGMPHHI VQRATDILHR LEQEREKIEE EEPKTKDTKR GPSEKVKNAS PTLPRDEKGR SIDGYQLSFF QLDDPVLSQI REEILDLNID NLTPLEALNK LNDIKRILRG Y
|
| |
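As a rough consistency check on the two sequences above, the sketch below strips the block spacing from a nucleotide string, computes its GC fraction, and translates it codon by codon. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA) and that translation table 11 assigns the same amino acids as the standard code, differing only in which start codons it permits. The function names are hypothetical, and only the first 30 bases of the record are used as sample input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG x TCAG x TCAG codon order (standard code; assumed
# to match translation table 11 for elongation codons).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGGCAAAGC CTGTGGTAGA AACGCCTTTG"
print(translate(first_blocks))  # MAKPVVETPL
```

The translated prefix matches the first block of the listed protein sequence. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content field.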