Gene Information |
Locus tag | Rsph17025_2655 |
Symbol | |
ID | 5084443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 2696862 |
End bp | 2699492 |
Gene Length | 2631 bp |
Protein Length | 876 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640484218 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001168847 |
Protein GI | 146278688 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
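The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below is an illustration only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2696862, 2699492)
print(length)                  # 2631
print(protein_length(length))  # 876
```

Under these assumptions the computed values match the Gene Length (2631 bp) and Protein Length (876 aa) fields.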
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.607224 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGACG ACACCGTCAC GCCGATGATG GCGCAATATC TGGAGATCAA GGCGCAGCAC CCCGGCGCGA TCCTGTTCTA CCGGATGGGC GACTTCTACG AGATGTTCTT CGAGGATGCG GCGCTGGCGG CCGAGGCGCT CGACATCGCG CTGACCAAGC GCGGCAAGCA CAAGGGCGAG GATATCGCCA TGTGCGGCGT GCCGATCCAT GCCGCCGAGG GCTACCTGCT GACGCTGATC CGCAAGGGGT TCCGCGTCGC CATCGCCGAG CAGATGGAGG ACCCGGCCGA AGCGAAGAAG CGCGGCTCCA AGTCCGTTGT CCGGCGCGAG GTGGTGCGGC TCGTCACCCC CGGCACGCTG ACCGAGGACA GCCTGCTGGA GGCGCGGCGG CACAACTTCC TCTGCGCCTT CGCCGAGATC CGCGACGAGG CGGCACTCGC CTGGGCCGAC ATCTCGACCG GCGAGTTCAG CGTCACGCCC TGCCCGCTGC CCCGCCTGCT GCCCGAGCTT GCCCGCCTCG CGCCGCGCGA ACTGCTGGTG GCCGACGAAC GCCCGCTCGA CTGGATCGAG GAGGTGGGAT GCGCCCTGAC CCCTCTCGCC CGCGCGAGCT TTGACAGCGC CTCGGCCGAA AAGCGGCTCT GCACGCTCTT CGGGGTCGGC ACGCTGGACA GCTTCGGCAA CTTCACCCGC CCCGAGCTGT CGGCCATGGG CGCGCTGGTC GATTACCTCG ACCTCACGCA GCGCGGAAAG CTGCCGCTCC TGCGCCCGCC CGTGCGCGAG GTCGCGGGCG GCACGGTGCA GATCGACGCC GCCACCCGGC GCAACCTCGA GATCACGCAA GCCCTCACCG GCGGGCGCGA AGGTTCGCTG CTCTCGGCGG TGGACCGCAC CGTCACCGCC CCCGGCGCCC GCCTGCTCGA GCGGCGGCTC TCCAGCCCCT CGCGCGACCT TGGCCTGATC CACGACCGGC TCGCGGCTGT GAGCTGGCTG ACGGACGAGC CGCGGCTGCG CGAGGATCTG CGGGCGAGCC TGCGCCGCGT GCCGGACATG GACCGCGCCC TCTCGCGGCT CGCGCTCGAC CGTGCCGGGC CACGGGACAT GGCGGCGATC CGCGCCGGCC TCACGCAGGC CGAGGCCATC GCGGGTCGTA TGCCGGCCGA CGCGCCTTCC CTGCTCGCGG AGACACTCGA GGCGCTCCGC GGCCACGAGA ACCTCGTGGA TCTCCTCGAT CAGGCGCTGG TGGCCGAGCC GCCGCTGCTG GTGCGCGATG GCGGCTTCAT CGCCCCGGGC TTCGATGACG ACCTCGACGA GACACGGCGC CTGCGCGACG AGGGCCGCGG CGTGATCGCG TCGATGCAGG CCGGCTTCAT CGAGACGACC GGCATCCAGA GCCTGAAGAT CAAGCACAAC AACGTGCTGG GCTATTTCAT CGAAGTCACC TCGACCCACG CCGAAAAGAT GCTCTCACCC CCCCTGTCCG AGAGCTTCAT CCACCGCCAG ACGACCGCGG GGCAGGTGCG CTTCACCACC GTCGCCCTCT CGGAACTCGA AACGCGCATC CTGAACGCCG GGAACCGCGC GCTCGAACTC GAGAAGATGC ATTTTGCGGC GCTGCGGACG GCGATCCTCG ATCAGGCGGG CGCGATCGGC CGCGCCGCCC GGGCGCTGGC CGAGGTGGAC CTGATCGCGG CCTTCGCCGA CCTCGCCGTG GCCGAGGACT GGACCGAGCC GCAGGTGGAC GACAGCCGCG CCTTCGCCAT CGAGGCCGGC CGGCATCCGG TCGTCGAGCG TGCCCTCCGC 
CGGACCGGCA CGCCCTTCGT GGCGAACGAC TGCGACCTGT CCAAGGCCGA GACGCCGGCC GTCTGGCTCA TCACCGGGCC GAACATGGCC GGTAAATCCA CCTTCCTGCG CCAGAACGCA CTGATCGCGC TGCTCGCCCA GGCGGGCAGC TTCGTCCCCG CCCGCCGGGC CCATATCGGC CTCGTCAGCC AGATCTTCAG CCGCGTCGGC GCCTCGGACG ATCTGGCCCG CGGCCGCTCG ACCTTCATGG TCGAAATGGT CGAAACCGCC GCCATCCTGA ACCAGGCCGA TGACCGCGCG CTTGTGATCC TCGACGAGAT CGGCCGCGGT ACAGCCACCT GGGACGGGCT CTCGATCGCC TGGGCCACGC TCGAGCATCT GCACGACACG AACCGCTGCC GCGCGCTCTT CGCCACCCAC TACCACGAGA TGACGGCGCT CGCCGGCAAG CTCACCGGCG TCGAGAACGC CACCGTGTCC GTCAAGGAAT GGCAGGGCGA GGTGATCTTC CTGCACGAGG TGCGGCGCGG CGCGGCTGAT CGGTCCTATG GTGTGCAGGT GGCGCGGCTC GCGGGCCTTC CCGCCTCGGT AATCGAGCGC GCCCGTACCG TCCTCGACGC GCTCGAGTCC GGCGAACGCG AGAGCGGTCC ACGGCGGCAG GCGCTGATCG ACGACCTGCC GCTCTTTCGC GCCGCCCCGC CGCCGCCCGC CCCCGCCGCT CCTCCCAAAG CCTCGCAGGT GGAAGAGCGG CTGCGCGCGA TCCAGCCCGA CGACCTCAGC CCGCGCGAGG CGCTCAAACT CCTCTACGAT CTCCGGGCCC TCCTGCCCTG A
|
Protein sequence | MSDDTVTPMM AQYLEIKAQH PGAILFYRMG DFYEMFFEDA ALAAEALDIA LTKRGKHKGE DIAMCGVPIH AAEGYLLTLI RKGFRVAIAE QMEDPAEAKK RGSKSVVRRE VVRLVTPGTL TEDSLLEARR HNFLCAFAEI RDEAALAWAD ISTGEFSVTP CPLPRLLPEL ARLAPRELLV ADERPLDWIE EVGCALTPLA RASFDSASAE KRLCTLFGVG TLDSFGNFTR PELSAMGALV DYLDLTQRGK LPLLRPPVRE VAGGTVQIDA ATRRNLEITQ ALTGGREGSL LSAVDRTVTA PGARLLERRL SSPSRDLGLI HDRLAAVSWL TDEPRLREDL RASLRRVPDM DRALSRLALD RAGPRDMAAI RAGLTQAEAI AGRMPADAPS LLAETLEALR GHENLVDLLD QALVAEPPLL VRDGGFIAPG FDDDLDETRR LRDEGRGVIA SMQAGFIETT GIQSLKIKHN NVLGYFIEVT STHAEKMLSP PLSESFIHRQ TTAGQVRFTT VALSELETRI LNAGNRALEL EKMHFAALRT AILDQAGAIG RAARALAEVD LIAAFADLAV AEDWTEPQVD DSRAFAIEAG RHPVVERALR RTGTPFVAND CDLSKAETPA VWLITGPNMA GKSTFLRQNA LIALLAQAGS FVPARRAHIG LVSQIFSRVG ASDDLARGRS TFMVEMVETA AILNQADDRA LVILDEIGRG TATWDGLSIA WATLEHLHDT NRCRALFATH YHEMTALAGK LTGVENATVS VKEWQGEVIF LHEVRRGAAD RSYGVQVARL AGLPASVIER ARTVLDALES GERESGPRRQ ALIDDLPLFR AAPPPPAPAA PPKASQVEER LRAIQPDDLS PREALKLLYD LRALLP
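The sequence fields are written in space-separated blocks of 10 characters, so they need light cleanup before any calculation. A minimal sketch of computing a GC fraction from such a field follows. The function names are hypothetical, and the example runs only on the first block of the gene sequence, not on the full record.

```python
def clean_sequence(field: str) -> str:
    """Remove spaces and line breaks from a blocked sequence field."""
    return "".join(field.split()).upper()


def gc_fraction(field: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence field."""
    seq = clean_sequence(field)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First 10-base block of the gene sequence, used as a small example.
first_block = "GTGAGCGACG"
print(gc_fraction(first_block))  # 0.7
```

Applying `gc_fraction` to the complete gene sequence field would give a value comparable to the GC content field, subject to how that field was rounded.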
|
| |