Gene Information |
Locus tag | DvMF_0355 |
Symbol | |
ID | 7172239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | - |
Start bp | 412711 |
End bp | 415443 |
Gene Length | 2733 bp |
Protein Length | 910 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643538853 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_002434780 |
Protein GI | 218885459 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
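The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption that the listed numbers are consistent with) and that the final codon of the unspliced CDS is a stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 412711
end_bp = 415443
gene_length_bp = 2733
protein_length_aa = 910

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS is read in whole codons; the last codon is assumed to be a stop.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(span, remainder, coding_codons)  # 2733 0 910
```

Under these assumptions the span equals the listed gene length, the length is a whole number of codons, and the codon count minus the stop matches the listed protein length.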
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 0.0171458 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGAAC AGTACCTGCG CATCAAGGAC GACTATCCGG ATGCGCTGCT GCTCTACCGC ATGGGCGACT TCTACGAAAT GTTCTTCGAC GATGCGGAAA CGGCGGCGCG CGAACTGCAG ATCGCCCTCA CCTGCCGCAA CCCCAACGCC GACCTGCGCG TGCCCATGTG CGGCGTGCCC CACCATTCGG TGGAAGGGTA CATCACCCAA CTGCTGGAAA AGGGCTTCAA GGTGGCCCTG TGCGACCAGA TAGAAGACCC GCGCGAGGCC AAGGGGCTGG TCAAGCGCGC CGTGACCCGC GTGCTGACCC CCGGCACGGT GGTGGAGGAC GCCAACCTCG ACGCCAAGGG CCACAATTAT CTGGGCGCGC TGTTCTGGGA CGAGGAAAAG GAGGCGGGCG GCTTTGCCTG GCTCGATTTT TCCACGGGCG AATGGTCGGG CCTGCATGCG AAAAAGGCCG CCGACCTGTG GCAGTGGGTG CGCAAGATGC AGCCCCGCGA ACTGCTGGTG ACCGAAAGCG CGCAGGTGCC CTCGTCCATG TCGCTCACCT CCACCCAGGT GGTGCGGGTG CCGGGCCGGG CGCCCTTCGA CCTCAAGGCC GCCACGGAGC GGCTGCTGCA CGCCCAGGGC GTACGCGACA CCGGCGCGCT GGGCCTTGAA GGAAAGGATG AACTCGTCCG TTCCTGCGGG GCGCTGCTGG CCTACGTCGT GCAAACCCAG AAGCAGGACA TCATTCACCT TGCGCCGTTC CGCCCCCTGA ACCTGGGCCG CCACCTCATC GTCGATGAAG TGACGGAGCG CAATCTGGAG ATCTTCCGTC GCCTGGATGG GCGCAAGGGG CCGGGCACGC TGTGGCACGT GCTGGACCAC ACCCAGACCC CCATGGGCGG TCGCCTGCTG GAAGAACGCC TGCGCCACCC CTGGCGCGAG ATGGGCCCCA TAGAGGAAAC CCAGGCGGCG GTAAGCTGGC TGTTCGAGCG CGATGACACC CGGCGCGACC TGCGGGCCGG GCTGGACGAG GTGTACGACC TTGAACGGCT GACCACGCGC ATCTTCCTGA ACCGCGCCGC CCCCAAGGAC TACGTGGCCC TGCGCCAGAG CCTGGCCGCC CTGCCCAAGG TGCGCGGCGT GCTGGAACGG ACGGAAACGC CCCACGGCGG CTACCCCACC GACGGCGAGG CGCGCGGCGA TGGCCGCCCC CCCTTCCTGC GCGCGGCGCT GAAGGGCTGG GACGACCTGT CCGACTATTT CGACCTGCTG GAACGCGCGC TGGTGGACAG CCCGCCCCAC CTGGTCACCG AAGGCGGGCT GTTCCGGCCC GGCTTCCACG CCGAACTGGA CGAACTGCTC GACCTGACCG AACACGGCGA GAACATGTTG CAGCAACTGC TGGCGGAGGA ACAGGCCGCC AGCGGGCTGC CCAGGCTCAA GCTGGGCTTC AACCGGGTGT TCGGCTACTA CTTCGAGCTG TCGCGCACCG CGTCCGACGC CGTGCCCGAA CACTTCGTGC GCCGCCAGAC CCTGGCCAAC GCGGAACGCT TCACCACTCC GCGCCTGAAG GAACTGGAAG AAAAACTGGT CTCGGCCAGC GACCGCCGCA AATCGCTGGA ATACAAGTTG TTCCAGAAGC TGCGCGAAAC CGTGTCCGAG GCGCGCCCCC GCGTGCTGTT CATGGCCGAT CTGTTGGCAG GGTTCGACTA TTGGCAGAGC CTTGCGGAAA CGGCCCGCCG CTGGAACTGG ACCCGCCCCG TGCTGGCGCA GGATGCCGAC ATCCTGATCC GCGAAGGCCG CCACCCCGTG 
GTGGAGGCCA TGCAGGGCAG CGCCAACTTC ATCCCCAACG ACCTGCGCAT GGACGAGGCC CGGCGGCTGC TGCTGATCAC CGGCCCCAAC ATGGCGGGCA AATCCACCGT GCTGCGCCAG ACGGCCATCA TCTGCCTGCT GGCCCAGATG GGCTCGTTCG TGCCCGCGCG CGAGGCGCGC CTGGGCATCG CAGACCGGGT ATTCTCTCGG GTGGGCGCAT CGGACAACCT GGCGCAGGGG CAGTCCACCT TCATGGTGGA AATGATGGAA ACCGCGCGCA TCCTGCGCCA GGCGGGCAAG CGCAGCCTAG TCATCCTGGA CGAGATAGGA CGCGGCACCA GCACCTTCGA CGGCCTGGCC CTGGCCTGGG CCGTGGTGGA AGAGCTGGCC CGCCGCGCGG GCGGCACCAT CCGCACCCTG TTCGCCACCC ACTACCACGA GCTGACCGCG CTGGAAGGGC GCATCCCCGG CGTGCACAAC ATGAACATCG CCATCCGCGA ATGGGGCGGC GAAATCGTGT TCCTGCGCCG CCTGGTACCC GGCCCGTCCG ACCGCAGCTA CGGCATCGAG GTGGCCCGGC TTGCCGGGGT GCCGCAGCCG GTGGTACAGC GCGCACGGGA ACTGCTGCAC GAACTGGAAA AAGCCCGTGA CGGCAGGGGT GAAGCCCCCC GCACCGCGCG CGCCGTGCTG CCGGGGCTGG ACATGCCCGA ACCGAAGCAG GCCGCCAAGC CGCGCGCCCT GCTTGCCGCG CGGGCGGAAC CGGTGGCCCA TCCGCTGCTG ACCACCCTGC GCGACCTGGA TACCGACCAC CTTACGCCCA TGGGGGCGCT GCAACTGCTC AACGAATGGA AACTGCTCTG GGGAGGCCCA GCCCATGCCG AAGATACCGT TCCCGACCAG CCCCCGGCAG CGCCGGGAGA CGACCGTGAC TGA
|
Protein sequence | MFEQYLRIKD DYPDALLLYR MGDFYEMFFD DAETAARELQ IALTCRNPNA DLRVPMCGVP HHSVEGYITQ LLEKGFKVAL CDQIEDPREA KGLVKRAVTR VLTPGTVVED ANLDAKGHNY LGALFWDEEK EAGGFAWLDF STGEWSGLHA KKAADLWQWV RKMQPRELLV TESAQVPSSM SLTSTQVVRV PGRAPFDLKA ATERLLHAQG VRDTGALGLE GKDELVRSCG ALLAYVVQTQ KQDIIHLAPF RPLNLGRHLI VDEVTERNLE IFRRLDGRKG PGTLWHVLDH TQTPMGGRLL EERLRHPWRE MGPIEETQAA VSWLFERDDT RRDLRAGLDE VYDLERLTTR IFLNRAAPKD YVALRQSLAA LPKVRGVLER TETPHGGYPT DGEARGDGRP PFLRAALKGW DDLSDYFDLL ERALVDSPPH LVTEGGLFRP GFHAELDELL DLTEHGENML QQLLAEEQAA SGLPRLKLGF NRVFGYYFEL SRTASDAVPE HFVRRQTLAN AERFTTPRLK ELEEKLVSAS DRRKSLEYKL FQKLRETVSE ARPRVLFMAD LLAGFDYWQS LAETARRWNW TRPVLAQDAD ILIREGRHPV VEAMQGSANF IPNDLRMDEA RRLLLITGPN MAGKSTVLRQ TAIICLLAQM GSFVPAREAR LGIADRVFSR VGASDNLAQG QSTFMVEMME TARILRQAGK RSLVILDEIG RGTSTFDGLA LAWAVVEELA RRAGGTIRTL FATHYHELTA LEGRIPGVHN MNIAIREWGG EIVFLRRLVP GPSDRSYGIE VARLAGVPQP VVQRARELLH ELEKARDGRG EAPRTARAVL PGLDMPEPKQ AAKPRALLAA RAEPVAHPLL TTLRDLDTDH LTPMGALQLL NEWKLLWGGP AHAEDTVPDQ PPAAPGDDRD
|
| |
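The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It is a minimal example, not the database's own method. It assumes the gene sequence is listed 5' to 3' in coding orientation (it begins with ATG and ends with TGA) and uses the standard codon assignments, which translation table 11 shares for ordinary codons (table 11 differs in which start codons it permits). The names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in NCBI order (third base cycles fastest over T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
head = translate("ATGTTCGAAC AGTACCTGCG CATCAAGGAC")
# Last seven codons of the gene sequence, in frame, ending with the TGA stop.
tail = translate("CCGGGAGA CGACCGTGAC TGA")

print(head)  # MFEQYLRIKD
print(tail)  # PGDDRD
```

The two fragments reproduce the first ten and last six residues of the listed protein sequence. Translating the full 2733 bp sequence the same way would be expected to give the 910-residue protein.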