Gene Information |
Locus tag | Dvul_1437 |
Symbol | |
ID | 4664237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 1737803 |
End bp | 1740520 |
Gene Length | 2718 bp |
Protein Length | 905 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639819670 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_966882 |
Protein GI | 120602482 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
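The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates, and that the CDS is unspliced and ends in a single stop codon (the record lists "Is gene spliced | No", and the gene sequence ends in TAG). The function names `gene_length` and `protein_length` are hypothetical helpers for illustration and are not part of the database.

```python
# Values copied from the Gene Information table for Dvul_1437.
START_BP = 1737803
END_BP = 1740520


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2718 905
```

Under these assumptions the results agree with the listed Gene Length (2718 bp) and Protein Length (905 aa).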
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.402439 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAACC CTTCGCCGAA ACTGACCCCC ATGTTCGAGC AGTACCTGCG CATCAAGGAG GACTATCCTG ATGCCCTGCT CTTCTATCGC ATGGGCGACT TCTACGAACT CTTCTTCGAC GACGCAGAGA CCACAGCCCG CGAACTCCAG ATAGCCCTCA CCTGCCGCAA CCCCAATGCC GAACTCAAGG CCCCCATGTG CGGTGTGCCC TATCACGCCG TCGAAGGGTA TATCAGCCAG CTTCTCGACA AGGGCTACCG CGTCGCCATC TGCGAACAGA TAGAAGACCC CAAGGAAGCC AAGGGCCTCG TCAAGCGTGC CGTGACCCGG GTGCTCACTC CCGGTACGGT CATCGATGAC GCCAATCTCG ACGCCAAGGA ACACAACTAT CTCGGGGCGC TGTTCTGGAA TCAGGATGCC GAAGCCGGAG CCTTCGCATG GGTCGATGTA TCGACCGGGG AATGGTCAGG ACTCTACTCG CGCAAACTGG CCGAGTTGTG GCAGTGGGCG CAGAAGATGG CCCCGCGCGA ACTTCTGCTG CCCGAGGGCG TCGACACGCC CGCCATGGCG ACTCTGGGTA CGACGCAGAC GGTGCGAGTC CCGGCGCGGT CGCATTTCGA CCTCAAAAGC GGCACGGAAC GCGTCATGCG CGCTCAGGGG GTGGCAGACC TCGGCTCGCT CGGCCTTGAA GGGAAACCGG AACTCGTTCG CGCGTGTGCG GCGCTACTTG CCTACCTCGC CCAGACCCAG AAACAGGAAC TCTCGCATCT TGCACCCTTC AAGCCCCTGA ATCTCGGCCG CCACCTCATC ATCGACGAAG TCACCGAGCG CAACCTCGAA CTCTTTCATA GGCTGGATGG CCGGAAAGGG CCGGGAACGC TCTGGCATAT CCTCGACCGG ACGCTCACGC CGATGGGCGG CCGTCTTCTC GAAGAGCGGA TGCACCACCC TTGGCGTGAA GCCGGCCCCA TCCGTGAAAC GCAGCAGGTC GTGGAATGGC TGTTTCAGGA CGATGTCCGC CGTGAAGCCT TGCGCACCGC GCTCGACCTC GTCTACGACC TTGAGCGTCT CTCGACCCGC ATCTTCCTCA ATCGGGCCAC ACCCAAGGAT TTCATCGCCC TTCGCCAGAG TCTTTCCGCC TTGCCTGCTG TGCGTGCGAC GCTCGAGCGT CCGGCAAACC CGGAAGGGAC ATACCCCACC GATGCGGAGA CCTCCGGCGA CACACTGCCC AAACCCTTGT CGGACATGCT CTCGGCGTGG GACGACCTTG CCGACTATGC CGACCTGCTG CGACGCGCGC TCACCGACAA CCCGCCGCAC CTCGTCACCG AGGGAGGTCT CTTCCGTCCC GGCTTCGACC CCGACCTTGA TGAACTGCTC GACCTTGCGG AACACGGCGA AGCAAGGCTT CAGGAGCTTC TGGCCGAAGA ACAGACGGTG AGCGGGCTTC CCAAACTCAA GCTCGGCTAC AACAGGGTGT TCGGCTATTT CTTCGAGCTT TCCCGGGCCG GGGCCGACTC CGTTCCGGAG CATTTCGTCC GCCGCCAGAC GCTGGCCAAC GCCGAACGTT TCACGACGGA ACGCCTCAAG GAGCTTGAAG AGAAGCTCGT CTCCGCGACA GACAGACGCA AGACGCTGGA ATACCGGCTA TTCCAGTCGC TGCGCGACAC CGTGGCCGAG GCACGGCCAC GTGTCCTGTT CATGGCGGAC ATGCTCGCCC ACCTCGATTT CTGGCAGAGT CTCGCCGATG TGGCCCGACG CAACGGCTGG GTGCGCCCGG ACGTGCACAC GGGCCATGAT 
ATCGTCATCC GTGAAGGACG GCACCCCGTC GTCGAGGCGA TGCAGGGTTC AGCCTCGTTC GTGCCCAACG ACCTTCGCAT GGACGAGAAG CGCCGCCTGC TGCTCATCAC CGGCCCTAAC ATGGCAGGCA AGTCCACCGT CCTGCGGCAG ACTGCCATCA TATGCCTTCT GGCCCAGATG GGGGCTTTCG TGCCCGCCCG TGAAGCGTCC ATCGGCATTG CCGACCGGAT ATTCTCGCGC GTGGGGGCCT CGGACAATCT GGCACAGGGG CAGAGCACCT TCATGGTGGA GATGATGGAG ACGGCGCGCA TCCTCCGTCA GGCTTCGAAG CGAAGCCTTG TCATCCTCGA TGAGATTGGC AGGGGCACGA GCACCTTCGA CGGCATGGCC CTTGCGTGGG CTGTGGTTGA GGAACTCACC CGGCGCGCAG GGGGAGGCAT CCGTACGCTC TTCGCCACCC ATTACCATGA GATAACGTCG CTCGAAGGCC GCATTCCGGG GGTGCACAAC ATGAACATCG CCATCCGCGA ATGGAACGGC GACATCGTGT TCCTGCGTCG CCTTGTCCCA GGCCCTGCAG ACAAGAGCTA TGGCATCGAG GTCGCGCGCC TAGCGGGAGT GCCCCATTCC GTGGTACAGC GGGCGCGTGA ACTGCTGGCA GACCTCGAGC GCACGCGGGA TGCTGCGAGA GGGACGAACA GTGCCCCCTC TCGCCAGACG CTGCCCGGTC TCGACCTGCC CTCGAAACAG GAACAGGTCG ACACCATCGT CGCCCCTCCT CCATGCTCTG GTGTGGAGCA CCCGCTGCTT GTGGCATTGC GTGACATCGA CACTGACGAC ATGACCCCGC TTGAAGCCCT CAAGCGTATC ACTGAATGGA AACAACTCTG GGGGACAACC CGTGAAGATC GCTCATAG
|
Protein sequence | MTNPSPKLTP MFEQYLRIKE DYPDALLFYR MGDFYELFFD DAETTARELQ IALTCRNPNA ELKAPMCGVP YHAVEGYISQ LLDKGYRVAI CEQIEDPKEA KGLVKRAVTR VLTPGTVIDD ANLDAKEHNY LGALFWNQDA EAGAFAWVDV STGEWSGLYS RKLAELWQWA QKMAPRELLL PEGVDTPAMA TLGTTQTVRV PARSHFDLKS GTERVMRAQG VADLGSLGLE GKPELVRACA ALLAYLAQTQ KQELSHLAPF KPLNLGRHLI IDEVTERNLE LFHRLDGRKG PGTLWHILDR TLTPMGGRLL EERMHHPWRE AGPIRETQQV VEWLFQDDVR REALRTALDL VYDLERLSTR IFLNRATPKD FIALRQSLSA LPAVRATLER PANPEGTYPT DAETSGDTLP KPLSDMLSAW DDLADYADLL RRALTDNPPH LVTEGGLFRP GFDPDLDELL DLAEHGEARL QELLAEEQTV SGLPKLKLGY NRVFGYFFEL SRAGADSVPE HFVRRQTLAN AERFTTERLK ELEEKLVSAT DRRKTLEYRL FQSLRDTVAE ARPRVLFMAD MLAHLDFWQS LADVARRNGW VRPDVHTGHD IVIREGRHPV VEAMQGSASF VPNDLRMDEK RRLLLITGPN MAGKSTVLRQ TAIICLLAQM GAFVPAREAS IGIADRIFSR VGASDNLAQG QSTFMVEMME TARILRQASK RSLVILDEIG RGTSTFDGMA LAWAVVEELT RRAGGGIRTL FATHYHEITS LEGRIPGVHN MNIAIREWNG DIVFLRRLVP GPADKSYGIE VARLAGVPHS VVQRARELLA DLERTRDAAR GTNSAPSRQT LPGLDLPSKQ EQVDTIVAPP PCSGVEHPLL VALRDIDTDD MTPLEALKRI TEWKQLWGTT REDRS