Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dde_1976 |
Symbol | |
ID | 3756982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | + |
Start bp | 2020900 |
End bp | 2023614 |
Gene Length | 2715 bp |
Protein Length | 904 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637782862 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_388468 |
Protein GI | 78357019 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0020973 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTAGCC ATTCTCCAAA GCTGACCCCG ATGTTCGAAC AATACATGAA CATCAAGGCC GAGTATCCTG ACGCCCTGCT CTTTTACCGT ATGGGCGACT TTTACGAGCT GTTCTTTGAG GACGCGGAAG TCGCTGCCCG CGAGCTGCAG ATAGCCCTGA CATGCCGCAA CCCCAATGCC GAAAACAAAG TGCCCATGTG CGGTGTGCCC CATCACTCCG CACGCAGCTA CATCAGCCAG CTGGTGGATA AAGGCTACAA AGTCGCCATC TGCGAACAGA TGGAAGACCC GCGAGAAGCC AAGGGGCTGG TGAAGCGGGG AGTCATAAGA GTTTTGACCT CCGGCACTGC GCTGGAGGAC GAAAACCTTT CCCCCAAGGC TCACACCTAT CTCGGGGCAT TGTGCTGGGA CAAGTCGGAA GGAGCCGGCG GATTTGCATG GGTGGACTTT TCCACAGGCG AGTGGTCCGG ACTGCAAAGC CGGAAAGAAC AGGAACTCTG GCAATGGGTT CAGAAAATGG CGCCCCGCGA ATTGCTGCTG GCCGACACGC TGACACCGCC TGCATCGCTG GAACTGACCG AAACCCAGTT CAGCAAAGTG CCGGAACGCG CTTATTTTGA TTACAAACGC TCGGCGGAAA AGATCATGTC TGCCCAGCAG GTTGCCGAAC TCGGCGCACT GGGGCTGGAA AACCGCAAAG AGCTCGTCCG GGCCTGCGGG GCGCTGCTCA CATATCTTTC CCAGACACAG AAGCAGGACC TTAACCACCT TTGCCAGTTC AAGCCGCTGA ACCTGAACCG ACATCTGCTG CTTGACGAAA TAACCGAGAG GAATCTCGAG CTTTTCAGAC GGCTGGACGG TCGCAAGGGC AAAGGCACCC TGTGGCATGT TCTGGACCAT ACGGTCACCC CCATGGGCGG AAGACTGCTG CAGGAACGCC TGAAACACCC GTGGCGCGAA CAGGCCCCCA TTGACGAAAC GCAGGAAGCC GTTTCGCACT TTTTTGCCCA CAACACACTG CGCAGACAGC TCAGAGAAGC ACTTGATACG GTTTATGACA TCGAGCGGCT GAGCACCCGG ATTTTTCTGA ACCGGGCAAC CCCAAGGGAT TACGTGGCGC TGCGGCAGAG CCTGAAAGCA CTGCCGGCAG TAAGAGAACT GCTGGAGGCC CCGCAGACAG GAGACGGCAG ATACGCCACT CCGGAAGAAC AGCTGGGGGC CGCCCTGCCG CCCTTTCTGC ACAGAATGCT CAAAAGCTGG GATGACCTTG CCGACTACCA CGATCTTCTG GAAAAAGCGC TGGTAGACAA CCCGCCCCAC GTCATCACGG AAGGCGGACT GTTCCGTCAG GGATTCCACC CTGCTCTGGA CGAGCTGATG GATCTCTCCG AACACGGAGC CTCGAAATTG CATGACCTGC TTGCAGAGGT ACAGCAGACC ACAGGCATCA GCAAAATAAA ACTCGGCAAT AACAGAGTGT TCGGTTATTA TTTTGAAGTA CCGAAGTCTG TTTCGGAAGA ACTGCCCGAC ACCTTTGTTC GCCGTCAGAC GCTGGCCAAC GCCGAACGGT ATACATCAGA GCGCCTCAAG GAGCTGGAAG AAAAGCTGTT TTCAGCAGCC GACAAGCGCA AGACCATGGA ACTGAAGCTG TTTCAGCAAC TGCGCGAGCA TGTGGCCCAA GCCCGACCGA GAGTGCTTTT CATGGCGGAT CTGCTGGCCA CCCTTGACCA CTGGCAGGGA CTGGCGGAAG CCGCCCGGCA CTGGAACTGG GTGCGCCCGG TTCTGCATGA CGGGCAGGAT ATTGTCATCC GCGAAGGACG CCATCCTGTG GTGGAAGCCG TACAGGGACC TGCGGGATTC ATCCCCAACG ATCTGCGCAT AGATGACCAG CGCCGCCTGC TGCTGATTAC CGGCCCCAAC ATGGCCGGCA AGTCCACAGT CCTGAGACAG GCTGCGATTA TCTGCATTCT TGCCCAGATA GGTTCTTTTG TTCCTGCACG GGAGGCGCGC ATAGGGCTGT GCGACCGTAT TTTTTCGCGT GTGGGGGCTT CGGACAATCT GGCGCAGGGG CAGTCCACAT TCATGGTTGA AATGATGGAG ACAGCCCGTA TTCTGCGGCA GGCCACCAGG CGCAGTCTGG TCATACTTGA TGAAATCGGC CGCGGCACAT CAACCTTTGA CGGACTGGCT CTGGCGTGGG CTGTAGTTGA AGAACTTATG AAAAAACAGC AGGCCGGCAT ACGCACGCTG TTTGCCACCC ATTACCACGA GCTGACGTCG CTGGAGGGAA CCATTCCCGG CGTGCACAAC ATGAACATCG CCATAAAGGA ATGGGGAGGC GAGATTGTCT TTCTGCGCCG GCTGGTGCCC GGTCCTTCCG ACCGCAGCTA TGGTGTAGAG GTGGCAAAGC TGGCCGGAGT ACCGCAAAAC GTGGTGCAAC GTGCCCGACA GATTCTGGAG TTGCTGGAGC AGAAAAGCAA AGCTGACGGA ACCAGACGCC CGGCATCATA CCACGAAGCC CAGCCGCTGC TGCCGGGTAT GCCCGAACCG CCATCAACAG CAAGTGCAGA ACCACCTCAG ACCGTCACGC CGCCTGAGCC TCCCGTGCTT ACCGCCCTGC GTGACCTTGA CACGGACAAC CTGACACCAC TGGAAGCCCT TACCGTCCTG ACGGAATGGA AAACTTTATG GGGAGCCGGA AAAAATGAAT GCTGA
|
Protein sequence | MTSHSPKLTP MFEQYMNIKA EYPDALLFYR MGDFYELFFE DAEVAARELQ IALTCRNPNA ENKVPMCGVP HHSARSYISQ LVDKGYKVAI CEQMEDPREA KGLVKRGVIR VLTSGTALED ENLSPKAHTY LGALCWDKSE GAGGFAWVDF STGEWSGLQS RKEQELWQWV QKMAPRELLL ADTLTPPASL ELTETQFSKV PERAYFDYKR SAEKIMSAQQ VAELGALGLE NRKELVRACG ALLTYLSQTQ KQDLNHLCQF KPLNLNRHLL LDEITERNLE LFRRLDGRKG KGTLWHVLDH TVTPMGGRLL QERLKHPWRE QAPIDETQEA VSHFFAHNTL RRQLREALDT VYDIERLSTR IFLNRATPRD YVALRQSLKA LPAVRELLEA PQTGDGRYAT PEEQLGAALP PFLHRMLKSW DDLADYHDLL EKALVDNPPH VITEGGLFRQ GFHPALDELM DLSEHGASKL HDLLAEVQQT TGISKIKLGN NRVFGYYFEV PKSVSEELPD TFVRRQTLAN AERYTSERLK ELEEKLFSAA DKRKTMELKL FQQLREHVAQ ARPRVLFMAD LLATLDHWQG LAEAARHWNW VRPVLHDGQD IVIREGRHPV VEAVQGPAGF IPNDLRIDDQ RRLLLITGPN MAGKSTVLRQ AAIICILAQI GSFVPAREAR IGLCDRIFSR VGASDNLAQG QSTFMVEMME TARILRQATR RSLVILDEIG RGTSTFDGLA LAWAVVEELM KKQQAGIRTL FATHYHELTS LEGTIPGVHN MNIAIKEWGG EIVFLRRLVP GPSDRSYGVE VAKLAGVPQN VVQRARQILE LLEQKSKADG TRRPASYHEA QPLLPGMPEP PSTASAEPPQ TVTPPEPPVL TALRDLDTDN LTPLEALTVL TEWKTLWGAG KNEC
|
| |