Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dde_2054 |
Symbol | |
ID | 3757062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | - |
Start bp | 2098386 |
End bp | 2100701 |
Gene Length | 2316 bp |
Protein Length | 771 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637782942 |
Product | DNA mismatch repair protein MutS-like |
Protein accession | YP_388546 |
Protein GI | 78357097 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGTCCA GAAGCGCGCA CGTACTGGAA TTTGATAAGG TGCTCCGGCA CCTTGCCGGA TATGCTGTAT CCGAAGCCGG GGCACAGGCC TGCCTTTTGC TGGCGCCCGC CGCTGACTGT GACAGCGCCA GAATGCGCTG CGCTTTTTTC CGTCAGGGAC AGCTGTGGGC GGAACGGACA GGGTTCTCTC TTGCTCCGTT TCCTGATCTT TCAGGAGTGT TCCGTTTTCT GGAATCACCG GCGGCTGTGC TGGATATTGA CGCCCTGTGG GCCGTGCGGC AGGTGCTTGC GCAATCACGC GAGCTGGCCG TTGCCATAGG GAGTGGCAAT ACGGCGGAAA ACTGGCCGCT TCTGGCGCAG ATGCTGGAAC GTTACCGCAT GCCTGAAAAG GCTCTTTCCG GACTGATGCG TTGCCTTGGC GAAGACGGGT TGCTGCGTGA TGAAAGTTCA CCGGAGCTGC TGCTGGTCCG GGAAGAAATA CGCCGTATTC ACCGGCTCTG CACACGCAAG GTGAAAGAGC TGGCCAATAC TTACAATCTC AGCCATTACA TGCAGGATGA GTTCCTGACG CTTTCCTCTG ACAGATATGT GCTTCCTTTG AAGGCAAACT TCAAAGGCCG GCTGCAGGGA ATAATTCACG AATACTCCCA GACCGGTGAA ACCTGCTATT TCGAACCCAT GTTTCTTGTG GAGATCAACA ACGATCTTCA GAATCTGAAA CGGCAGGAAC GCGCGGAAGA GCGCAAGATT TTTGAATATC TCACGGGTCT GCTTCGCGCA GAGATGGAAG GCGTGAACGC GTGCTGGAAC CTGCTTGTGG AAGCGGATGT GCTGCTGGCG CGGTGCGCTC TGGCCGGTGC GTTTGGCGGC ATCGCCATAG ATTTTGACGA CACAAGACCC TTTTCTCTGC GCGGGGTGAA GCATCCGCTG CTGGCACTGT CTTCTGCGGA TACGCATCCG GTGGACCTTG AATTGCTGGA AGGACAGAAC TGCCTGATCA TCAGCGGGGG TAATGCCGGC GGCAAGACGG TAAGCCTGAA GACCATAGGT CTTGTGGCGC TGATGGCCGC TGCCGGGCTG CCCGTGCCTG CTGCTGAAGG CAGCACCCTG CCGTGCTGGC GTGCAGTTCA TGCTTTTATC GGCGACGAAC AGAGCCTTGA GGGCAATGTC AGTACGTTCA CGGCTCAGAT ACGCAACCTT AGTGATATCT GGCAGGAAGT CGGTAGCGGT TCGCTTGTTA TTCTTGACGA ATTCGGTGCG GGAACGGACC CCGCGCAGGG CGCCGCGCTG GCTCAGGCTG TGGTTGACGA GCTTATGGAA AAGAATGTCA CCGTGTGTGT GGCTACTCAT TTTCCGGCTC TCAAAGCCTA TGCCCTGAGC AGGGAAGGTG TCCGCGCTGC TTCGGTGCTT TTTGATCCCG GTACCAAGAG ACCTCTGTAC CGTCTTGCCT ATGATCAGGT TGGGGCCAGT CAGGCTCTGG ATGTTGCCCG TGAACATGGC ATGCCCGATG CTGTGCTCAG GCGTGCGCAT AACTATCTGC TGCTGGACGG AGAAGATTCA GCTGCGCTTA TTGAACGTCT TAATGCCCTT GCAGTATCCA GAGAACAGGA ACTGGATGAT CTTGAACGCG AAAAGCTGCG ACTGAAGAGC AAGCGGGACA AGCTTGAAGC CAGATTTGAA AAGGAGCGGC TCCGTCTTTT TGAAAGTGTG CAGGCTCAGG CGCAGGACGT GCTGCGTGAC TGGAAACAGG GTAAGCGCAG CCATAAGCAG GCGTTGAAAG ACCTTGCGGC GGTGCGTCAG CGTCTTGTCA GCGAGCAGGC CGCGGCGCAG CAGGGACAGG CCGCCCCGGT GGAAGCACTT TGCGCAGGCA TGACTGTCCG TTATATTCCC TGGGGAAAAA AGGCTGTCAT TGAAGAGGTT GACGCACGCA AAGAGCGCCT TAAAGTTGAC TTGAACGGTG TGTCTTTGTG GGTTTCTTGC AAGGATGTGG CCGCGTCGGT ACCGCCTGCC GCGCCGAAGA CCGTACGGCG TGAGTCGGTG GTGGCGGAAG ATAACGTGCA GGAACCTGCC GGTCCGCCCA TGCGCGCAGA TCTGCGCGGG ATGAGAGCCG ATGTGGCCGT GGCAGAGCTT GCGGCTGTGC TTGATAACGC ACTGCTTGCC GGTACCCGCG AGGTGGAAGT GGTCCATGGC CGGGGTACAG GTGCGCTGCG GAAGGAAGTT CACGCTTTTT TGCGGAGTTT TGCTGCGGTC GATGCTTTCC GCCTTGCACC CGAAGGGCGC GGCGGCGACG GCATGACCAT TGTCGAATTG AAGTAG
|
Protein sequence | MQSRSAHVLE FDKVLRHLAG YAVSEAGAQA CLLLAPAADC DSARMRCAFF RQGQLWAERT GFSLAPFPDL SGVFRFLESP AAVLDIDALW AVRQVLAQSR ELAVAIGSGN TAENWPLLAQ MLERYRMPEK ALSGLMRCLG EDGLLRDESS PELLLVREEI RRIHRLCTRK VKELANTYNL SHYMQDEFLT LSSDRYVLPL KANFKGRLQG IIHEYSQTGE TCYFEPMFLV EINNDLQNLK RQERAEERKI FEYLTGLLRA EMEGVNACWN LLVEADVLLA RCALAGAFGG IAIDFDDTRP FSLRGVKHPL LALSSADTHP VDLELLEGQN CLIISGGNAG GKTVSLKTIG LVALMAAAGL PVPAAEGSTL PCWRAVHAFI GDEQSLEGNV STFTAQIRNL SDIWQEVGSG SLVILDEFGA GTDPAQGAAL AQAVVDELME KNVTVCVATH FPALKAYALS REGVRAASVL FDPGTKRPLY RLAYDQVGAS QALDVAREHG MPDAVLRRAH NYLLLDGEDS AALIERLNAL AVSREQELDD LEREKLRLKS KRDKLEARFE KERLRLFESV QAQAQDVLRD WKQGKRSHKQ ALKDLAAVRQ RLVSEQAAAQ QGQAAPVEAL CAGMTVRYIP WGKKAVIEEV DARKERLKVD LNGVSLWVSC KDVAASVPPA APKTVRRESV VAEDNVQEPA GPPMRADLRG MRADVAVAEL AAVLDNALLA GTREVEVVHG RGTGALRKEV HAFLRSFAAV DAFRLAPEGR GGDGMTIVEL K
|
| |