Gene Dde_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDde_2054 
Symbol 
ID3757062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio desulfuricans subsp. desulfuricans str. G20 
KingdomBacteria 
Replicon accessionNC_007519 
Strand
Start bp2098386 
End bp2100701 
Gene Length2316 bp 
Protein Length771 aa 
Translation table11 
GC content58% 
IMG OID637782942 
ProductDNA mismatch repair protein MutS-like 
Protein accessionYP_388546 
Protein GI78357097 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGTCCA GAAGCGCGCA CGTACTGGAA TTTGATAAGG TGCTCCGGCA CCTTGCCGGA 
TATGCTGTAT CCGAAGCCGG GGCACAGGCC TGCCTTTTGC TGGCGCCCGC CGCTGACTGT
GACAGCGCCA GAATGCGCTG CGCTTTTTTC CGTCAGGGAC AGCTGTGGGC GGAACGGACA
GGGTTCTCTC TTGCTCCGTT TCCTGATCTT TCAGGAGTGT TCCGTTTTCT GGAATCACCG
GCGGCTGTGC TGGATATTGA CGCCCTGTGG GCCGTGCGGC AGGTGCTTGC GCAATCACGC
GAGCTGGCCG TTGCCATAGG GAGTGGCAAT ACGGCGGAAA ACTGGCCGCT TCTGGCGCAG
ATGCTGGAAC GTTACCGCAT GCCTGAAAAG GCTCTTTCCG GACTGATGCG TTGCCTTGGC
GAAGACGGGT TGCTGCGTGA TGAAAGTTCA CCGGAGCTGC TGCTGGTCCG GGAAGAAATA
CGCCGTATTC ACCGGCTCTG CACACGCAAG GTGAAAGAGC TGGCCAATAC TTACAATCTC
AGCCATTACA TGCAGGATGA GTTCCTGACG CTTTCCTCTG ACAGATATGT GCTTCCTTTG
AAGGCAAACT TCAAAGGCCG GCTGCAGGGA ATAATTCACG AATACTCCCA GACCGGTGAA
ACCTGCTATT TCGAACCCAT GTTTCTTGTG GAGATCAACA ACGATCTTCA GAATCTGAAA
CGGCAGGAAC GCGCGGAAGA GCGCAAGATT TTTGAATATC TCACGGGTCT GCTTCGCGCA
GAGATGGAAG GCGTGAACGC GTGCTGGAAC CTGCTTGTGG AAGCGGATGT GCTGCTGGCG
CGGTGCGCTC TGGCCGGTGC GTTTGGCGGC ATCGCCATAG ATTTTGACGA CACAAGACCC
TTTTCTCTGC GCGGGGTGAA GCATCCGCTG CTGGCACTGT CTTCTGCGGA TACGCATCCG
GTGGACCTTG AATTGCTGGA AGGACAGAAC TGCCTGATCA TCAGCGGGGG TAATGCCGGC
GGCAAGACGG TAAGCCTGAA GACCATAGGT CTTGTGGCGC TGATGGCCGC TGCCGGGCTG
CCCGTGCCTG CTGCTGAAGG CAGCACCCTG CCGTGCTGGC GTGCAGTTCA TGCTTTTATC
GGCGACGAAC AGAGCCTTGA GGGCAATGTC AGTACGTTCA CGGCTCAGAT ACGCAACCTT
AGTGATATCT GGCAGGAAGT CGGTAGCGGT TCGCTTGTTA TTCTTGACGA ATTCGGTGCG
GGAACGGACC CCGCGCAGGG CGCCGCGCTG GCTCAGGCTG TGGTTGACGA GCTTATGGAA
AAGAATGTCA CCGTGTGTGT GGCTACTCAT TTTCCGGCTC TCAAAGCCTA TGCCCTGAGC
AGGGAAGGTG TCCGCGCTGC TTCGGTGCTT TTTGATCCCG GTACCAAGAG ACCTCTGTAC
CGTCTTGCCT ATGATCAGGT TGGGGCCAGT CAGGCTCTGG ATGTTGCCCG TGAACATGGC
ATGCCCGATG CTGTGCTCAG GCGTGCGCAT AACTATCTGC TGCTGGACGG AGAAGATTCA
GCTGCGCTTA TTGAACGTCT TAATGCCCTT GCAGTATCCA GAGAACAGGA ACTGGATGAT
CTTGAACGCG AAAAGCTGCG ACTGAAGAGC AAGCGGGACA AGCTTGAAGC CAGATTTGAA
AAGGAGCGGC TCCGTCTTTT TGAAAGTGTG CAGGCTCAGG CGCAGGACGT GCTGCGTGAC
TGGAAACAGG GTAAGCGCAG CCATAAGCAG GCGTTGAAAG ACCTTGCGGC GGTGCGTCAG
CGTCTTGTCA GCGAGCAGGC CGCGGCGCAG CAGGGACAGG CCGCCCCGGT GGAAGCACTT
TGCGCAGGCA TGACTGTCCG TTATATTCCC TGGGGAAAAA AGGCTGTCAT TGAAGAGGTT
GACGCACGCA AAGAGCGCCT TAAAGTTGAC TTGAACGGTG TGTCTTTGTG GGTTTCTTGC
AAGGATGTGG CCGCGTCGGT ACCGCCTGCC GCGCCGAAGA CCGTACGGCG TGAGTCGGTG
GTGGCGGAAG ATAACGTGCA GGAACCTGCC GGTCCGCCCA TGCGCGCAGA TCTGCGCGGG
ATGAGAGCCG ATGTGGCCGT GGCAGAGCTT GCGGCTGTGC TTGATAACGC ACTGCTTGCC
GGTACCCGCG AGGTGGAAGT GGTCCATGGC CGGGGTACAG GTGCGCTGCG GAAGGAAGTT
CACGCTTTTT TGCGGAGTTT TGCTGCGGTC GATGCTTTCC GCCTTGCACC CGAAGGGCGC
GGCGGCGACG GCATGACCAT TGTCGAATTG AAGTAG
 
Protein sequence
MQSRSAHVLE FDKVLRHLAG YAVSEAGAQA CLLLAPAADC DSARMRCAFF RQGQLWAERT 
GFSLAPFPDL SGVFRFLESP AAVLDIDALW AVRQVLAQSR ELAVAIGSGN TAENWPLLAQ
MLERYRMPEK ALSGLMRCLG EDGLLRDESS PELLLVREEI RRIHRLCTRK VKELANTYNL
SHYMQDEFLT LSSDRYVLPL KANFKGRLQG IIHEYSQTGE TCYFEPMFLV EINNDLQNLK
RQERAEERKI FEYLTGLLRA EMEGVNACWN LLVEADVLLA RCALAGAFGG IAIDFDDTRP
FSLRGVKHPL LALSSADTHP VDLELLEGQN CLIISGGNAG GKTVSLKTIG LVALMAAAGL
PVPAAEGSTL PCWRAVHAFI GDEQSLEGNV STFTAQIRNL SDIWQEVGSG SLVILDEFGA
GTDPAQGAAL AQAVVDELME KNVTVCVATH FPALKAYALS REGVRAASVL FDPGTKRPLY
RLAYDQVGAS QALDVAREHG MPDAVLRRAH NYLLLDGEDS AALIERLNAL AVSREQELDD
LEREKLRLKS KRDKLEARFE KERLRLFESV QAQAQDVLRD WKQGKRSHKQ ALKDLAAVRQ
RLVSEQAAAQ QGQAAPVEAL CAGMTVRYIP WGKKAVIEEV DARKERLKVD LNGVSLWVSC
KDVAASVPPA APKTVRRESV VAEDNVQEPA GPPMRADLRG MRADVAVAEL AAVLDNALLA
GTREVEVVHG RGTGALRKEV HAFLRSFAAV DAFRLAPEGR GGDGMTIVEL K