Gene Dret_1339 details

Gene Information

Locus tag: Dret_1339
Symbol:
ID: 8419168
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1564679
End bp: 1567369
Gene Length: 2691 bp
Protein Length: 896 aa
Translation table: 11
GC content: 59%
IMG OID: 645037915
Product: DNA mismatch repair protein MutS
Protein accession: YP_003198205
Protein GI: 258405463
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
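
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the last codon is a
    stop codon, so it does not contribute a residue.
    """
    gene_length = end_bp - start_bp + 1
    codons, remainder = divmod(gene_length, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length, codons - 1


# Values taken from the record above.
print(cds_lengths(1564679, 1567369))  # (2691, 896)
```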


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.742998
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTAAGA CCAAGCTGAC CCCTATGCTC GAACAGTATT TGCGCATCAA GGAAGACTAT 
CCTGATGCGC TCTTGTTTTT TCGAATGGGG GATTTTTACG AACTCTTTTT TGAAGACGCC
GAGACTGCGG CCCGGGTCCT GCAGATCACC CTGACCTCCA GAAATCCGAA TGCGGAAACC
AAGGTCCCTA TGGCCGGCGT GCCCCACCAC GCCACGGAGG AATATCTCCG GCAGCTTCTG
GAGCAAGGGT ACAAGGTGGC CATCTGTGAC CAGGTGGAGG ACCCCCGCCA GGCCAAAGGG
CTGGTTAAGC GCGAGGTGAC CCGTGTTTTG ACCCCTGGTA CAGTGGTTGA GGATTCGACC
CTGTCGGCCA AGACCAGTAA TTATCTGGCG GCCGTGTGCT GGCATGGAGG CAGCAAGACC
GGCGCTGCGG CGTGGATCGA TTTTTCCACC GGACAATGGA CGGGGGTGCA GTCCAAGCAC
CAGGTCCAAT TGTGGCAATG GCTGATCAAG ATCCAGCCGC AGGAGGTCCT GATGCCGGAG
GGCACGGAAC TCCCGGAGCA GGCCCAGGTG CTGCGCGACA AGATCCAATT CTGTCCGTAT
AACGGGTATT TTGAGCCGGG CCGGGCCCGT GAACGTCTCC TGCAGGCTCA AGATGTGGCC
AGCCTGACTC CCTTGGATCT GGAGGATAAA CCCGCCCTGG TCCAGGCGTG CGGTGCGCTG
CTGGCCTATT TGCACACAAC GCAGCGCTGC GAAGACCTTT CCCATCTCGG CCAGTTCCAA
CCAATCCAGC CCAACCGATT TCTCCAACTT GACGAGGTCA CTGAGCGCAA TCTCGAACTC
TTTCAGCGTC TTGACGGCGG CAAGGGACCA GGGACGCTGT GGCACGCCCT GGACCGGACC
CTGACCCCGA TGGGCGGGCG ACTTCTTCAG CAGCGGCTGC GCCAGCCGTG GCGTGACCTG
CGAACCATCA CCGCTCACCA GGGTGTTGTC GCCCTGTTGG TGGACGACGA CGGATTGCGC
CAGAGTCTGC GCGAGCGTCT TGACGCCGTT TATGATCTGG AACGGTTGAC GACCCGTATT
TTTCTCGGCC GGTGTACTCC CAAGGATTTC ATTGCCCTGC GCAATAGCCT GAAAGCGTTG
CCCGCCCTGC AGTCCCTGCT TGAAGAAGAC CGCCAGTGGC CTGAGCTGCT TGGTCAGGAA
AAGCGGGCCT GGGACAACCT CGATGATGTC CGGGAACTGC TGGAGAGGAG TCTCGTGGAC
GCTCCCCCTC TGGTGATCAC CGAGGGCGGA CTGTTTCGGC ACGGTTTTGA TCCGGAACTC
GACGAATTGC TCGATTTGAG TGAGGATGGG GAAGGACGAC TGCAGGAATT GTTGCACAAA
GAGCAGCAGG CTTCCTCATT GCCCAAGCTG AAGTTGGGCT ACAATCGCGT TTTCGGGTAT
TATTTCGAAT TGAGCAAAGC CCATAAGGGG CCGGTACCCG ACCACTTCAT CCGGCGACAG
ACCCTGGTCA ACGCCGAGCG GTACATCACC GAAGAACTCA AAACGATCGA GGATAAGGTT
TTCAGTGCCG CAGAGAAGCG CAAGGGCCTG GAATATAATC TTTTCCAGAA CTTGCGTGAG
CAGGTTGCCG CGCACCGGGA ACGGGTTATG GCTGTAGCCG GTATCCTGGC CCGCCTCGAT
TATTGGCAGG GATTGGCCCA CGCCGCCCGG CAATGGGAGT GGAGCCAGCC TGAGCTGCAT
TCAGGGCTTG ATCTGCGCAT CATCGGCGGT CGCCATCCCG CAGTTGAAGC CACGCAGGGG
CGAACCGATT ATATTCCCAA CGACGTTCGC ATCGAGGGAG ATGACCGCGT TTTGCTCATC
ACTGGACCGA ATATGGCCGG GAAATCGACG GTCCTGCGCC AAACGGCCAT TATCTGCATC
CTGGCCCAGA TCGGCTCCTT CGTTCCGGCT CGGGAGGGCT GGATCGGGCT GTGCGACCGC
ATTTTTACCC GTGTCGGGGC CTCGGACAAT CTCGCTCAAG GACAATCGAC CTTCATGGTC
GAGATGACGG AAACGGCCCG CATTTTGCGC CAAGCCAGCC GAAACAGTCT GGTCATCCTT
GATGAGATCG GACGCGGCAC AAGCACCTTC GACGGATTGG CTTTGGCCTG GGCGGTCGTT
GAGGATCTCG TCCAGCGTGG TCATGGCGGC GTGCGCACCC TCTTTGCCAC CCACTACCAT
GAGTTGACCG ACCTGGAGGG ACAGCTTCCC GGGGTGCGCA ATTACAATAT CGCGGTCAAG
GAATGGCGAG GGGATATCGT TTTTCTCCGC CGGCTTGTGC CTGGCCCGGC GGATCGCAGT
TACGGCATCG AGGTCTCGCA ATTGGCCGGT GTACCCCAGG GCGTCGTCAA GCGAGCCAAG
GCTATCCTTG CGCAACTCGA GGAAAAAAGC CGGGGACTGC GCCATACCCC CGAGAAAGGC
AGCCACCGGC AGCAATCCCT GCTCCCAGGC CTTTTCGACC CCCCAGAAAA GCCCCAGCCG
GAGCCGACCG CGCCGCAGGA CCAGGGGCCG CATCCCCTGG AAGAAGCTTT GCAGGAATTG
GATCTCAACG GGATGACCCC GCTTGATGCC CTGAACCTCT TATCTCGATG GAAGCACGAA
TGGACACCTC CGCACGATTC TCAGGATACG CCTGGTACCT CCTCGTCATA A
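
The GC content reported for this gene can be recomputed from the gene sequence above. This is a minimal sketch, assuming the sequence is pasted as plain text with the spaces and line breaks shown here; `gc_content` is a hypothetical helper name.

```python
def gc_content(seq):
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (the 10-base grouping and line breaks used in the record)
    is removed before counting.
    """
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# Usage on the first 20 bases of the gene sequence above.
fraction = gc_content("GTGAGTAAGA CCAAGCTGAC")
print(f"{fraction:.0%}")
```

Running it on the full 2691 bp sequence should give a value comparable to the GC content field in the gene information block.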
 
Protein sequence
MSKTKLTPML EQYLRIKEDY PDALLFFRMG DFYELFFEDA ETAARVLQIT LTSRNPNAET 
KVPMAGVPHH ATEEYLRQLL EQGYKVAICD QVEDPRQAKG LVKREVTRVL TPGTVVEDST
LSAKTSNYLA AVCWHGGSKT GAAAWIDFST GQWTGVQSKH QVQLWQWLIK IQPQEVLMPE
GTELPEQAQV LRDKIQFCPY NGYFEPGRAR ERLLQAQDVA SLTPLDLEDK PALVQACGAL
LAYLHTTQRC EDLSHLGQFQ PIQPNRFLQL DEVTERNLEL FQRLDGGKGP GTLWHALDRT
LTPMGGRLLQ QRLRQPWRDL RTITAHQGVV ALLVDDDGLR QSLRERLDAV YDLERLTTRI
FLGRCTPKDF IALRNSLKAL PALQSLLEED RQWPELLGQE KRAWDNLDDV RELLERSLVD
APPLVITEGG LFRHGFDPEL DELLDLSEDG EGRLQELLHK EQQASSLPKL KLGYNRVFGY
YFELSKAHKG PVPDHFIRRQ TLVNAERYIT EELKTIEDKV FSAAEKRKGL EYNLFQNLRE
QVAAHRERVM AVAGILARLD YWQGLAHAAR QWEWSQPELH SGLDLRIIGG RHPAVEATQG
RTDYIPNDVR IEGDDRVLLI TGPNMAGKST VLRQTAIICI LAQIGSFVPA REGWIGLCDR
IFTRVGASDN LAQGQSTFMV EMTETARILR QASRNSLVIL DEIGRGTSTF DGLALAWAVV
EDLVQRGHGG VRTLFATHYH ELTDLEGQLP GVRNYNIAVK EWRGDIVFLR RLVPGPADRS
YGIEVSQLAG VPQGVVKRAK AILAQLEEKS RGLRHTPEKG SHRQQSLLPG LFDPPEKPQP
EPTAPQDQGP HPLEEALQEL DLNGMTPLDA LNLLSRWKHE WTPPHDSQDT PGTSSS