Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_1126 |
Symbol | |
ID | 5693960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 1335747 |
End bp | 1338392 |
Gene Length | 2646 bp |
Protein Length | 881 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641263720 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001529010 |
Protein GI | 158521140 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0124737 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTCCA CAGGCGCCAC TCCCATGATG CAGCAGTATC TCTCCATCAA GGAGCAGCAC CGGGACGCCA TTCTTTTTTA CCGAATGGGC GACTTTTACG AGATGTTTTT TGAGGACGCT CAAACCGCGG CCCCGGTCCT TGAGATCGCT CTGACCTCCC GCAACAAGAA CGACACCGAT CCCATTCCCA TGTGCGGTGT GCCGGTAAAG GCCGCGGACG GCTATATCGG CCGGCTCATC GAAAACGGGT TCAAGGTGGC GGTATGCGAG CAGACCGAGG ACCCTGCCGC GGCCAAAGGC CTGGTCCGGC GGGACGTGGT GCGCATCGTC ACTCCGGGCA TGATCATCGA CAATGCTCTG CTGGAAAAGG GAACCAATAA CTACGTTGTC TGCCTGGCCC ATGCCGACGG TGTTGTGGGG TTTGCCAGCG TGGATATCTC CACCGGCACT TTTCGGGTGT GCGAGTCCTC CGACCTGCGG GCCGTGCGCC ACGAGCTGCT GCGCATCGCG CCCCGGGAAG TGGTAATACC GGAATCCGGC GCCGATGACG CGGCGCTTTC GCCCTTTGTT TCCCTTTTTC CGCCGGCCAT TCGAACAACG CTCGCTAACC GGGAGTTTGA TTACAGAACC GCCTGCCAGC GGCTGACCGA CCAGTTTCAG ACCCGGTCCC TGGAGGGGTT CGGGTGCCGG GGCCTCAAAC CCGGCATTGT CGCGGCCGGG GCCCTGCTTT CCTATGTAAA CGATACCCAG AGACAGAAGG CGTCCCACCT GACCGGGCTG GAGGTCTACA GCATCGACCA GTACCTGCTG ATGGACGAGG TGACCTGCCG GAATCTGGAA CTGGTGGCCA ACCTTCGCAA CAATGGCAGG CAGGGAACCC TTATTGATGT GCTGGACGCC TGCGTCACCG CCATGGGCAG CCGCCTGCTG CGGCGCTGGA TGCTCTATCC CCTGCTGTCG GCAGAAGCCA TCAACCGGCG GCTGGACGCG GTGGCAGAGG CCAAAGAGGG CCTGGGCACT CGAAAGGCGG TGCGGGAACT GCTCAAACAG GTCTACGATA TCGAGCGGCT TACCAGCCGG GCCGTTATGG GCCGGGTCAC CCCTCGGGAC CTGCTGGCCT TGAAACAGAC CCTTTTCGCC CTGCCGGGTC TGGCAACAGA ACTGAAGTCT TTTGACAGCC CTTTTTTCTC CTTTGCCGGG GAACCGGGGC CCGAAGGCCT TGATAAGCTG GCCGGCCTGG CCGATCTGCT GAAGGCGGCG GTGCGGGAGG ACGCGCCGGT TTCCATCGCT GACGGCGGTG TCATCAACCC CGACTATCAT CCCCGGCTGG CCGAACTGGT AACCATCAGC CGGGACGGCA AGAGCAGCCT GGCCCGGCTG GAGGCAACGG AAAAAGAGAA GACCGGCATT TCCACCCTCA AGGTGCGGTA CAACAAGGTG TTTGGTTACT ATATCGAGGT ACCCCGGTCC CAGGTGGGGG CCGTGCCGGC TCACTACGTT CGCAAGCAGA CCCTTGTCAA CGGTGAGCGC TACATCACCG ACGAGCTTAA GGTGTTTGAG GAAAAAGCCC TGGGCGCCGA AGAACAGCGC GTTCGGCTGG AGCAGGAGTT GTTTGCCGAT ATCGTGGGCC GGGTGACCGC GTGCAGCCCG ATGCTGTTTG CCGTGGCCCG GGTCGCGGCC GGAATCGACG TGTTGTGCGC CCTGGCCCAG GTGGCCGATG ACCATGACTA TGTCCGGCCC GAGATGCTGT CCGGCGGCGA GATCATCATT GAAGAGGGCC GTCATCCCGT GGTGGAGCGC ATGCTTTCCG GCGAACGGTA CGTGCCCAAC AGCATTACGT TAAACGATAC CGACCGGCAG CTGCTGATCA TCACCGGTCC CAACATGGCG GGCAAATCCA CGGTGCTGCG CAAGGTGGCG CTGTTTTCGG TCATGGCCCA GATGGGCTCC TTTGTACCGG CCCGGCGGGC CGCCATGGGT GTGGTGGACC GGCTCTTTAC CCGGGTGGGG GCCCTGGACA ACCTGGCCTC AGGCCAGAGT ACCTTCATGG TGGAGATGGA AGAGACGGCC AACATCATCA ACAACGCCAC GCCGAAAAGC CTGGTGGTGA TCGACGAGAT CGGCCGGGGC ACCAGCACCT ACGACGGCCT GAGCATTGCC TGGGCCGTGG CCGAGGCCCT GCATGATCTG CACGGCAGAG GGGTCAAGAC CCTGTTTGCC ACCCATTACC ACGAGCTGAC CGAACTGGAA AACACCCGGC CCCGGGTGAA GAACTTTCAT ATTGCCGTCA AGGAGTGGAA CGATACCATC ATTTTTTTAA GAAAGCTGGT GGAGGGCAGC ACCAACCGCA GCTACGGCAT TCAGGTGGCA AGGCTGGCCG GCATTCCCGG CCCGGTGATC GCCAGGGCCA AGAAGATTCT GCTGGACATC GAGCAGGGCA CCTACAGTTT TGAGGCAAAG TCCGGCACTG CTCCGGGCAC CGGACAGAGC GGCCCGGTTC AGCTCTCCCT GTTTACCCCG CCGGAACAGA TGCTGGTGGA CCGGCTTCAA AAGGTCGACA TTTCAACCAT GACGCCCCTG GAGGCATTGA ACTGCCTTCA CGAACTGCAA CAGAAGGCGC ACGCCATATC GGAGACCGAC GGATGA
|
Protein sequence | MASTGATPMM QQYLSIKEQH RDAILFYRMG DFYEMFFEDA QTAAPVLEIA LTSRNKNDTD PIPMCGVPVK AADGYIGRLI ENGFKVAVCE QTEDPAAAKG LVRRDVVRIV TPGMIIDNAL LEKGTNNYVV CLAHADGVVG FASVDISTGT FRVCESSDLR AVRHELLRIA PREVVIPESG ADDAALSPFV SLFPPAIRTT LANREFDYRT ACQRLTDQFQ TRSLEGFGCR GLKPGIVAAG ALLSYVNDTQ RQKASHLTGL EVYSIDQYLL MDEVTCRNLE LVANLRNNGR QGTLIDVLDA CVTAMGSRLL RRWMLYPLLS AEAINRRLDA VAEAKEGLGT RKAVRELLKQ VYDIERLTSR AVMGRVTPRD LLALKQTLFA LPGLATELKS FDSPFFSFAG EPGPEGLDKL AGLADLLKAA VREDAPVSIA DGGVINPDYH PRLAELVTIS RDGKSSLARL EATEKEKTGI STLKVRYNKV FGYYIEVPRS QVGAVPAHYV RKQTLVNGER YITDELKVFE EKALGAEEQR VRLEQELFAD IVGRVTACSP MLFAVARVAA GIDVLCALAQ VADDHDYVRP EMLSGGEIII EEGRHPVVER MLSGERYVPN SITLNDTDRQ LLIITGPNMA GKSTVLRKVA LFSVMAQMGS FVPARRAAMG VVDRLFTRVG ALDNLASGQS TFMVEMEETA NIINNATPKS LVVIDEIGRG TSTYDGLSIA WAVAEALHDL HGRGVKTLFA THYHELTELE NTRPRVKNFH IAVKEWNDTI IFLRKLVEGS TNRSYGIQVA RLAGIPGPVI ARAKKILLDI EQGTYSFEAK SGTAPGTGQS GPVQLSLFTP PEQMLVDRLQ KVDISTMTPL EALNCLHELQ QKAHAISETD G
|
| |