Gene Information |
Locus tag | BTH_I1913 |
Symbol | mutS |
ID | 3848501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 2159661 |
End bp | 2162342 |
Gene Length | 2682 bp |
Protein Length | 893 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637841582 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_442443 |
Protein GI | 83719722 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
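The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (BTH_I1913, mutS).
length = gene_length_bp(2159661, 2162342)
print(length)                     # 2682
print(protein_length_aa(length))  # 893
```

Both results match the Gene Length and Protein Length fields. The strand field does not affect this calculation, since the coordinates are given on the replicon regardless of orientation.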
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.396246 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAAGG ACGTGCAGGA AATTACGGAT TCAAAACAGC AGCTTACGGA AGCCGCATTC AGCAATCATA CGCCGATGAT GCAGCAGTAC CTTCGCATTA AGGCGGAGCA TCCCGAAACG CTCGTGTTCT ACCGGATGGG CGACTTCTAC GAGCTCTTCT TCGAAGACGC GGAAAAAGCC GCGCGCCTGC TCGACCTGAC CCTCACGCAA CGCGGCGCAT CCGCCGGCAC GCCGATCAAG ATGGCGGGCG TGCCGCATCA CGCGGTCGAG CAATACCTCG CGAAGCTCGT GAAATTCGGC GAATCGGCGG CGATCTGCGA ACAGATCGGC GATCCCGCGA CGTCGAAAGG CCCCGTCGAG CGCAAGGTCG TGCGTGTCGT GACGCCGGGC ACGCTGACCG ACGCCGCACT GCTGTCCGAC AAGAGCGACG TGTTCCTGCT CGCGCTTTGC GTCGGACACA ACAAGCGCGG CGTCGCGTCG ACCATCGGCC TTGCGTGGCT CAATCTCGCG AGCGGCGCGC TGCGGCTCGC CGAGATCGCG CCGGACCAGC TCGGCGCGGC GCTCGAGCGC ATCCGCCCCG CCGAGATCCT CGCGGCCGAC GGCGCGATCG AAGCGGTGCC GGCCGGCACG GGCGCGATCA CGCGCGTGCC GGCGTGGCAC TTCGATATCG CGTCGGGCAC GCAACGCCTC TGCGATCAAC TCGAAGTCGC GAGCCTCGAC GGCTTCGGCG CGCAGGCGCT CACGAGCGCG AACGGAGCGG CGGGCGCGCT GCTAATCTAC GCGGCGGCGA CGCAAGGCCA GCAACTTCGC CACGTGCGCA GCCTCAAGGT CGAAAACGAA TCCGAGTACA TCGGGCTCGA CCCGTCGACG CGGCGCAACC TCGAACTCAC CGAAACGCTG CGCGGCACCG AATCGCCGAC GCTCTATTCG CTGCTCGACA CCTGCTGCAC CGCGATGGGC AGCCGCCTGC TGCGCCACTG GCTGCATCAT CCGCCGCGCG CATCGGTCGC CGCGCAGGCA CGCCACCAGG CGATCGGCGC GTTGCTCGAC GCGCCCGTGC ACGTCGGCCT CGACAGCCTG CGCTCGGCGC TGCGGCAGAT CGCCGATGTC GAGCGAATCA CCGGCCGCCT CGCGCTGCTG TCCGCGCGGC CACGCGATCT GTCCAGCCTG CGCGACACGT TCGCCGCCCT CCCCGCGCTG CGCGAACGCG TGGCCGAGAT CGCGCCGAAC GCTGCCGCGC TCGGCCGCCT CGAAGCCGCG CTCGAGCCGC CGCCCGGCTG CCTCGATCTG CTCACGCGCG CGATCGCGCC CGAGCCGGCG GCAATGGTGC GCGACGGCGG CGTGATCGCC CGCGGCTACG ACGCCGAGCT CGACGAGCTG CGCGACATCT CGGAGAACTG CGGCCAGTTC CTGATCGATC TCGAAACGCG CGAGCGCGCA CGCACCGGCA TTCCGAACCT GCGCGTCGAG TACAACAAGG TTCACGGCTT CTACATCGAG GTCACGCGCG GCCAGACCGA CAAGGTGCCC GACGACTATC GCCGCCGCCA GACGCTCAAG AACGCGGAAC GCTACATCAC GCCCGAACTG AAGACGTTCG AGGACAAGGC GCTGTCCGCG CAGGAGCGCG CGCTCGCCCG CGAACGCGCG CTTTACGACA GCGTGCTGCA AGCGCTATTG CCTCACATCG AGGGTTGCCA GCGCGTCGCG AGCGGCCTCG CGGAGCTCGA CCTGCTTGCG GCATTCGCCG AGCGCGCCCG CACGCTCGAC TGGGTCGCGC CGGAATTCAT CGACGAGATC 
GGCATCGAGA TCGACCAAGG CCGCCATCCG GTCGTCGAAG CACAGGTCGA GCAGTTCATC GCGAACGATT GCGCGCTGAA CTCCGATCGG AAGCTGCTCC TCATCACCGG TCCGAACATG GGCGGTAAAT CGACGTTCAT GCGACAGACG GCGCTCATCG CACTGATGGC GTACGTCGGC AGCTACGTGC CGGCGAAGGC GGCGCGCTTC GGCCCGATCG ACCGCATCTT CACGCGCATC GGTGCGGCGG ACGATCTCGC AGGCGGCCGC TCGACGTTCA TGGTCGAAAT GACAGAAGCT GCCGCGATCC TGAACGACGC GACGCCGCAA AGCCTTGTGC TGATGGACGA AATCGGCCGC GGCACGTCGA CGTTCGACGG CCTCGCGCTC GCCTGGGCGA TCGCGCGCCA TTTGCTGTCG CACAATCGCT GCTATACGTT GTTCGCGACG CACTACTTCG AGCTCACGCA ATTGCCCGCG GAATTCCCGC AAGCGGCGAA CGTGCATCTG TCGGCGGTCG AGCACGGCCA CGGCATCGTG TTCCTGCACG CGGTCGAGGA AGGCCCGGCG AACCAGAGCT ATGGCCTGCA GGTCGCGCAA CTCGCGGGCG TTCCGGCGCC GGTGATTCGC GCCGCCCGCA AGCATCTCGC GCACCTCGAG CAGCAGTCCG CAGCCCAGGC GACGCCGCAG CTCGATCTCT TCGCCGCGCA ACCGATCGTC GACGAGCAGG AGTGCAACCA GCCGCCGGCA GCGGCGCCGC ACCCGGCGCT CGAGCGCCTG CTCGCGCTCG ATCCGGACGA CCTGAAGCCG CGCGACGCGC TCGACCTGCT CTACGAACTG CGCGCGCTCG CCCGCTCAGG CGCAACGGAT GCGCAACGCT GA
|
Protein sequence | MDKDVQEITD SKQQLTEAAF SNHTPMMQQY LRIKAEHPET LVFYRMGDFY ELFFEDAEKA ARLLDLTLTQ RGASAGTPIK MAGVPHHAVE QYLAKLVKFG ESAAICEQIG DPATSKGPVE RKVVRVVTPG TLTDAALLSD KSDVFLLALC VGHNKRGVAS TIGLAWLNLA SGALRLAEIA PDQLGAALER IRPAEILAAD GAIEAVPAGT GAITRVPAWH FDIASGTQRL CDQLEVASLD GFGAQALTSA NGAAGALLIY AAATQGQQLR HVRSLKVENE SEYIGLDPST RRNLELTETL RGTESPTLYS LLDTCCTAMG SRLLRHWLHH PPRASVAAQA RHQAIGALLD APVHVGLDSL RSALRQIADV ERITGRLALL SARPRDLSSL RDTFAALPAL RERVAEIAPN AAALGRLEAA LEPPPGCLDL LTRAIAPEPA AMVRDGGVIA RGYDAELDEL RDISENCGQF LIDLETRERA RTGIPNLRVE YNKVHGFYIE VTRGQTDKVP DDYRRRQTLK NAERYITPEL KTFEDKALSA QERALARERA LYDSVLQALL PHIEGCQRVA SGLAELDLLA AFAERARTLD WVAPEFIDEI GIEIDQGRHP VVEAQVEQFI ANDCALNSDR KLLLITGPNM GGKSTFMRQT ALIALMAYVG SYVPAKAARF GPIDRIFTRI GAADDLAGGR STFMVEMTEA AAILNDATPQ SLVLMDEIGR GTSTFDGLAL AWAIARHLLS HNRCYTLFAT HYFELTQLPA EFPQAANVHL SAVEHGHGIV FLHAVEEGPA NQSYGLQVAQ LAGVPAPVIR AARKHLAHLE QQSAAQATPQ LDLFAAQPIV DEQECNQPPA AAPHPALERL LALDPDDLKP RDALDLLYEL RALARSGATD AQR
|
| |
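The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a field might be processed, the code below strips the whitespace, computes the GC fraction, and translates the gene sequence codon by codon. It assumes the sequence is already given in coding orientation (5' to 3', starting at the start codon), which the leading ATG suggests. The codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code; the tables differ only in which codons may act as start codons, which this sketch does not model. All names here (`clean`, `gc_fraction`, `translate`) are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same assignments). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks between the ten-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; trailing partial codons are ignored."""
    s = clean(seq)
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s) - 2, 3))


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGGATAAGG ACGTGCAGGA AATTACGGAT"))  # MDKDVQEITD
print(translate("GCGCAACGCT GA"))                     # AQR*
```

The two printed fragments agree with the start (MDKDVQEITD) and end (AQR) of the protein sequence field, with the final TGA read as the stop codon. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content field.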