Gene BMA10229_A3145 details


Gene Information

Locus tag: BMA10229_A3145
Symbol: mutS
ID: 4792103
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei NCTC 10229
Kingdom: Bacteria
Replicon accession: NC_008836
Strand:
Start bp: 3180338
End bp: 3183157
Gene Length: 2820 bp
Protein Length: 939 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: DNA mismatch repair protein MutS
Protein accession: YP_001029086
Protein GI: 124385860
COG category:
COG ID:
TIGRFAM ID:
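
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon (the gene sequence in this record ends in TGA); the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3180338, 3183157)
print(length)                     # 2820
print(protein_length_aa(length))  # 939
```

Both values match the Gene Length and Protein Length fields of this record.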


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCCGCA AAGCCATCGA CAACGGGCTT GACACGTGCG AATCGCGACC GAATCGACAC 
ATTGGCCGTC CGGCCAGCTT CGCGCGCCGT CCCGACAAGG TAAACTCTCA CCCAATTCCG
TATACGCAGA CAGATATCCG CACGATGGCC ACCCAAATCG ACGCCTCCTC CGAAGCAGCA
GCGGCTACGG CCGCCGCGCA GCACACGCCG ATGATGCAGC AGTACCTACG CATCAAGTCG
GAGCACCCCG ACACGCTCGT GTTCTACCGG ATGGGCGACT TCTACGAGCT CTTCTTCGAA
GACGCGGAAA AAGCCGCGCG TCTGCTCGAC CTGACGCTCA CGCAACGCGG CGCATCCGCC
GGCACGCCGA TCAAGATGGC GGGCGTGCCG CATCACGCGG TCGAGCAATA CCTCGCGAAG
CTCGTGAAAT TCGGCGAATC GGCGGCGATC TGCGAACAGA TCGGCGACCC CGCGACGTCG
AAAGGCCCCG TCGAGCGCAA GGTCGTGCGC GTCGTGACGC CGGGCACGCT GACCGACGCC
GCGCTGCTGT CCGACAAGAG CGACGTGTTT CTGCTCGCGC TGTGCGTCGG CCACAACAAA
CGCGGCGTCG CGTCGAACAT CGGCCTCGCC TGGCTCAATC TCGCGAGCGG CGCGCTGCGG
CTCGCCGAGC TCGCGCCGGA TCAGCTCGGC GCGGCGCTCG AGCGCATCCG GCCCGCCGAG
ATTCTCGCGG CCGACGGCAC GATCGAATCG GTGCCGGCCG GCATGGGCGC GATCACGCGC
GTGCCCGCGT GGCACTTCGA TATCGCGTCG GGCACGCAGC GCCTCTGCGA TCAGCTCGAA
GTCGCGAGCC TCGACGGCTT CGGCGCGCAA GCACTCACGA GCGCGAACGG CGCGGCGGGC
GCGCTGCTGA TCTACGCAGC GGCCACGCAG GGCCAGCAGC TTCGCCACGT GCGCAGCCTC
AAGGTCGAAA ACGAATCCGA ATACATCGGG CTCGATCCGT CGACGCGGCG CAATCTCGAG
CTCACCGAGA CGCTGCGCGG CACCGAATCG CCGACGCTCT ATTCGCTGCT CGACACCTGC
TGCACCGCGA TGGGCAGCCG CCTGTTGCGC CATTGGCTGC ATCATCCGCC GCGCGCGTCG
GTCGCCGCGC AGGCGCGCCA TCAGGCAATC GGCGCGCTGC TCGACGCCCC CCCGAACGCC
GGCCTCGACA GCCTGCGCTC GGCGCTGCGG CAAATCGCCG ACGTCGAGCG GATCACCGGC
CGCCTCGCGC TGCTATCCGC GCGGCCACGC GATCTGTCCA GCCTGCGCGA CACGTTCGCC
GCCCTGCCCG CGCTGCGCGA GCGCGTTGCC GAGATCGCAT CGAACGCGGC CGCGCTCGGC
CGCCTCGAAG CCGCGCTCGA GCCGCCGCCC GGTTGCCTCG ATCTGCTCAC GCGCGCCATC
GCGGCCGAGC CGGCGGCGAT GGTGCGCGAC GGCGGCGTGA TCGCCCGAGG CTACGACGCC
GAGCTCGACG AGTTGCGCGA CATTTCGGAG AACTGCGGCC AGTTCCTGAT CGATCTCGAA
ACGCGCGAGC GCGCACGCAC CGGCATTTCG AACCTGCGCG TCGAGTACAA CAAGGTCCAC
GGTTTCTATA TCGAAGTCAC GCGCGGCCAG ACCGACAAGG TGCCCGACGA CTACCGCCGC
CGCCAGACGC TCAAGAACGC CGAGCGCTAC ATCACGCCCG AGCTGAAGAC GTTCGAGGAC
AAGGCGCTGT CCGCGCAGGA ACGCGCGCTC GCTCGCGAAC GCGCGCTGTA CGACGGCGTG
CTGCAAGCGC TCCTGCCCCA TATCGAGGGT TGCCAGCGCG TCGCAAGCGG CCTCGCCGAA
CTCGATCTGC TCGCGGCGTT CGCCGAGCGC GCCCGCACGC TCGACTGGGT GGCGCCGGAA
TTCACCGACG AGATCGGCAT CGAAATCGAT CAGGGCCGCC ATCCGGTCGT CGAGGCACAG
GTCGAGCAAT TCATCGCGAA CGATTGCGCC CTGAACCCCG AGCGAAAGCT GCTCCTCATC
ACCGGCCCGA ACATGGGCGG TAAATCGACG TTCATGCGTC AGACCGCGCT CATCGCGCTG
ATGGCGTACG TCGGCAGCTA CGTGCCGGCG AAAGCCGCGC GCTTCGGCCC CATCGACCGC
ATCTTCACGC GCATCGGCGC GGCGGACGAC CTCGCGGGCG GCCGCTCGAC GTTCATGGTC
GAAATGACGG AGGCCGCCGC GATCCTGAAC GACGCAACGC CGCACAGCCT CGTGCTGATG
GACGAAATCG GCCGCGGCAC ATCGACGTTC GACGGCCTCG CGCTCGCCTG GGCGATCGCA
CGCCATCTGC TGTCGCACAA CCGCTGCTAC ACGCTGTTCG CGACGCACTA CTTCGAGCTC
ACGCAACTGC CGGCGGAATT CCCGCAGGCG GCGAACGTGC ATCTGTCCGC CGTCGAGCAC
GGTCACGGCA TCGTATTCCT GCATGCGGTC GAGGAAGGGC CGGCGAATCA GAGCTACGGC
CTGCAGGTCG CGCAGTTGGC GGGGGTTCCC GCGCCGGTGA TTCGCGCCGC GCGCAAGCAT
CTCGCGCACC TCGAACAGCA GTCCGCCGCC CAGGCGACGC CGCAGCTCGA TCTCTTCGCC
GCACCACCGG TCGTCGACGA GCCGGAATGC AACGAGCCCC CCGCCGCCGC GACGCCGCAC
CCCGCGCTCG AGCGCCTGCT CGAGCTCGAT CCGGACGACT TGAAACCGCG CGACGCGCTC
GATCTGCTTT ACGAACTTCA CACACTCGCC CGCTCGGGCC CGGCGGATGC GCAACGCTGA
 
Protein sequence
MGRKAIDNGL DTCESRPNRH IGRPASFARR PDKVNSHPIP YTQTDIRTMA TQIDASSEAA 
AATAAAQHTP MMQQYLRIKS EHPDTLVFYR MGDFYELFFE DAEKAARLLD LTLTQRGASA
GTPIKMAGVP HHAVEQYLAK LVKFGESAAI CEQIGDPATS KGPVERKVVR VVTPGTLTDA
ALLSDKSDVF LLALCVGHNK RGVASNIGLA WLNLASGALR LAELAPDQLG AALERIRPAE
ILAADGTIES VPAGMGAITR VPAWHFDIAS GTQRLCDQLE VASLDGFGAQ ALTSANGAAG
ALLIYAAATQ GQQLRHVRSL KVENESEYIG LDPSTRRNLE LTETLRGTES PTLYSLLDTC
CTAMGSRLLR HWLHHPPRAS VAAQARHQAI GALLDAPPNA GLDSLRSALR QIADVERITG
RLALLSARPR DLSSLRDTFA ALPALRERVA EIASNAAALG RLEAALEPPP GCLDLLTRAI
AAEPAAMVRD GGVIARGYDA ELDELRDISE NCGQFLIDLE TRERARTGIS NLRVEYNKVH
GFYIEVTRGQ TDKVPDDYRR RQTLKNAERY ITPELKTFED KALSAQERAL ARERALYDGV
LQALLPHIEG CQRVASGLAE LDLLAAFAER ARTLDWVAPE FTDEIGIEID QGRHPVVEAQ
VEQFIANDCA LNPERKLLLI TGPNMGGKST FMRQTALIAL MAYVGSYVPA KAARFGPIDR
IFTRIGAADD LAGGRSTFMV EMTEAAAILN DATPHSLVLM DEIGRGTSTF DGLALAWAIA
RHLLSHNRCY TLFATHYFEL TQLPAEFPQA ANVHLSAVEH GHGIVFLHAV EEGPANQSYG
LQVAQLAGVP APVIRAARKH LAHLEQQSAA QATPQLDLFA APPVVDEPEC NEPPAAATPH
PALERLLELD PDDLKPRDAL DLLYELHTLA RSGPADAQR
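
The sequences above are printed in space-separated blocks of ten characters, sixty per row. A minimal sketch of how such a listing might be joined into one string and its GC fraction computed is shown below; `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first row of the gene sequence is used for the demonstration.

```python
def join_blocks(text):
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence from this record.
first_row = "ATGGGCCGCA AAGCCATCGA CAACGGGCTT GACACGTGCG AATCGCGACC GAATCGACAC"
seq = join_blocks(first_row)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of this row only
```

Passing the full gene sequence listing to `join_blocks` and then to `gc_fraction` would give the whole-gene value to compare against the GC content field.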