Gene Smed_0030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0030 
Symbol 
ID5320857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp28334 
End bp31081 
Gene Length2748 bp 
Protein Length915 aa 
Translation table11 
GC content66% 
IMG OID640788961 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_001325725 
Protein GI150395258 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000355609 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATTTCC TGATGGATGC ATCGAACCGA ACGGGCGACG TCCTTTCCGT GTCCGATCTC 
GCGAGCGAAG AGAGCCGCTC CACCGCCACG CCGATGATGG AACAGTTCAT CGAGATCAAG
GCGAACAACC GGGATTCGCT CCTGTTTTAC CGCATGGGTG ATTTCTACGA GCTGTTTTTC
CAGGATGCGG TCGAGGCCTC GCGCGCACTC GGTATCACGC TGACGAAACG CGGACAGCAC
ATGGGGCAGG AAATCCCCAT GTGCGGCGTG CCGGTGCATG CGGCTGACGA TTACCTGCAG
AAGCTGATTG CGCTCGGCTA TCGCGTCGCG GTCTGCGAAC AGGTGGAGGA CCCTGCCGAG
GCGAAGAAGC GCGGCGGCAA ATCGGTCGTG CGCCGCGATG TCGTCCGCCT CGTAACGCCG
GGAACGATCA CCGAGGACAA GCTGCTCTCG CCCTCGGAAT CAAACTATCT CATGGCGCTC
GCGCGCATCA GGAGCGGCTC GGAGCCCGCT TATGCGCTTG CCTGGATCGA TATTTCGACG
GGAATCTTCC GCCTCGCCGA GACCGCGGAG AGCCGGCTTC TTGCCGACAT ATTGCGCATC
GAACCACGCG AACTGATCCT GCCGGATACC GTCTTCCACG ATCCGGATCT CAGGCCCGTT
TTCGACGTGC TCGGGCGGGT CGCGGTACCG CAGCCGGCCA TCCTTTTCGA CAGTGCGACG
GCGGAAGGCC GGATATCACG CTACTACGGC GTCGGGACGC TCGACGGCTT CGGCAGTTTC
TCGCGCGCGG AGCTCGCCGC CGCATCGGCG GCAGTCTCCT ATGTCGAAAA GACCCAGCTC
CAGGAGCGCC CGGCGCTCGG CATACCGGAA AGGGAAAGCG CCGCCTCGAC CCTCTTCATC
GATCCGGCAA CCCGTGCCAA TCTGGAGCTC GCCAAGACGC TGTCGGGCTC GCGCGACGGC
AGCCTGCTCA AGTCGCTCGA CCGTACGGTG ACGAGCGGTG GCGCCCGGCT GTTGGCCGAA
CGGCTGATGT CACCCCTGAC CGACCCGGAA CGGATCAATC GGCGGCTCGA TTCCATCGAA
ATGCTGGCCG ACCAGCCGCG CTTCACGGCC GACGCTCGCG ATGCGCTTCG CAGGGCGCCG
GACATGCCGC GCGCCCTGTC GCGGCTCGCG CTTGGCCGCG GCGGCCCTCG CGATCTCGGT
GCCATACAGG CGGGCATGCG GGCCGCGGTC GCGATCGCGG CGCTTCTCTC GGGTGCCGAG
CTTTCGGTGG AACTGGCTGA AGCGCGTGAC GCGATCGCGG GCTTGCCGCG GGACCTCCTC
GCGCGCCTCG ACGCGACCCT TGCGGAGGAA TTGCCGCTTT TGAAGCGCGA TGGCGGTTTC
GTCCGCGAAG GTGCTAACGC CGAACTCGAC GAGATGCGCG CTCTGCGCGA CCAGTCGCGC
CGCGTGGTTG CCGGTCTTCA GCTCCAATAT TGCGAAGAGA CCGGAATCAA GTCGCTGAAA
ATCAAGCATA ACAACGTGCT CGGCTACTTC ATCGAGGTGA CCGCCGGAAA TGCCGGCGCC
ATGATCGATA CGGATGCGGG CCGTGCCCGC TTCATCCACC GCCAGACCAT GGCGAACGCC
ATGCGCTTCA CCACGACCGA GTTGGCGGAG CTCGAAACCA AGATCGCCAA TGCCGCGGAC
CGCGTTCTGG CGATCGAACT CGAGACTTTC GAGGTCATGA CGCGCGAGGT GGTCGCCGAG
GCCGAAGCGA TCAAAGCGGC GGCGCTGGCG CTGGCGACGA TCGACGTCTC GGCCGGACTG
GCGGTGCTTG CGGAGGAGCA GAACTATACG CGCCCCGCCG TCGACCGCTC GCGCATGTTC
GCGATCGACG GGGGCCGCCA CCCCGTGGTG GAGCAGGCGT TGAGACGCCA GGCCGCCAAT
CCCTTCGTCG CGAATGGCTG CGACCTTTCC CCGCCCGGTG GGGAAGAGGG CGGCGCGATC
TGGCTCCTCA CCGGCCCCAA CATGGGCGGC AAGTCGACTT TCCTGCGGCA GAACGCGCTG
ATCGCGATCA TGGCGCAGAT GGGGTCCTTC GTGCCTGCAT CCGCCGCGCA TATCGGCGTC
GTCGACCGCC TCTTCTCACG CGTCGGGGCA TCGGACGACC TCGCGCGTGG CCGTTCGACC
TTCATGGTCG AAATGGTCGA GACGGCTGCG ATTCTCAACC AGGCGACCGA CCGCTCGCTG
GTGATCCTCG ACGAGATCGG CCGCGGCACG GCGACCTTCG ACGGCCTGTC GATCGCCTGG
GCGGCTGTCG AGCATCTGCA CGAGGTCAAT CGTTGCCGCG GGCTGTTCGC GACCCATTTC
CACGAATTGA CGGTGCTTTC CGAAAAACTC GGCCGGCTTT CCAACGCGAC CATGCGCGTC
AAGGAGTGGG ACGGCGACGT CATATTCCTG CATGAAGTGG GGCCAGGGGC AGCCGACCGC
TCCTACGGAA TCCAGGTCGC CCGGCTTGCC GGCTTGCCGG CGTCGGTCGT CGCCCGCGCG
CGGGACGTTC TCGCCAAGCT TGAAGACGCG GACCGCAAAA ATCCGGCGAG CCAGCTGATC
GACGACCTGC CGCTGTTCCA GGTCGCGGTC CGGCGCGAGG AGGCGGCGAG GGCACCGGGA
CTTTCCAGGG CGGAGGAGGC CCTGAAGGCG CTCAACCCGG ACGACATGAC GCCGCGCGAG
GCGCTCGACG CGCTGTACGC GCTCAAGAAG CAGCTTTCCA ACCGCTGA
 
Protein sequence
MNFLMDASNR TGDVLSVSDL ASEESRSTAT PMMEQFIEIK ANNRDSLLFY RMGDFYELFF 
QDAVEASRAL GITLTKRGQH MGQEIPMCGV PVHAADDYLQ KLIALGYRVA VCEQVEDPAE
AKKRGGKSVV RRDVVRLVTP GTITEDKLLS PSESNYLMAL ARIRSGSEPA YALAWIDIST
GIFRLAETAE SRLLADILRI EPRELILPDT VFHDPDLRPV FDVLGRVAVP QPAILFDSAT
AEGRISRYYG VGTLDGFGSF SRAELAAASA AVSYVEKTQL QERPALGIPE RESAASTLFI
DPATRANLEL AKTLSGSRDG SLLKSLDRTV TSGGARLLAE RLMSPLTDPE RINRRLDSIE
MLADQPRFTA DARDALRRAP DMPRALSRLA LGRGGPRDLG AIQAGMRAAV AIAALLSGAE
LSVELAEARD AIAGLPRDLL ARLDATLAEE LPLLKRDGGF VREGANAELD EMRALRDQSR
RVVAGLQLQY CEETGIKSLK IKHNNVLGYF IEVTAGNAGA MIDTDAGRAR FIHRQTMANA
MRFTTTELAE LETKIANAAD RVLAIELETF EVMTREVVAE AEAIKAAALA LATIDVSAGL
AVLAEEQNYT RPAVDRSRMF AIDGGRHPVV EQALRRQAAN PFVANGCDLS PPGGEEGGAI
WLLTGPNMGG KSTFLRQNAL IAIMAQMGSF VPASAAHIGV VDRLFSRVGA SDDLARGRST
FMVEMVETAA ILNQATDRSL VILDEIGRGT ATFDGLSIAW AAVEHLHEVN RCRGLFATHF
HELTVLSEKL GRLSNATMRV KEWDGDVIFL HEVGPGAADR SYGIQVARLA GLPASVVARA
RDVLAKLEDA DRKNPASQLI DDLPLFQVAV RREEAARAPG LSRAEEALKA LNPDDMTPRE
ALDALYALKK QLSNR