Gene Rleg_0539 details


Gene Information

Locus tag: Rleg_0539
Symbol: mutL
ID: 8011730
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 560667
End bp: 562469
Gene Length: 1803 bp
Protein Length: 600 aa
Translation table: 11
GC content: 67%
IMG OID: 644823129
Product: DNA mismatch repair protein
Protein accession: YP_002974382
Protein GI: 241203286
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
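
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 560667
END_BP = 562469


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of a complete CDS: one residue per codon, minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1803 600
```

Both values match the Gene Length and Protein Length fields in the record.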


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.111991
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCATCA GACAGCTTTC CGAAACGCTC ATCAACCAGA TCGCCGCCGG CGAAGTCATC 
GAACGGCCGG CGAGCGCTGC CAAGGAACTG ATCGAAAACG CGCTCGACGC GGGTGCGACG
CGCATCGAGA TCGCCACATC AGGCGGCGGC AAGGCGCTGC TGCGCGTCAG CGACAACGGC
TCGGGCATGG ACGCGGCCGA TCTGGAACTG GCGGTCAGGC GCCACTGCAC CTCGAAGATC
TCAGAGACGC TTGAGGATAT CCGCACGCTC GGCTTCCGCG GCGAGGCGCT GCCCTCGATC
GGCTCGGTCG CAAGGCTCAG CATTGCAAGC CGCAGGCGCG ACAGCACAGG CGGCCACGAG
ATTGCCGTTG CCGGCGGCAA GATCGCGCAT ATGCGCCCGG CTGCCGCCAA TCCCGGGACG
ATCGTCGAAG TGCGCGACCT GTTCTTCGCG ACACCCGCCC GGCTCAAATT CCTGAAGACG
GAGAAAGCCG AGGCCGGCGC CATTACCGAG ATCGTCAAGC GCATGGCGAT CGCCTTTCCG
GCGGTACGCT TCGTGCTGTC GGGTTCGGAC CGCACGACAC TGGAATTTCC GGCGACCGGC
GACGACCATC TGGCGCGCAT GGCGCAGGTG CTCGGCAAGG AGTTCCGCGA CAACGCCATT
GCGCTCGACG CGGTGCGCGA GGAGATCGCG CTTACCGGCT TTGCCGGCGT GCCGACCTTC
AATCGCGGCA ACTCCGCCCA CCAATACGCT TTCGTCAACG GCCGGCCGGT GCAGGACAAG
CTGATCCTCT CGGCGATCCG CGGCGCCTAT GCCGAGACGA TCCCATCCGG ACGCTATCCG
GTGGCGGTGC TGGCGATCAC GCTCGATCCC GCTCTGGTCG ACGTCAACGT GCATCCGGCA
AAATCCGACG TGCGGTTCCG CGATCCTAGC CTGGTGCGCG GCCTGATCGT CGGCGCCATC
CGCGAGGCCC TGGCGCGCGA CGGCAGCCGG GCGGCAACCA CCGGCGCGAG CGACATGCTG
CGCTCCTTCC GCCCCGGTTT CCAGCCGAAT AACCAGCGGC CGCAAACGGC ATGGTCGGCC
GAAACCTCGC CCTCCCAGCC CTATCAGCCG GCAACGGGAT TCGGCGAGCG GCCACAGGCG
TCCTTCGACG GACTTTCGAT GCCGACGGCG CGGGCCGAGC CGCAGTTTTC GCCGCAGCCG
GCAGTCGCCG AACCAAACAC GCGGTATCCG CTCGGCGCGG CGCGGGCGCA GATCCACGCA
AACTACATCG TCGCCCAGAC CGAGGATGGG CTCGTCATCG TCGACCAGCA TGCCGCGCAT
GAGCGGCTGG TTTTCGAGGC GATGCGCAAG GCGCTGCATT CGAAGCGGCT GGCCTCGCAG
GTGCTGCTCA TCCCCGAGAT CGTCGATATT CCGGAAGAGG ACTGCGACCG GCTGATGCAG
CATGCGGCCG AGCTTTCCGA ACTCGGCCTG GCGATCGAGC GTTTCGGCCC AGGCGCGATC
GCCGTGCGCG AGACGCCGGC GATGCTCGGC GAGGTCGATG CGCATGGGCT GATCCGCCAG
CTTGCCGACG AGATCGCCGA ATGGGACACG GCGTCGGGCC TATCGGCCAA GCTCGAATAT
GTGGCAGCGA CCATGGCCTG CCACGGGTCG GTGCGCTCGG GACGGCGGCT GCGGCCGGAG
GAAATGAACG CGCTGCTGCG GGAGATGGAA GTGACCCCCG GCTCCGGCCA GTGCAATCAC
GGCCGGCCGA CCTATATCGA ATTGAAGCTC AGCGATATCG AGCGGCTTTT CGGCAGAAGC
TAA
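
The GC content field can be recomputed from the gene sequence once the display spacing is removed. The following is a minimal sketch: the helper names are hypothetical, and the demo uses only the first ten bases of the sequence above, so its result is not the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First ten-base block of the gene sequence, used only as a small demo.
first_block = clean_sequence("ATGGCCATCA")
print(gc_fraction(first_block))  # 0.5
```

Passing the full 1803 bp sequence through `clean_sequence` and `gc_fraction` gives a figure that can be compared with the GC content reported in the record.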
 
Protein sequence
MAIRQLSETL INQIAAGEVI ERPASAAKEL IENALDAGAT RIEIATSGGG KALLRVSDNG 
SGMDAADLEL AVRRHCTSKI SETLEDIRTL GFRGEALPSI GSVARLSIAS RRRDSTGGHE
IAVAGGKIAH MRPAAANPGT IVEVRDLFFA TPARLKFLKT EKAEAGAITE IVKRMAIAFP
AVRFVLSGSD RTTLEFPATG DDHLARMAQV LGKEFRDNAI ALDAVREEIA LTGFAGVPTF
NRGNSAHQYA FVNGRPVQDK LILSAIRGAY AETIPSGRYP VAVLAITLDP ALVDVNVHPA
KSDVRFRDPS LVRGLIVGAI REALARDGSR AATTGASDML RSFRPGFQPN NQRPQTAWSA
ETSPSQPYQP ATGFGERPQA SFDGLSMPTA RAEPQFSPQP AVAEPNTRYP LGAARAQIHA
NYIVAQTEDG LVIVDQHAAH ERLVFEAMRK ALHSKRLASQ VLLIPEIVDI PEEDCDRLMQ
HAAELSELGL AIERFGPGAI AVRETPAMLG EVDAHGLIRQ LADEIAEWDT ASGLSAKLEY
VAATMACHGS VRSGRRLRPE EMNALLREME VTPGSGQCNH GRPTYIELKL SDIERLFGRS
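
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the compact TCAG ordering. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code and differing only in which codons may serve as starts, so it does not handle alternative start codons (not needed here, since this gene begins with ATG). The `translate` helper is illustrative only.

```python
from itertools import product

# Codons enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; any base other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons of the gene sequence above.
print(translate("ATGGCCATCA GACAGCTTTC C"))  # MAIRQLS
# Last three codons plus the stop codon.
print(translate("GGCAGAAGCTAA"))  # GRS
```

Both fragments agree with the start (MAIRQLS) and end (GRS) of the protein sequence listed above.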