Gene Rleg2_0496 details

Gene Information

Locus tag: Rleg2_0496
Symbol: mutL
ID: 6979212
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 508411
End bp: 510213
Gene Length: 1803 bp
Protein Length: 600 aa
Translation table: 11
GC content: 67%
IMG OID: 643395208
Product: DNA mismatch repair protein
Protein accession: YP_002280019
Protein GI: 209548102
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
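
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 508411
end_bp = 510213

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1803
print(protein_length_aa)   # 600
```

Under these assumptions the result agrees with the stated Gene Length (1803 bp) and Protein Length (600 aa).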


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.170626
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCATCA GACAGCTTTC CGAAACGCTC ATAAACCAGA TCGCCGCCGG CGAAGTCATC 
GAACGGCCGG CGAGCGCTGC CAAGGAACTG ATCGAAAACG CGCTCGATGC GGGCGCGACG
CGCATCGAGA TCGCCACGGC GGGCGGCGGC AAGGCGTTGC TGCGGGTCAG CGACAACGGC
TCGGGCATGG ATGCGGCCGA TCTGGAACTG GCGGTCAGGC GCCACTGCAC CTCGAAACTA
TCGGAGACGC TGGAGGATAT CCGCACGCTC GGTTTCCGCG GCGAGGCGCT GCCGTCGATC
GGCTCGGTCG CAAGGCTCAG CATCGCCAGC CGCAGGCGCG ACAGCGCCGG CGGCCACGAG
ATCGCCGTCA ACGCCGGCAA GGTCGCACAT TTGCGCCCGG CCGCCGCCAA TCCCGGGACG
ATCGTCGAAG TGCGCGATCT TTTTTTCGCG ACCCCCGCCC GGCTCAAATT CCTGAAGACC
GAGAAGGCCG AGGCCGGCGC CATTACCGAG ATCGTCAAGC GCATGGCGAT CGCCTTTCCC
GCCGTGCGCT TCGTGCTGTC CGGTTCCGAC CGCACGACGC TGGAATTTCC GGCGACCGGC
GACGACCATC TGGCGCGCAT GGCGCAGGTG CTCGGCAAGG ACTTCCGCGA CAACGCCATC
GCCCTCGACG CGGTGCGCGA GGAAATTTCA CTCACCGGCT TTGCCGGCGT GCCGACTTTC
AATCGCGGCA ACTCCGCCCA TCAATACGCC TTCGTCAACG GCCGGCCGGT GCAGGACAAG
CTGATCCTGT CGGCGATCCG CGGCGCCTAT GCCGAAACCA TCCCGTCAGG CCGTCATCCG
GTGGCGGTGC TGTCGATTAC CCTCGATCCC GCCCTCGTCG ACGTCAACGT GCATCCGGCA
AAATCCGACG TGCGGTTTCG CGACCCCGGC CTGGTGCGCG GCCTGATCGT CGGCGCCATC
CGCGAGGCCC TGGCGCGCGA CGGCAGCCGG GCGGCAACGA CCGGGGCAAG CGACATGCTG
CGCTCCTTCC GCCCCGGCTT TCAGCCGCAG GCGCAGCGAC CGCAGACGGC ATGGTCGGCC
GAGACCTCGC CCTTCCGGCC CTACCAGCCG ACAACGGGTT TTTCCGAACG GCCACAGGCC
TCCTTCGACG GGCTGTCGAT GCCGACGGCG CGGGCCGAGC CGCCGTTTTC GCCGCAGCCG
GCAGCAGCCG ACACCACCGC ACGCTATCCG CTCGGCGCGG CGCGGGCGCA GATTCACGCC
AACTACATCG TCGCCCAGAC CGAGGACGGG CTTGTCATCG TCGACCAGCA TGCCGCCCAT
GAGCGGCTGG TGTTCGAGGC GATGCGCAAG GCGCTGCATT CGAAGCGGCT GGCCTCGCAG
GTGCTGCTCA TCCCGGAGAT CGTCGATATT CCAGAAGAGG ATTGTGACCG GCTGATGCAG
CATGCGGCGG AACTTGCCGA ACTCGGCCTG GCAATCGAGC GTTTCGGCCC CGGGGCGATT
GCCGTGCGCG AGACGCCGGC GATGCTCGGC GAAGTCGACG CGCATGGCCT GATCCGCCAG
CTTGCCGACG AGATTGCCGA ATGGGACACG GCGTCGGGCC TGTCGGCCAA GCTCGAATAT
GTGGCGGCGA CGATGGCCTG CCACGGGTCG GTGCGCTCCG GACGGCGGCT GCGGCCGGAG
GAAATGAACG CGCTGCTCAG GGAAATGGAA GTGACGCCCG GCTCCGGCCA GTGCAATCAC
GGCCGGCCGA CCTATATCGA ATTGAAGCTC AGCGATATCG AGCGGCTCTT CGGCAGAAGC
TGA
 
Protein sequence
MAIRQLSETL INQIAAGEVI ERPASAAKEL IENALDAGAT RIEIATAGGG KALLRVSDNG 
SGMDAADLEL AVRRHCTSKL SETLEDIRTL GFRGEALPSI GSVARLSIAS RRRDSAGGHE
IAVNAGKVAH LRPAAANPGT IVEVRDLFFA TPARLKFLKT EKAEAGAITE IVKRMAIAFP
AVRFVLSGSD RTTLEFPATG DDHLARMAQV LGKDFRDNAI ALDAVREEIS LTGFAGVPTF
NRGNSAHQYA FVNGRPVQDK LILSAIRGAY AETIPSGRHP VAVLSITLDP ALVDVNVHPA
KSDVRFRDPG LVRGLIVGAI REALARDGSR AATTGASDML RSFRPGFQPQ AQRPQTAWSA
ETSPFRPYQP TTGFSERPQA SFDGLSMPTA RAEPPFSPQP AAADTTARYP LGAARAQIHA
NYIVAQTEDG LVIVDQHAAH ERLVFEAMRK ALHSKRLASQ VLLIPEIVDI PEEDCDRLMQ
HAAELAELGL AIERFGPGAI AVRETPAMLG EVDAHGLIRQ LADEIAEWDT ASGLSAKLEY
VAATMACHGS VRSGRRLRPE EMNALLREME VTPGSGQCNH GRPTYIELKL SDIERLFGRS
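
The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a sketch under stated assumptions, not the method used by the database: it assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The names `translate`, `gc_fraction`, and `GENE_PREFIX` are hypothetical helpers introduced for illustration, and only the first 30 bases of the gene are used.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TO_AA = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(dna):
    """Upper-case the sequence and drop the spaces/newlines used for layout."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TO_AA[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
GENE_PREFIX = "ATGGCCATCA GACAGCTTTC CGAAACGCTC"

print(translate(GENE_PREFIX))   # MAIRQLSETL
print(gc_fraction(GENE_PREFIX))
```

The translated prefix matches the first ten residues of the protein sequence. Pasting the full gene sequence into `GENE_PREFIX` would allow the same comparison against the complete protein and the reported GC content.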