Gene Rleg2_0032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_0032 
Symbol 
ID6978741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp30528 
End bp33170 
Gene Length2643 bp 
Protein Length880 aa 
Translation table11 
GC content66% 
IMG OID643394743 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_002279561 
Protein GI209547644 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.317103 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGAGC AATATATCGA GATCAAGGCG AACAATCCGG GTTCGCTGCT CTTCTATCGC 
ATGGGCGATT TCTACGAGCT GTTCTTCGAG GATGCGCTGG AAGCCTCCCG CGCGCTCGGC
ATCACGCTGA CGAAGCGCGG CCAGCACATG GGCCAGGATA TCCCGATGTG CGGCGTTCCG
GTGCATGCGG CCGACGATTA CCTGCAGAAA CTGATCTCGC TCGGTTTCCG CGTCGCCGTC
TGCGAGCAGA TCGAAGATCC GGCCGAAGCG AAAAAACGCG GCGGCAAATC CGTCGTCAAG
CGCGATGTCG TCCGCCTGGT CACGCCGGGC ACGATCACCG AGGAAAAGCT GCTTTCGCCC
TCGGAATCCA ACTATCTGAT GGCGCTGACC CGCATTCGCG GCGGGGCCGA ACCGTTGCTG
GCGCTTGCCT GGATCGACAT TTCCACCGGC GTCTTCCGGC TGGCCGAAAC CGAAGCCTCG
CGGCTGCTTG CCGATATCCT GCGCATCGAT CCGCGCGAAC TGATCCTGCC GGAGACGATC
TTCCACGATC CGGAACTCAA GCCGGTCTTC GACGTGCTCG GCCGCACCGC GGTGCCGCAG
CCTTCCGTGC TCTTCGACAG CGCCAGCGCC GAAGGCCGGA TCGCGCGGTA TTTCGGCGTC
TCGACGCTCG ACGGCTTCGG CACCTTCTCG CGCGCCGAAC TGGCGGCGGC TGCCGCCGCC
GTCGCCTATG TCGAGAAGAC CCAGATCGCC GAGCGGCCGC CGCTTGGAAA GCCGGAACGG
GAAAGTGCGG CGTCGACACT GTTCATCGAT CCCGCCACCC GCGCCAACCT GGAGCTGGCC
CGCACGCTGT CGGGCGACCG CAACGGCTCA TTGCTGAAAG CGATCGACCG CACCGTTACC
GGCGGCGGCG CGCGGCTTCT GGCCGAGCGG CTGATGTCGC CGCTGACCGA CCCCGCCCGC
ATCAATGCGC GGCTCGATTC GATCGGCTTC CTGATCGACG AACCCTCGCT CTGCGGCAAT
CTGCGCGACA CGCTGAAACA TGTGCCCGAC ATGCCGCGCG CCCTATCCCG CCTGGCGCTC
GACCGCGGCG GCCCGCGCGA TCTCTCAGCC ATCCGCCAGG GCCTGCAAGC GGCGAACGAC
GTGGCAGCGA TGCTTGCAAG CGCGATGCTG CCGGAAGAGC TTGGCCAGGC GCTGTCCGGG
CTGCAGGCCC TTCCCGCAGC GCTCGAAACC CTGCTGGCCG AGACGCTCGC CGACGAATTG
CCGCTGCTGA AGCGCGACGG CGGCTTCCTG CGCGACGGCG CCAGTGCCGA GCTCGACGAG
GTCCGGGCGC TGCGCGACCA GTCGCGCCGG GTGATCGCGG GCCTGCAACT GCAATATGCC
GAGGAAATCG GCATCCGGTC GCTGAAGATC AAACACAACA ACATCCTCGG CTATTTCATC
GAGGTGACCG CCGGCAATGC CTCGCCGATG ACGGACACGG CTGAGGCCAA GGCCCGCTTC
ATCCACCGCC AGACGATGGC GAGCGCGATG CGCTTCACCA CCACCGAACT CGCCGATCTC
GAAAGCCGCA TCGCCAATGC CGCCGACCGG GCGCTGACGA TCGAGCTCGC CGCCTTCGAG
AGGATGACGG CGGCCGTGGT TGCGGAAGCC GAGGCGATCA AATCCGGCGC GAGGGCGCTT
GCCGTCATCG ACGTTGCAGC TAGCCTGGCG CTTCTCGCCG AGGAGCAGGC CTATTGCCGT
CCGCAGGTCG ACGGCTCGAA GATGTTTGCC ATCGATGGCG GCCGCCATCC GGTCGTCGAA
CAGGCGCTGC GGCGGCAGGC GAGCGGCCCC TTCGTCGCCA ACAATTGCGA TCTCTCGCCG
AAAGCAGGCG ACAAGGACGG GGCGATCTGG CTCTTGACCG GCCCGAACAT GGGCGGCAAA
TCGACCTTCC TGCGGCAGAA CGCGCTGATA GCCATCCTGG CGCAGATGGG CTCCTTCGTG
CCGGCGACCT CGGCCCATAT CGGCATCGTC GACCGGCTTT TCTCGCGCGT CGGCGCCTCC
GACGACCTGG CGCGCGGGCG CTCCACCTTC ATGGTCGAGA TGGTCGAGAC CGCTGCGATC
CTCAACCAGG CGAGCGACCG TTCACTCGTC ATTCTCGACG AGATCGGCCG CGGCACCGCC
ACCTTCGACG GCCTGTCGAT CGCCTGGGCC TCCGTCGAGC ACCTGCATGA GGCCAACCGC
TGCCGCGGCC TCTTCGCCAC GCATTTCCAT GAGCTGACCG TGCTTTCGGA AAAGCTTGTC
CGGCTATCGA ACGCCACGAT GCGCGTCAAG GAATGGGACG GCGACGTCAT CTTCCTACAT
GAGGTCGGCC CGGGTGCGGC CGACCGCTCC TACGGCATCC AGGTCGCCCG CCTTGCCGGG
CTTCCGGCTT CGGTGGTGAC GCGGGCCCGC GATGTGCTCA CCCGCCTCGA GGATGCCGAC
CGCAAGAACC CGGCGAGCCA GCTGATCGAC GACCTGCCGC TCTTCCAGGT GGCGGTGCGC
CGCGAGGATA CCGCGCGCGG GCCGTCCAAG GTCGAGGAGA CGCTGAAGGC GATGAGCCTT
GACGACATGA CGCCGCGCGA GGCAATGGAC GCGCTTTACG ACCTCAAGAA AAAATTGAAA
TAG
 
Protein sequence
MMEQYIEIKA NNPGSLLFYR MGDFYELFFE DALEASRALG ITLTKRGQHM GQDIPMCGVP 
VHAADDYLQK LISLGFRVAV CEQIEDPAEA KKRGGKSVVK RDVVRLVTPG TITEEKLLSP
SESNYLMALT RIRGGAEPLL ALAWIDISTG VFRLAETEAS RLLADILRID PRELILPETI
FHDPELKPVF DVLGRTAVPQ PSVLFDSASA EGRIARYFGV STLDGFGTFS RAELAAAAAA
VAYVEKTQIA ERPPLGKPER ESAASTLFID PATRANLELA RTLSGDRNGS LLKAIDRTVT
GGGARLLAER LMSPLTDPAR INARLDSIGF LIDEPSLCGN LRDTLKHVPD MPRALSRLAL
DRGGPRDLSA IRQGLQAAND VAAMLASAML PEELGQALSG LQALPAALET LLAETLADEL
PLLKRDGGFL RDGASAELDE VRALRDQSRR VIAGLQLQYA EEIGIRSLKI KHNNILGYFI
EVTAGNASPM TDTAEAKARF IHRQTMASAM RFTTTELADL ESRIANAADR ALTIELAAFE
RMTAAVVAEA EAIKSGARAL AVIDVAASLA LLAEEQAYCR PQVDGSKMFA IDGGRHPVVE
QALRRQASGP FVANNCDLSP KAGDKDGAIW LLTGPNMGGK STFLRQNALI AILAQMGSFV
PATSAHIGIV DRLFSRVGAS DDLARGRSTF MVEMVETAAI LNQASDRSLV ILDEIGRGTA
TFDGLSIAWA SVEHLHEANR CRGLFATHFH ELTVLSEKLV RLSNATMRVK EWDGDVIFLH
EVGPGAADRS YGIQVARLAG LPASVVTRAR DVLTRLEDAD RKNPASQLID DLPLFQVAVR
REDTARGPSK VEETLKAMSL DDMTPREAMD ALYDLKKKLK