Gene Rleg_1176 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1176 
Symbol 
ID8012290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1151693 
End bp1153723 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content64% 
IMG OID644823760 
ProductPeptidoglycan-binding LysM 
Protein accessionYP_002975010 
Protein GI241203914 
COG category[S] Function unknown 
COG ID[COG1652] Uncharacterized protein containing LysM domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0861609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAGA ACCGTGCCGG CTTGCTGGCT CTTGCAGTCC TCGCAATCGC AATCCTACTG 
ATGGTGTTTG TCGTCATGCC GCGTATCGGC GGCGATGCAA CCAAGGTCGG CGACGCCATC
AACCAGGCGA GCACGGAGCT CAAGAACACG GTTAACGAGG CGTCAAAGAC ATCGCGCTCC
GCTGTGGGCG ATGCGGCTGC GGTCGCCGAT CAGGTCGGCC GCCTTTCGGC CGATGCCGGC
GTGTCGCTCA GCGAGCTCAA GGCGCTGTTT GCCGACGGCA AAGGCCCTGC CATCGATGTC
TTCACTGCCG CCAAAACCAA GGCCGTCAAC GCGCTGACCG CACTTGCCGG CTTCACTATT
CCCGAGGGCC TCGATCCCGC GACCCAGACC CTTGCCGCCA AGGCCAAGGA CGGTGGCGCG
AAGGCGCTCG CCATCGTCCG GTCGCTGCCC GAGAACATCG CCGATGCGCT TGCCGCGATC
GCCAAGGCCG AGGTTGCGCT GACCGGTGCG CCGGAGACCG CCCCTGGGGC GAATACGGCA
GCGGAAAATA CCGGCCCGAA ACTTCCCGCC TTCGACGTAT TGCGCGTCGA GCCCGACGGA
TCGACGGTGA TTGCCGGTTC CGCCGAACCG AACGGCAAGC TCGAGGTGCT CGACGGCGAG
AAGGTGGTGA CGACGGCCAA TGTCGATGCG AGTGGCGATT TTGCTGCCGT TCTCGACGAT
CCGCTTCCAG CCGGCGACCA CCAGCTGGTG CTCAAGTTCA CGGGCAAGGA CGGCAAGAGC
ACGCTTTCCG AAGAAGTCGC GACCATTTCC GTGCCGAAGG ATGGCAATGG CGCCAATCTG
CTCGCCATGG TTTCCAAACC CGGCGCGGCG AGCCGCATCA TCACCGCGCC GAAGGCCGGA
ACCGAAGTTG CCGACGCCTC CAATCCGATG GCGCCGCCTG CGGACAAGCC GGCAACCGGT
GAAAGCTCGG CTGCACCAAC CGGCGAGCTG GCGCTGCAGA CGCCGAACCT CACCGACACA
CCCTCCGGCG GCGCTGATAC GGCACCGGCC ATTCCCGGCA CTGCCGCACC GGATAAGACG
AATGCACCCG ACGTGATGGT CAATGCCGTC GAGATCGAAG GCAACAAGAT CTTCATTGCC
GGCACGACGC GCTCCAATGC CAAGGTCATC GGTTATGCCG ACGACAGCCT CGTCGGCCAG
GATACTGCCG GCTCCGACGG CCATTTCGTC ATCGACGGTG TGGTGGCGCT TTCGGTCGGC
GACCACAAGA TCCGCGTCGA TGTCGTCGAT CCCACTGGCA AGGTGATCGT GCGTGCGTCG
GTGAATTTCA ATCGCCCCGC CGGCGATCAG GTGAGGGTCG CCGCGCAATC GGCCCCCGCA
GATGCGAACG GTGCTTCCTC GATGGTTCCG CTTGACGAAG GCGAGCTCCG CAAACTGAAA
GCCGAAGTCG GCAAGGCCTT CGGCCTGCTG AAGGGGCTTT TTGCCGATGG CAAGTTGCCC
GGCGCCGAAC AGCTTGCGGC AGCACGCTCG GCAACGGACT TTGCGCTGCG CTCCGTCGCA
GACTTCCGTC CAGCGGCCGA TGCGCCTGAT GTCTTCAAGC AAGCGTCAGG TTCCTCCTCA
CAGGTCGCCG GCAATGCGCT GAAGCTGCTG CAGGGCCTGC CTGGGGATGC GAAGTCGGTC
GGCGCCGCAC TCGACAAGCT GGGCGGGATG ATTGCCGAGC TCACCGCCGC ACCCGCGCCG
GCAACGCCGT CTGCAAACGA GGTCGGAAGC AACCAGCCGA AGACGATCGA ACAGGCGCCG
CTGACGGCGA ACAACGCGGC GGTCATCATT CGCCGCGGCG ACACGCTTTG GCAGATCTCG
CGCCGTACCT ATGGTCTCGG CGTTCGCTAC ACGACGATCT ACATCGCCAA CGAGGACAAG
ATCATCAATC CCGATCGCAT CCGCCCTGGC CAGATTTTCG GCCTGCCGAA GGATGTACTG
CCGAATGCCG AAGAGCTGCA CCGCAAGCGC ATGTCCGGCC AGCATCTCTA A
 
Protein sequence
MMKNRAGLLA LAVLAIAILL MVFVVMPRIG GDATKVGDAI NQASTELKNT VNEASKTSRS 
AVGDAAAVAD QVGRLSADAG VSLSELKALF ADGKGPAIDV FTAAKTKAVN ALTALAGFTI
PEGLDPATQT LAAKAKDGGA KALAIVRSLP ENIADALAAI AKAEVALTGA PETAPGANTA
AENTGPKLPA FDVLRVEPDG STVIAGSAEP NGKLEVLDGE KVVTTANVDA SGDFAAVLDD
PLPAGDHQLV LKFTGKDGKS TLSEEVATIS VPKDGNGANL LAMVSKPGAA SRIITAPKAG
TEVADASNPM APPADKPATG ESSAAPTGEL ALQTPNLTDT PSGGADTAPA IPGTAAPDKT
NAPDVMVNAV EIEGNKIFIA GTTRSNAKVI GYADDSLVGQ DTAGSDGHFV IDGVVALSVG
DHKIRVDVVD PTGKVIVRAS VNFNRPAGDQ VRVAAQSAPA DANGASSMVP LDEGELRKLK
AEVGKAFGLL KGLFADGKLP GAEQLAAARS ATDFALRSVA DFRPAADAPD VFKQASGSSS
QVAGNALKLL QGLPGDAKSV GAALDKLGGM IAELTAAPAP ATPSANEVGS NQPKTIEQAP
LTANNAAVII RRGDTLWQIS RRTYGLGVRY TTIYIANEDK IINPDRIRPG QIFGLPKDVL
PNAEELHRKR MSGQHL