Gene Rleg_3597 details


Gene Information

Locus tag: Rleg_3597
Symbol:
ID: 8014450
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3630776
End bp: 3632629
Gene Length: 1854 bp
Protein Length: 617 aa
Translation table: 11
GC content: 60%
IMG OID: 644826162
Product: oligoendopeptidase, pepF/M3 family
Protein accession: YP_002977382
Protein GI: 241206286
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1164] Oligoendopeptidase F
TIGRFAM ID: [TIGR02290] oligoendopeptidase, pepF/M3 family
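
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record for Rleg_3597.
length = gene_length_bp(3630776, 3632629)
print(length)                     # 1854
print(protein_length_aa(length))  # 617
```

Both results agree with the Gene Length and Protein Length fields reported in the record.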


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.794644
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.398139
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATCA AGCTCCCGCA CGCCGGCCTT CTTCTATCGG CAGCAGCGCC AGCTGGGGCC 
GCCGATCCGG CGCTCGGCGT TCTGCCGGTC TGGAAGCTGC AGGATCTTTA TCCCTCCGCC
ACCTCCACCG CCTTCGTCGC CGACATGGAA AAAGCCGGCA AGGCCGCGAT CGCCTTTGAA
GAAAAGTGGA AGGGCACACT CACGGAGGCG ACGGCGAAGA CCGGCGCCGA GGGCATCGGC
GCGGCCCTGA AGGAATATGA GGCGCTGGAC GACATCATCG GCCGCCTCGG CTCCTTTGCC
GGCCTCACTT ATTTCTCCGA TACCACCAAC CCGACAAACG GCAAGCTCTA CGGCGACGTA
CAGGCCAAGA TCACCGAATT TTCCGGTCAT CTCCTGTTCT TCGCGCTGGA ACTCAACCGC
ATCGACGACG CGGTGATCGA CGCCTGCATG GCGAATGATC CCGCCGCCGG ACATTATCGC
CCTTGGCTGC TCGACCTCAG GAAGGACAAG CCCTACCAGC TCGACGATAG GCTGGAACAG
CTCTTCCTTG AGAAGTCGAT GACATCAGCC GCAGCCTTCA ACCGCCTCTT CGACGAAACC
ATGGCGGAAC TTCGCTACGA GATCGATGGC GAGAAAGTGC CGCTCGAAGT GGCGCTGAAC
AAGCTGCAGG AAAAGGATCC GGAAGTGCGC CGCAAGGCAG CCATGGCGCT CGCCGAAACC
TTCAAGGCGA ATATCCGCAC CTTTACGCTG ATCACCAACA CGCTTGCCAA GGATAAGGAG
ATCGCCGACC GCTGGCGCGG CTTCGAGGAC ATCGCCGACA GCCGACACCT GGCAAACCGC
GTCGAGCGCG AGGTCGTCGA TGCGCTGGCC GCAGCCGTCC GCGAAGCCTA TCCCCGCCTT
TCGCATCGCT ATTACAAGAT GAAGGCGAAA TGGCTTGGCA TGGAGCAGAT GAATTTCTGG
GACCGCAACG CGCCGCTTCC GGAAACCTCC AGCGCCATCA TCTCGTGGCC GGAGGCGAAG
GACACCGTGC TATCGGCCTA TGGCAATTTT TCCCCCGAGA TGGCTGATAT CGCCAGGCGG
TTCTTCGACG AACAGTGGAT CGATGCTCCG GTTCGTGCCG GCAAGGCGCC CGGCGCCTTC
GCGCATCCGA CGGTTCCCTC GGCCCATCCC TATGTGCTCG TCAATTATAT GGGCAAGCCG
CGCGATGTAA TGACGCTTGC CCATGAACTC GGGCACGGCG TGCATCAGGT TCTCGCCGGC
GCGCAGGGAG CGCTGATGTG CCAGACGCCG CTGACGCTTG CCGAAACCGC TTCCGTCTTC
GGCGAGATGC TGACCTTCCG CGCGCTTCTG CAAAAGACCA CCGATACGCG CGAGCGCAAG
GCGATGCTCG CCCAGAAGGT CGAGGATATG ATCAACACGG TCGTGCGCCA GATCGCCTTC
TACGAATTCG AGCGCAAGCT CCACACCGCT CGTAAAGCTG GCGAACTCAC AGCTGACGAC
ATCGGCGAAC TCTGGCTCTC CGTCCAGTCG GAAAGCCTCG GGCCGGCGAT CAGCATTTCT
GAAGGGTACG AGACCTATTG GGCCTATATC CCCCATTTCA TCCACTCGCC CTTCTATGTC
TACGCCTATG CCTTTGGCGA TTGCCTGGTA AATTCGCTCT ATGCCGTCTA CCAGAAAGCC
GAGAAGGGCT TTCAGGAGAA GTATTTCGAA CTGCTGAGGG CCGGCGGCAC CAAGCATCAC
TCGGAACTGC TGAAGCCTTT TGGCCTCGAC GCCACCGATC CGTCGTTCTG GAGCCAGGGC
CTGTCGATGA TCGAAGGGCT GATCGATGAG TTGGAAGCGT TGGATAGGGG CTGA
 
Protein sequence
MKIKLPHAGL LLSAAAPAGA ADPALGVLPV WKLQDLYPSA TSTAFVADME KAGKAAIAFE 
EKWKGTLTEA TAKTGAEGIG AALKEYEALD DIIGRLGSFA GLTYFSDTTN PTNGKLYGDV
QAKITEFSGH LLFFALELNR IDDAVIDACM ANDPAAGHYR PWLLDLRKDK PYQLDDRLEQ
LFLEKSMTSA AAFNRLFDET MAELRYEIDG EKVPLEVALN KLQEKDPEVR RKAAMALAET
FKANIRTFTL ITNTLAKDKE IADRWRGFED IADSRHLANR VEREVVDALA AAVREAYPRL
SHRYYKMKAK WLGMEQMNFW DRNAPLPETS SAIISWPEAK DTVLSAYGNF SPEMADIARR
FFDEQWIDAP VRAGKAPGAF AHPTVPSAHP YVLVNYMGKP RDVMTLAHEL GHGVHQVLAG
AQGALMCQTP LTLAETASVF GEMLTFRALL QKTTDTRERK AMLAQKVEDM INTVVRQIAF
YEFERKLHTA RKAGELTADD IGELWLSVQS ESLGPAISIS EGYETYWAYI PHFIHSPFYV
YAYAFGDCLV NSLYAVYQKA EKGFQEKYFE LLRAGGTKHH SELLKPFGLD ATDPSFWSQG
LSMIEGLIDE LEALDRG
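
The sequences above are printed in space-separated blocks of 10 characters, 60 per line, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup, plus a simple GC-fraction helper; the function names are hypothetical and only the first and last gene-sequence lines are used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace so a block-formatted sequence becomes contiguous."""
    return "".join(text.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (whitespace is ignored)."""
    seq = join_blocks(dna).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAAAATCA AGCTCCCGCA CGCCGGCCTT CTTCTATCGG CAGCAGCGCC AGCTGGGGCC"
last_line = "CTGTCGATGA TCGAAGGGCT GATCGATGAG TTGGAAGCGT TGGATAGGGG CTGA"

head = join_blocks(first_line)
tail = join_blocks(last_line)
print(len(head), head[:3])   # 60 ATG
print(len(tail), tail[-3:])  # 54 TGA
```

Thirty full lines of 60 bases plus the final 54 give 1854 bp, matching the Gene Length field; applying `gc_fraction` to the full joined sequence would allow the reported GC content to be checked as well.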