Gene Rleg_1218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1218 
Symbol 
ID8012325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1193881 
End bp1195923 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content64% 
IMG OID644823801 
ProductProlyl oligopeptidase 
Protein accessionYP_002975051 
Protein GI241203955 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000231766 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAACCTTC ATCTCGAAAC GGACGACGAT CCGCGCACCC GCCAGTTCAT TGACGAGCAC 
AACCGGGTTT CCGACGCGGC ACTGAGAACA CCTGAATTCG AGAGGGATCG CGACGCGATC
AAGGCGCTGA TCGAGCGGCA GGACAGGCTG ATCGTTCCGA TGCGCCGAGA CGAGTGGCTG
TTCGATTTCC GGCAGAGCAA GGACAATCCT CTCGGCATCT GGCTCCGCCT GCCGGCAGAC
CAGCAGCCGC TGCCGGACGC CGCCTGGGAG CCGGTCTTCG ATCTCGACGC CTTCTGCGTC
AGGGAAGGCA AGCGCTGGAA CTGGCGCGGC GCCGTCACCT GCCCATGGGA GCCGACGCGG
GTGCTGCTCA CGCTTTCGGA CGGCGGCTCC GACCTTCTCC GGTTGCTCGA ATTCGACGCC
GAACCGAAAC AGGTCGTCGA GGGCGGCTTC GATACACCGG CCGCGCGATC GCATGCCACC
TGGCTGAGCC GCGACGAAAT CTGCTATTTC GGTTCCATCG ACCGATTTTC GGCGACCCGC
TCGGGATGGC CGCGCGTCGG ACGGCGATTG AAGCGCGGGC AGCGACCGGA AGATGCCGCC
GTCATGTTCG AAGCAGCAGA CGAGGACGTC TATGGCTTCA ATCTCGTCAT CGACCCTGCC
TTATCAGGCG CTTCAGCGGA TCGCGGATTG ATCGATATTT TCGTCGCCGC CCACGAGATC
GGCGTCGCGA GCGCTTTCCT GATCGCCGAT AACGGCACAC AGCGACGCAT CGATCTGCCT
AAGGAAGCCG ATTTCCAGTT CAATCATGAT CATTGCCTTT GGCGGGCAAA GACCGATGAG
CGCGTTGCGA CCGGCAGCCT CGTGCTGCAG CGCTTCGATC CCGCCTCGGA GACGGCCCTG
CTCGGGCCGG AGCGGATCCT GTTTGAGCCT GGCGAGGGCC AGTCGATCGC GCAGATGATG
CTGATGCAGG AGTGGTGTGT CTTCATCATC TCCGATCGGC TGCGTCCGCG TCTCATGGTG
CTAGACCTGA CAAGGCCCGA TGCCAAACAG CGCGAGATCG CGCTGCCGGC GGACATGCAG
ACGGCTCATT TCAGGCCGCT TCATGCGGAT CTGCATCTTG GCGACGATAC GCTCTACATC
GTCGGCCAGG GCTTCCTGCA GCCTCCCGTC TGTTACCGCC TCGAGCTGTC GGATCGCAGC
AAGCAAGCCG AGCCGATTTT CGTCGCCACG GCGCCGAGCT ATTTCGATGC GACCGATATG
TCATCCGAAC TGCTGGAGGC TGTTTCCGAG GATGGAACGA AGGTTGCTTA CCGGCTGGTC
CTGCCGAAGC AGTGGACCAA GGGCGCGCTG CCGGTGCTGC TTTACGGCTA TGGCGGCTTC
GATGTTTCGC TATCGCCGAA CTATTCCGGC GTGACGGGAC GCTGGCTGGA ACAAGGCGGC
GCCTATGTGC AGGCCTATAT CCGCGGCGGT GGCGAATTCG GGCCCGACTG GTATCGCAGC
GCCAAGCGGC AGGGGCGAGA CCGGGCGTTT GCCGACTTCG TCGCCATCGC CCGCGATCTC
GTCGCCCGCG GCTATACAGT GCCGTCGCGG ATCGCCTGCC AGGGCGGCAG CAATGGCGGA
CTGCTGACCG GCGTGATGCT GACGCGCTAT CCCGACGATT TCGGCGCCGT CTGGTGCCAG
GTGCCGGTCC TCGACATGAC ACGTTTCCAC CTGTTCAGCG CAGGGCAAGC CTGGATGGAC
GAATATGGCG ACCCGGAGAC GCCGGCGGAC CGGGACTTCA TGCTCGGCTA TTCGCCGCTT
CACAATGTCG GGCCGGCGAC CAAGGTCAGC TATCCGCCGA TCTACATCGA AAGCTCCGCC
AATGACGACC GCGTGCACCC CTCGCATGCG CGCCGCTTTG CCGCACGGCT GGAGGAAGAC
GGACACCGGC CGCTCTTCCA TGAATTCGGC TCCGGCGGGC ATGGCGGCGA CGGCAATTCC
GAAGAGCGCG CCGCCCGCGC GGCGATGGGT TACAGCTTCC TTCGCCAAAC TATCATGCGG
TAG
 
Protein sequence
MNLHLETDDD PRTRQFIDEH NRVSDAALRT PEFERDRDAI KALIERQDRL IVPMRRDEWL 
FDFRQSKDNP LGIWLRLPAD QQPLPDAAWE PVFDLDAFCV REGKRWNWRG AVTCPWEPTR
VLLTLSDGGS DLLRLLEFDA EPKQVVEGGF DTPAARSHAT WLSRDEICYF GSIDRFSATR
SGWPRVGRRL KRGQRPEDAA VMFEAADEDV YGFNLVIDPA LSGASADRGL IDIFVAAHEI
GVASAFLIAD NGTQRRIDLP KEADFQFNHD HCLWRAKTDE RVATGSLVLQ RFDPASETAL
LGPERILFEP GEGQSIAQMM LMQEWCVFII SDRLRPRLMV LDLTRPDAKQ REIALPADMQ
TAHFRPLHAD LHLGDDTLYI VGQGFLQPPV CYRLELSDRS KQAEPIFVAT APSYFDATDM
SSELLEAVSE DGTKVAYRLV LPKQWTKGAL PVLLYGYGGF DVSLSPNYSG VTGRWLEQGG
AYVQAYIRGG GEFGPDWYRS AKRQGRDRAF ADFVAIARDL VARGYTVPSR IACQGGSNGG
LLTGVMLTRY PDDFGAVWCQ VPVLDMTRFH LFSAGQAWMD EYGDPETPAD RDFMLGYSPL
HNVGPATKVS YPPIYIESSA NDDRVHPSHA RRFAARLEED GHRPLFHEFG SGGHGGDGNS
EERAARAAMG YSFLRQTIMR