Gene Rleg2_1073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1073 
Symbol 
ID6979792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1092403 
End bp1094445 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content64% 
IMG OID643395785 
ProductProlyl oligopeptidase 
Protein accessionYP_002280593 
Protein GI209548676 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.594316 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000514493 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAACCTTC ATCTCGAAAC GGACGACGAT CCGCGCACCC GCCAGTTCAT CGACGATCAT 
AATCGCATTT CCGATGCGGC GTTCAGAACG GCTGAATTTG CCAGGGATCG CGACGCGATC
AAGGCGTTGA TCGAGCGGCA GGACAGGTTG ATCGTGCCGA TGCGCCGCGG TGCGTGGTTG
TTCGATTTTC GGCAGAGCAA GGACAATCCT CTCGGCGTCT GGCTCCGTCT GCCTGCCGAC
CAGGAGCCGC TGCCGGACGC CGGTTGGGAA CCGGTTTTCG ATCTCGATGC CTTCTGCATC
CGTGATGGCA AGCGCTGGAA CTGGCGCGGC GTCGTCACCT GCCCGTGGGA GCCGACGCGG
GTGCTGCTCA CGCTTTCGGA CGGCGGCTCG GATCTTCTCC GGCTGCTCGA ATTCGACGCC
GAGAAGAAAG AGATCGTCGA GGGCGGTTTC GATACGCCGG CCGCACGATC GCATGCCAAT
TGGCTGAGCC GTGATGAGAT CTGCTATTTC GGCTCGACCG ACCGATTTTC GGCGACCCGC
TCCGGATGGC CACGGGTCGG GCGGCGATTG AAGCGCGGGC AGCGGCCGGA GGATGCCGCC
GTCATGTTCG AAGCGGCGAA TGAAGACGTC TACGGCTTCA ATCTCGTCAT CGATCCTGCC
CTGTCAGGCG GTTCGGCGGA TGCCGGATTG ATCGACATTT TCGTCGCCGC CCATGAGATC
GGCGTCGCCA GCGCTTTCCT GACCACCGCC GACGGCACGC GGCGGCGCAT CGATCTGCCT
GAGGAGGCCG ACTTCCAGTT CAATCACGAT CATTGCCTCT GGCGGGCAAA GACCGATGAG
CGTGTTGCCA CCGGCAGCCT CGTGCTGCAG CGCTTCGATC CCGGCTCGGA GACGGCTCCG
CTCGGGCCGG CGCGGATCCT GTTTCAGCCT GGCGAGGGCC AGTCGATCTC GCAGATGATG
CTGATGCGCG AGTGGTGCGT CTTCATCATT TCCGATCGGC TGCGTCCGCG TCTCATGGTG
CTGGATCTGA CCAAGCCCGA TGCCGAACAG CGCGAGATCG TGCTGCCTGC CGACATGCAG
ACGGCTCATT TCAGGCCGCT TTATGCGGAT CTGCATCTTG GCGACGATAC GCTCTGCCTC
ATCGGCCAGG GCTTCCTGCA GCCGCCGGCC TGTTACCGGC TCGAGCTATC GGATCGCAAC
AAGGCGGCCG AGCCGGTTTT CATCGCCGCA GCACCCAGCT ATTTCGATGC GGCCGGTATG
TCATCCGAGC TGCTGGAGGC GGTTTCCGAG GATGGAACCA GGGTTACCTA CCGGCTGGTC
CTGCCGAAAC ACTGGACCAA GGGCGCGCTG CCGGTGCTGC TTTACGGCTA TGGCGGCTTC
GATGTTTCGC TGTCGCCGAG CTATTCCGGC GTGACGGGAC GCTGGCTGGA ACAGGGCGGC
GCCTATGTGC AGGCCTATAT CCGCGGCGGC GGCGAATTCG GCCCCGATTG GTATCGCAGC
GCCAAGCGGC AGGGGCGAGA CCGGGCGTTC GCCGATTTCG TCGCCATCGC CCGCGATCTC
GTCGCCCGCG GCTACACCGT GCCGTCGCGC ATCGCCTGCC AGGGCGGCAG CAATGGCGGC
CTGCTGACCG GCGTGATGCT GACGCGCTAT CCGGACGATT TCGGCGCCGT CTGGTGCCAG
GTGCCGGTGC TCGACATGAC ACGCTTCCAC TTGTTCAGCG CAGGCCAGGC CTGGATGGAC
GAATATGGCG ATCCGGAGGT AGCAGCGGAT CGGGATTTCA TGCTCGGCTA TTCGCCGCTC
CATAATGTCC GGCCGGCGAC CGAGGTCACC TATCCGCCGA TCTATATCGA AAGCTCCGCC
AACGACGACC GGGTGCATCC CTCGCATGCG CGCCGCTTCG CCGCGCGGCT GGAGGAAGCG
GGACACCGGC CGTTCTTCCA TGAGTTCGGC TCGGGCGGGC ATGGCGGCGA CGGCAATTCC
GAAGAGCGCG CCGCCCGGGC GGCGATGGGT TATAGCTTCC TTCGCCAAAC TATCATGACG
TAA
 
Protein sequence
MNLHLETDDD PRTRQFIDDH NRISDAAFRT AEFARDRDAI KALIERQDRL IVPMRRGAWL 
FDFRQSKDNP LGVWLRLPAD QEPLPDAGWE PVFDLDAFCI RDGKRWNWRG VVTCPWEPTR
VLLTLSDGGS DLLRLLEFDA EKKEIVEGGF DTPAARSHAN WLSRDEICYF GSTDRFSATR
SGWPRVGRRL KRGQRPEDAA VMFEAANEDV YGFNLVIDPA LSGGSADAGL IDIFVAAHEI
GVASAFLTTA DGTRRRIDLP EEADFQFNHD HCLWRAKTDE RVATGSLVLQ RFDPGSETAP
LGPARILFQP GEGQSISQMM LMREWCVFII SDRLRPRLMV LDLTKPDAEQ REIVLPADMQ
TAHFRPLYAD LHLGDDTLCL IGQGFLQPPA CYRLELSDRN KAAEPVFIAA APSYFDAAGM
SSELLEAVSE DGTRVTYRLV LPKHWTKGAL PVLLYGYGGF DVSLSPSYSG VTGRWLEQGG
AYVQAYIRGG GEFGPDWYRS AKRQGRDRAF ADFVAIARDL VARGYTVPSR IACQGGSNGG
LLTGVMLTRY PDDFGAVWCQ VPVLDMTRFH LFSAGQAWMD EYGDPEVAAD RDFMLGYSPL
HNVRPATEVT YPPIYIESSA NDDRVHPSHA RRFAARLEEA GHRPFFHEFG SGGHGGDGNS
EERAARAAMG YSFLRQTIMT