Gene Rleg_6197 details


Gene Information

Locus tag: Rleg_6197
Symbol:
ID: 8016210
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012852
Strand:
Start bp: 241800
End bp: 242702
Gene Length: 903 bp
Protein Length: 300 aa
Translation table: 11
GC content: 61%
IMG OID: 644827503
Product: proline-specific peptidase
Protein accession: YP_002978703
Protein GI: 241258819
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01250] proline-specific peptidases, Bacillus coagulans-type subfamily
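
The gene and protein lengths listed above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(241800, 242702)
print(length, protein_length(length))  # 903 300
```

Under those assumptions the result agrees with the 903 bp and 300 aa given in the record.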


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.941686
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGCGAAG TCACGACCAA AGAAGCTTAC CTGCCCTTTC GCGACTATCG CACCTGGTAT 
CGCGTCACCG GTTCGCTGGA GAGCGGCAAG CTGCCCCTCG TCGTCGCCCA TGGCGGGCCT
GGCTGCACCC ATGATTATGT CGATTCCTTC AAGGATATCG CCGCCCTCGA CGGCCGTCCG
GTCATCCATT ACGACCAGCT CGGCAATGGC AATTCCACCC GACTTCCGGA AAAAGGCCCG
GATTTCTGGA CGGTCGGCCT GTTCCTCGAG GAGCTGGACA CGCTACTTTC CCATCTCGGC
ATTCGGGATC GTTATGCCTT CCTCGGCCAG TCCTGGGGCG GCATGCTCGG CGCCGAACAT
GCGGTGCGCC AGCCGCAAGG TCTGAAGGCG CTTGTCATCG CCAACTCGCC GGCCAACATG
CACACCTGGG TTTCGGAGGC GAACCGGCTG AGGCAGGAAC TGCCGAAAGA GGTGCAGGAC
ACGCTGCTGA AGCATGAGCT GGTGGGAAGC CTCACCGATC CGGACTATAT CGCCGCCTCA
CGCGTCTTCT ATGACCGCCA TGTCTGCCGC GTGGTGCCGT GGCCGCCTGA AGTGGCGCGG
ACCTTTGCAA TCATGGACGA GGACAACACC GTCTACCGCA ACATGAACGG CCCGACCGAA
TTTCACGTCA TCGGTACGAT GAAAGACTGG ACGATCGAGA ACAGGCTGGA CCGCATCGAA
GCCCCGACGC TGCTGATCTC GGGAAAATAC GACGAGGCGA CGCCCCTGGT GGTAAGGCCC
TATCTCGAAC GCGTTCCGGG CTGCGAATGG GTGCTCTTCG AAAATTCCAG CCATATGCCG
CATGTCGAGG AAAAGCAGCT TTGCCTGGCG ACCGTTTCCG GTTTCCTGTC CCGGCACGAC
TGA
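
The sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of that cleanup and of a GC-content calculation like the one behind the GC content field; the helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "GTGGGCGAAG TCACGACCAA AGAAGCTTAC CTGCCCTTTC GCGACTATCG CACCTGGTAT"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions should give a length of 903 and a value that can be compared with the rounded GC content reported above.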
 
Protein sequence
MGEVTTKEAY LPFRDYRTWY RVTGSLESGK LPLVVAHGGP GCTHDYVDSF KDIAALDGRP 
VIHYDQLGNG NSTRLPEKGP DFWTVGLFLE ELDTLLSHLG IRDRYAFLGQ SWGGMLGAEH
AVRQPQGLKA LVIANSPANM HTWVSEANRL RQELPKEVQD TLLKHELVGS LTDPDYIAAS
RVFYDRHVCR VVPWPPEVAR TFAIMDEDNT VYRNMNGPTE FHVIGTMKDW TIENRLDRIE
APTLLISGKY DEATPLVVRP YLERVPGCEW VLFENSSHMP HVEEKQLCLA TVSGFLSRHD
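
The protein sequence starts with M even though the gene sequence starts with GTG, which normally encodes valine. NCBI translation table 11 permits GTG as an initiation codon, and an initiator is read as methionine. The sketch below illustrates this with a hand-built codon table; it is a simplified illustration, not the database's own pipeline, and `START_CODONS` is a deliberately reduced subset of the table 11 initiators.

```python
from itertools import product

# Standard codon assignments (the amino acids of table 11 match table 1),
# listed in NCBI order with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of initiators, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(text: str) -> str:
    """Translate a CDS, reading an initiator as M and stopping at the first stop."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        residue = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate_cds("GTGGGCGAAG TCACGACCAA AGAAGCTTAC"))  # MGEVTTKEAY
```

The output matches the first ten residues of the protein sequence listed above.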