Gene Rleg_1180 details

Gene Information

Locus tag: Rleg_1180
Symbol:
ID: 8012293
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1157547
End bp: 1158509
Gene Length: 963 bp
Protein Length: 320 aa
Translation table: 11
GC content: 63%
IMG OID: 644823764
Product: proline iminopeptidase
Protein accession: YP_002975014
Protein GI: 241203918
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
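
The coordinate and length fields in this record are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch in Python; the variable names are chosen here for illustration, and it assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon.

```python
# Values copied from the Gene Information fields.
start_bp = 1157547
end_bp = 1158509

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 963 320
```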


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.270645
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.812597
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAGA TACTGCGCAC GCTCTATCCC GAAATCGAAC CCTATGTTTC CGGCCATCTC 
GATGTCGGCG ACGGCCATGT GATCTATTGG GAGCGCTCGG GCACGCCTGG CGCCAAGCCC
GCCGTCTTCC TGCATGGCGG CCCGGGCGGC GGCATCTCGC CCGCCCATCG CCGGCTCTTC
GATCCCGCCC TCTACGACGT CATGCTGTTC GACCAGCGCG GTTGCGGCAG GTCGACGCCG
CATGCGGAAC TGCATGCCAA CACGACCTGG CACCTCGTCG CCGATATCGA GCGGCTGCGC
GAGATGGCCG GCGTCGACAG CTGGCAGGTG TTCGGCGGCT CCTGGGGCTC GACCCTGGCG
CTCGCCTATG CCGAGACGCA TCCGCAGCAC GTCTCCGAGC TCATCCTGCG CGGCATCTAT
ACGCTGACCA AGGCCGAGCT CGACTGGTAT TATCAATTCG GCGTCTCGGA GATGTTCCCT
GACAAGTGGG AGCGGTTCAT CGCGCCGATT CCGCCGGAAG AGCGGCATGA GATGATGCAT
GCCTATCATC GCCGCCTGAC GCATGAGGAC AGGAATGTGC GCCTCGCGGC GGCGCAGGCC
TGGAGCATCT GGGAAGGCGA AACGATCACG CTGCTGCCGG AACCTTCGAC GAGCTTCAAG
TTCGAGGAGC CGGAATTCGC CTATGCATTT GCGCGCATCG AGAACCATTT CTTTGTCAAT
GCCGGCTGGA TGGACGAGGG ACAGCTGATC CGCGATGCCG GCAGGCTCAA GCATATTCCC
GGCGTCATCG TGCATGGCCG CTACGACATG CCCTGCCCGG CCAAATATGC CTGGCTGCTG
CACAAGGCCT GGCCGAAGGC GGAGTTCCAC CTGATCGAGG GTGCGGGCCA TGCCTATTCG
GAGCCGGGGA TTCTCGATCG GCTGATCCGG GCGACGGATA AGTTTGCCAG GAAGCAAGAC
TGA
 
Protein sequence
MTEILRTLYP EIEPYVSGHL DVGDGHVIYW ERSGTPGAKP AVFLHGGPGG GISPAHRRLF 
DPALYDVMLF DQRGCGRSTP HAELHANTTW HLVADIERLR EMAGVDSWQV FGGSWGSTLA
LAYAETHPQH VSELILRGIY TLTKAELDWY YQFGVSEMFP DKWERFIAPI PPEERHEMMH
AYHRRLTHED RNVRLAAAQA WSIWEGETIT LLPEPSTSFK FEEPEFAYAF ARIENHFFVN
AGWMDEGQLI RDAGRLKHIP GVIVHGRYDM PCPAKYAWLL HKAWPKAEFH LIEGAGHAYS
EPGILDRLIR ATDKFARKQD
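
The gene and protein sequences above are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The sketch below shows one way to do both in plain Python. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and not part of any database tooling. It relies on the fact that NCBI translation table 11 assigns the same amino acid to each codon as the standard table and differs only in which codons may act as start codons. For brevity, the demonstration uses only the first line of the gene sequence; the same functions can be applied to the full sequence.

```python
# Codon table in NCBI order (T, C, A, G). Translation table 11 uses the same
# codon-to-amino-acid assignments as the standard table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied as displayed on the page.
first_line = "ATGACCGAGA TACTGCGCAC GCTCTATCCC GAAATCGAAC CCTATGTTTC CGGCCATCTC"
dna = clean(first_line)

print(translate(dna))  # MTEILRTLYPEIEPYVSGHL
print(f"GC content of this fragment: {gc_content(dna):.0%}")
```

The translated fragment matches the first 20 residues of the protein sequence listed above. The GC value printed here describes only this 60 bp fragment, so it is not expected to equal the 63% reported for the whole gene.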