Gene Rleg_2056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2056 
Symbol 
ID8013085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2049305 
End bp2050279 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content63% 
IMG OID644824642 
Productproline iminopeptidase 
Protein accessionYP_002975873 
Protein GI241204777 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.604828 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCTC TCTATCCCGA AATCGAACCC TATGATCATG GCCTGCTCGA TACGGGCGAC 
GGCAATCTGA TCTATTGGGA GGCCTGCGGC AATCCGGCGG GCCGCCCGGC GCTGGTGCTT
CATGGCGGCC CTGGTTCCGG CTGTACGACC GCGGCGCGCC GCTATTTCGA TCCCGACGCC
CACCGAATCA TTCTGTTCGA TCAGCGCAAT TGCGGCCGCA GCCTGCCGAG CGCTGCCGAT
CCCGAAACCG ATCTCTCCCT CAACACCACC TGGCATATCG TTGCCGATAT CGAGCGGCTG
CGGGCCTGTC TCGGCATCGA TACCTGGCTC CTTTTCGGCA ATTCCTGGGG TTCGACGCTG
GCGCTGGCCT ATGCTGAAAC CCATCCGGAG TGTGTCGCCG CGATCGTCCT GTCAGGCGTG
ACCACCACCC GGCGCTCGGA AATCGACTGG CTCTATCGTG GCATGGCGCC GCTCTTTCCG
GAAGAATGGC AACGTTTCCG CCAGGCTGTT CCTCCTGGCA GCCAGGGACG GGACGAGGAC
ATGGTTGCAG CCTATCATCG TCTCCTCAAC GATGCGGACC CGGAAACGCG CCTCCAAGCG
GCGCGCGACT GGCATGATTG GGAGGCGGCC TCGATCCTGC TCGCCGATCC CCAAGGCCGG
CCGCGCCGCT GGGCCGATCC GGCCTGTTTG CTGACGCGCG CCCGCATCAT CACCCACTAC
TTCACCAACG GCGCATGGCT GGAGGACGCC CAGCTTTTGA AGAACACCGC GCGGCTCATC
GGCATTCCCG GTATCCTGCT GCAGGGAAGG CTCGACATCG AGGCGCCGCT CGTCACGGCC
TGGGAACTCG CCCGCGCCTG GCCGCAAAGC GAGCTCAGCA TCCTTCCGCA TGCTGCCCAT
TCCATCGCAA ATCCGGATAT GAGCGCGGCG ATTGTGACTG CCACCGATCG ATTTCGCGAT
TTTCCTCCAA AATAA
 
Protein sequence
MSALYPEIEP YDHGLLDTGD GNLIYWEACG NPAGRPALVL HGGPGSGCTT AARRYFDPDA 
HRIILFDQRN CGRSLPSAAD PETDLSLNTT WHIVADIERL RACLGIDTWL LFGNSWGSTL
ALAYAETHPE CVAAIVLSGV TTTRRSEIDW LYRGMAPLFP EEWQRFRQAV PPGSQGRDED
MVAAYHRLLN DADPETRLQA ARDWHDWEAA SILLADPQGR PRRWADPACL LTRARIITHY
FTNGAWLEDA QLLKNTARLI GIPGILLQGR LDIEAPLVTA WELARAWPQS ELSILPHAAH
SIANPDMSAA IVTATDRFRD FPPK