Gene Rleg_5371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5371 
Symbol 
ID8007329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp780835 
End bp782259 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content62% 
IMG OID644822275 
Productpeptidase M24 
Protein accessionYP_002973535 
Protein GI241113700 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.56285 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATGC ACGCGACAAA CGCAGGCGGC TATCGGATGG GATCGCTGCT GGCCGATTTC 
CAGCCGGATT TCGATTTCTC CGCGCCGCTG CCGCTTGCTG TCGAAGAGTT CGAGGACCGC
CTTCGCCGAA TTCGCCGTCA GGCGATCGAA GCCGGTCATG ACGCGCTGAT CGTCCATGCC
GGCAGCGTCG GCTGGTTCCA CGCTTCGAAC GCCTATCTGC GCTATATTTG CGACTGGATG
CGCGAAGGCG TGCTGATCAT CCCGACCGAC GCCGACAAGG CGATGGTGCT TCTGTCCTTC
TTCACCCAAT CCGTCCTGCT TCCGCCGGGC GGCGAGCCTG TGCTCGTCGA CGAAATCTGG
CAGATCGGTC CGATCGGCCG CGAATATGCC GACCGCCCCG GCGATTCCGT CATCAAGACT
GCCGAGAAAT GCGCCGAGGT TCTCGCCAGT CTCGGCCTCA CCAAGGCCCA GATCGGCAGG
ATCGGCGACC GCACGTCGCT GACCTTCTGG TCTGCACTCG AGGAATTGAT GCCGAAGAGC
AAGTTCGTGG CTGACAACGC CATTCTCGAC CGCATGCAGA AGGTCCGCTC GACGCGCGAG
ATCGAGATCT TCCGCGCCGC CGCCCAGCTG ATCAGCATCG GCACGCAGGC TGCCTATCAT
GTGGCAAAAT CAGGCGTGAC CGACCATGAA ATCCTCGCCG CCTTCACCTA TGCGCAGATG
GCACTCGGCG GCGAAACCGG CGACGGCTAC CAGATCGGCA TCAACGAATT CGGCACCCAT
TGCGGCAAGC CCTATGGCCA CATCGTCCGC CCAGGCGACC TCATCAACCT CTACATCTCC
AACGTCACCT ATCGCGGCTA TACCGCCCAG ACCGCCCGCA TGATCGCGAT TGGTGACATC
ACCAGCCGTC AGGAGGAGGT GCTTGCCGCC TGCACCGAGG GCGTCAAGCG GGCCGAAAAG
CTCATCAAGC CCGGCGCCTT GATGCGCGAC GTCAACAATG CTGCCTTTGA ACCGATGATC
GAGCGCGGCA TGCTCACCTC ACCCGAGGCA CGCACGATGC CCTATAACTG GTCGCCGATG
GAAGACGGCG GGGCACGCCT GATCCCCAAT CAGTATGTGA AGGACATCGA CTGGGAGGCG
CAGGGCCGCA AGCTCATGCA CGTCTATCCG GCAACGCACG GACCGCACAA TCCAAACCTC
GGCCATTCGG TCGGCATGGC TGGTGGCCAG AACAGCTTCA ACATCTCCTC ACATAACTAC
GACAGGATGG AGGAGGGCAT GGTCTTCGTG CTGCACACGC AGTGGCTGGA ACCGCTGTCG
GCCGGCTGCA ATATCGGCGA CATGTATGTC GTGACCAAGG ACGGCTTTGA GAACCTCAGC
CGCCATACCC CGCTTGAAAC CCGCCGCGTC GCTGCCGAGG CCTGA
 
Protein sequence
MNMHATNAGG YRMGSLLADF QPDFDFSAPL PLAVEEFEDR LRRIRRQAIE AGHDALIVHA 
GSVGWFHASN AYLRYICDWM REGVLIIPTD ADKAMVLLSF FTQSVLLPPG GEPVLVDEIW
QIGPIGREYA DRPGDSVIKT AEKCAEVLAS LGLTKAQIGR IGDRTSLTFW SALEELMPKS
KFVADNAILD RMQKVRSTRE IEIFRAAAQL ISIGTQAAYH VAKSGVTDHE ILAAFTYAQM
ALGGETGDGY QIGINEFGTH CGKPYGHIVR PGDLINLYIS NVTYRGYTAQ TARMIAIGDI
TSRQEEVLAA CTEGVKRAEK LIKPGALMRD VNNAAFEPMI ERGMLTSPEA RTMPYNWSPM
EDGGARLIPN QYVKDIDWEA QGRKLMHVYP ATHGPHNPNL GHSVGMAGGQ NSFNISSHNY
DRMEEGMVFV LHTQWLEPLS AGCNIGDMYV VTKDGFENLS RHTPLETRRV AAEA