Gene Rleg2_5133 details

Gene Information

Locus tag: Rleg2_5133
Symbol:
ID: 6978227
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 772180
End bp: 773604
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 63%
IMG OID: 643394264
Product: peptidase M24
Protein accession: YP_002279082
Protein GI: 209547164
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
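
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the record.

```python
# Values copied from the record above.
start_bp = 772180
end_bp = 773604
gene_length_bp = 1425
protein_length_aa = 474


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1425
print(expected_protein_length(gene_length_bp))  # 474
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.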


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCTA GCGCAACCAA TTCCGGTTCC TACCGCGTCG GATCGCTGCT TGCCGATTTC 
CAGCCGGATT TCGATTTCGA CGCTCCTCTG CCGCTGCCGG TCGAGGAATT CGAGGATCGC
CTGCGCCGCA TCCGCCGGCA GGCTGTGGAA GCCGGACATG ACGCGCTGAT CGTCCACACA
GGCGGCGTCG GCTGGTTCCA CACCTCCAAC CACTATCTGC GCTACATCTG CGACTGGATG
CGTGAAGGCG TGCTGATCAT CCCAACTGAT AATGACAAGC CGCTGACGCT CCTGTCCTTT
TTCACCCAGT CCGTGCTGCT GCCGCCGGGC GGCGAGCCGG TGCTGGTCGA GGACATCTGG
CAGATCGGCC CGATCGGCCG CGAATATGCC GATCGTCCGG GCGACAGCGT CGTCAAGACC
GCCGAGAAGT GCGCCGAACT CTTGGCCGGC ATGGGACTGG CCAAGGCTCA GATCGGCCGC
ATCGGTGACG GGACGTCGCT GACCTTCTGG ACAGCGCTCG ACGAACTGAT GCCGAAGGCG
AAGTTTGTCG CCGACAACGC CATTCTCGAC CGGATGCAGA AGGTGAGATC GCCGCGCGAG
ATCGCGCTAT TCCGCGCCGC TGCCCAGCTT ATTAGCATCG CCACCCAGGC GGCCTATCAC
GTCGCCAGGC CCGGCGTCAC CGACCATGAA ATCTATGCCG CTTTCACCCA GGCGCAGCTG
TCCTACGGCG GCGAGACCGG CGACGGCTAC CAGATCGGCA TCAACGAGTT CGGCACTCAT
TGTGGCAAAC CCTACGGCCA TGTCGTCCGG CCGGGCGACC TGATCAATCT CTATGTCTCC
AACGTCACCT ACCGGGGCTA CACGGCCCAG ACCGCCCGCA TGATCGCGAT TGGCGGGATC
ACCAAGCGGC AAGAAGCGGT GCTGGCCGCC TGCACCGAGG GCGTCAAGCG CGCCGAAAAG
CTGATCCGTC CCGGTGCCCT GATGCGTGAC GTCAATAACG CGGCCTTCGA GCCGATGATC
GAGCGAGGAA TGTTGGCCTC GCCTGAGGCG CGCAGCATGC CCTATAACTG GGCGCCGATG
GACGATGGCG GTGCCCGTCT TATCCCGCGC CAGTATGTGA AGAACATCGA CTGGGAAGCG
CAGGGCCGCA AGCTGATGCA CCTCTACCCG GCGACCCACG GGCCGCATAA CCCCAATCTC
GGCCATTCAG TCGGCATGGC CGGCGGGCAG AACAGTTTCA ACATCTCCTC CCACAATTAC
GATCGAATGG AAGAGGGCAT GGTGTTCGTG TTGCATACGC AGTGGCTGGA GCCTCTGTCG
GCCGGCTGCA ATGTTGGCGA CATGTATGTC GTCACCAAGG ATGGGTTCGA GAACCTCTCC
CGCCACACCC CGCTTGAAAC CCATCGCATC GCTGCCGAGG CCTGA
 
Protein sequence
MTSSATNSGS YRVGSLLADF QPDFDFDAPL PLPVEEFEDR LRRIRRQAVE AGHDALIVHT 
GGVGWFHTSN HYLRYICDWM REGVLIIPTD NDKPLTLLSF FTQSVLLPPG GEPVLVEDIW
QIGPIGREYA DRPGDSVVKT AEKCAELLAG MGLAKAQIGR IGDGTSLTFW TALDELMPKA
KFVADNAILD RMQKVRSPRE IALFRAAAQL ISIATQAAYH VARPGVTDHE IYAAFTQAQL
SYGGETGDGY QIGINEFGTH CGKPYGHVVR PGDLINLYVS NVTYRGYTAQ TARMIAIGGI
TKRQEAVLAA CTEGVKRAEK LIRPGALMRD VNNAAFEPMI ERGMLASPEA RSMPYNWAPM
DDGGARLIPR QYVKNIDWEA QGRKLMHLYP ATHGPHNPNL GHSVGMAGGQ NSFNISSHNY
DRMEEGMVFV LHTQWLEPLS AGCNVGDMYV VTKDGFENLS RHTPLETHRI AAEA
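
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the codon assignments of NCBI translation table 11, which match the standard code for elongation codons. Alternative start codons, which table 11 also permits, are not handled here because this gene begins with ATG. The `translate` helper and constant names are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_blocks = "ATGACCTCTA GCGCAACCAA TTCCGGTTCC"
print(translate(first_blocks))  # MTSSATNSGS
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function would be expected to reproduce the whole protein, although that has not been checked here.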