Gene Rleg_3024 details


Gene Information

Locus tag: Rleg_3024
Symbol:
ID: 8013939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3018344
End bp: 3019495
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 63%
IMG OID: 644825592
Product: peptidase M24
Protein accession: YP_002976820
Protein GI: 241205724
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
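
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, and a short check can confirm they agree. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3018344
END_BP = 3019495
GENE_LENGTH_BP = 1152
PROTEIN_LENGTH_AA = 383


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3019495 - 3018344 + 1 gives 1152 bp, and 1152 / 3 - 1 gives 383 aa, matching the listed lengths.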


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.504097
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACTGC ATTTCGAAAA GGCCGAATTC GCAAGCCGGC TTGCGCGCCT CACCGAGAAG 
ATGAAGGAAG AAAAGCTCGA CGCCTTGCTG CTCTTCGCCC AGGAAAGCAT GTACTGGCTG
ACCGGCTACG ACACCTTCGG CTATTGCTTC TTCCAGACGC TGGTCGTCAA GAGCGACGGC
ACCATGGCGC TGATCACCCG CTCGGCTGAT CTTCGCCAGG CCAGGCATAC CTCGATCCTC
GAGGACATCC ATATCTGGGT CGACCGGGTC AATGCCGACC CGACGCTCGA CCTGAAGAAC
CTGCTGGTTG AGCTGGACCT GCTCGGCGCC CGCATCGGCA TCGAATATGA TACCCACGGC
ATGACCGGCC GCGTCGCCCG GCTGCTCGAC GCGCAATTGA CCACCTTCGG CCAGATCGTC
GACGCCTCCT ACCTCGTCAG CCGGCTGCGC CTGATCAAGA GCCCGACGGA GGTCGCCTAT
GTCGAGCGCG CCGCCGCTCT CGCCGACGAT GCGCTCGATG CCGCGATCCG GTTGACAAAG
CCCGGCGCCG ACGAGGCGGA TATCCTCGCT GCCATGCAGG GTGCGATTTT TTCCGGCGGC
GGCGACTATC CCGCCAACGA GTTTATCATC GGCTCCGGCG CCGACGCGCT GCTCTGCCGC
TACAAGGCCG GCCGCCGCAA GCTCGACGCC AACGACCAGT TGACGCTCGA ATGGGCTGGC
GCCTATGCGC ATTACCATGC CGCCATGATG CGCACGATCG TCATCGGCGA GCCGACGCAT
CGCCACCGCG AGCTTTACAA CGCCTGCCGC GAAACCATCG AGGCGATCGA AACCGTGCTG
AAGCCCGGCC AGACCTTTGG CGATGTCTTC GACATGCATG CCAGGATCAT CGACGAGCGC
GGCCTGGCTC GCCACCGGCT GAATGCCTGC GGTTATTCGC TCGGCGCCCG CTTCTCGCCC
TCCTGGATGG AGCATCAGAT GTTCCATGTC GGCAATCCGC AGCCGATCGA GCCGAACATG
TCGCTCTTCG TGCACATGAT CATCGCCGAC TCAGATACGG GCACGGCGAT GACGCTCGGC
CAGACCTATC TGACGACAGC GGATGCGCCG CGCGCGCTAT CCTGCCATCC GCTCGATTTC
ATCGGGCTCT GA
 
Protein sequence
MALHFEKAEF ASRLARLTEK MKEEKLDALL LFAQESMYWL TGYDTFGYCF FQTLVVKSDG 
TMALITRSAD LRQARHTSIL EDIHIWVDRV NADPTLDLKN LLVELDLLGA RIGIEYDTHG
MTGRVARLLD AQLTTFGQIV DASYLVSRLR LIKSPTEVAY VERAAALADD ALDAAIRLTK
PGADEADILA AMQGAIFSGG GDYPANEFII GSGADALLCR YKAGRRKLDA NDQLTLEWAG
AYAHYHAAMM RTIVIGEPTH RHRELYNACR ETIEAIETVL KPGQTFGDVF DMHARIIDER
GLARHRLNAC GYSLGARFSP SWMEHQMFHV GNPQPIEPNM SLFVHMIIAD SDTGTAMTLG
QTYLTTADAP RALSCHPLDF IGL
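
The gene and protein sequences above can be cross-checked by translating the DNA. The following is a minimal sketch, not the method the database used: it builds the codon table from the standard NCBI amino acid string, whose amino acid assignments translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which this sketch does not handle; this gene starts with ATG). Only the first line and the final 12 bases of the gene sequence are embedded to keep the example short. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# First line and last 12 bases of the gene sequence, copied from this record.
GENE_HEAD = "ATGGCACTGC ATTTCGAAAA GGCCGAATTC GCAAGCCGGC TTGCGCGCCT CACCGAGAAG"
GENE_TAIL = "ATCGGGCTCT GA"


def clean(seq: str) -> str:
    """Remove the spacing used for 10-base grouping and normalise case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    print(translate(GENE_HEAD))  # MALHFEKAEFASRLARLTEK
    print(translate(GENE_TAIL))  # IGL
```

The two outputs correspond to the first 20 and last 3 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.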