Gene Rleg2_0924 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_0924 
Symbol 
ID6979642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp938095 
End bp939678 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content65% 
IMG OID643395635 
Productprotease Do 
Protein accessionYP_002280444 
Protein GI209548527 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.010153 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0275633 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAAGA ATTTCAACGG ACGTCCGTCC CTCGCCACTG TGCTCAAGGC TTCTACCGTC 
GCCGGTATCG CAGCCGCTGT GCTCGCAACC GGCGTTCCGC TCGAAATCAC CCGGTCTTAT
GCCGAAGCCG TCAAGGTTCA GGCGCCTGCC GTGCCGAGCT TCGCCAATGT CGTCGATGCC
GTTTCGCCGG CCGTCGTTTC CGTCCGCGTC GAAAACCGCG TCAATCCCGT CTCCGACAAT
GACGGCTTCT CCATCGAAGG CCGCGGCTTC GACGATCTTC CCGATGATCA TCCGCTGAAG
CGCTTCTTCA AGCAGTTCGG TGGCCAGGAC CCAAGTGATC AGCAGGGCCA TCAGCGGCGC
TTCGGCCAGA ACGGCCCGGG TGGCCAAAAT GGCCCCGGCG GCAAGGGCCG TCTGCGTCCG
GTCGCTCAGG GGTCCGGCTT CTTCATCTCC GAGGATGGCT ACATCGTTAC CAACAACCAC
GTCGTTTCCG ACGGCCAGGC CTTCGTCGCT GTCATGAATG ACGGCACCGA ACTCGATGCC
AAGCTGATCG GCAAGGATCC GCGCACCGAT CTCGCCGTCC TCAAGGTCGA CGGCAAGGGC
AAGAAGTTCA CCTACGTCAA CTGGGCCGAT GACAACAATG TCCGCGTCGG CGACTGGGTC
GTCGCCGTCG GCAACCCCTT CGGCCTCGGC GGCACGGTCA CGGCCGGCAT CGTTTCGGCC
CGCGGCCGCG ATATCGGCTC TGGCCCCTAT GACGATTACC TGCAGGTGGA TGCCGCCGTG
AACCGCGGCA ACTCCGGTGG CCCAACCTTC AACCTCAGCG GCGAAGTCGT CGGCATCAAC
ACCGCGATCT TCTCGCCGTC CGGCGGCAGC GTCGGTATCG CCTTCGCCAT TCCGGCCTCG
ACAGCCAGGG ATGTCGTCGC CGATCTGATG AAGGACGGCC AGGTTTCGCG TGGCTGGCTG
GGTGTCCAGA TCCAGCCGGT GACCAAGGAT ATCGCCGAAT CCATCGGCCT TTCCGAGCCG
AGCGGCGCCC TTGTCGTCGC GCCCCAGGCC GGGTCGCCCG GCGACAAGGC CGGCATGAAG
GCCGGCGACG TCGTTACCGC GCTGAACGGT GAAACGATCA AGGATGCGCG TGACCTCAGC
CGCCGCATTG GCGCGATGCA GCCGGGCAGC AAGGTCGAGC TTTCGGTCTG GCGTGCCGGC
AAGGCCCAGC CTCTGACCGT CGAACTCGGC ACGCTGCCGG CCGACCAGAA GGATGCGAAC
GCCGATGACA ACAGCCAGCC GCAGCAGCCG GAGGCACCGG CGTCCGAAAA GGCGCTTGCC
GATCTCGGCC TGACGGTCGG TCCTTCCGAT GACGGCAAGG GCCTGGCGAT CACCGGCATC
GACCCGGACT CCGACGCCGC CGACAAGGGC ATCAAGGAAG GCGAGAAGAT CACCTCGGTC
AACAACCAGG AAGTCTCCAG CCCCGCCGAT GTCGTCAAGG TGCTGAACCA GGCCAAGAAG
GACGGCCGCA CCCGGGCGCT CTTCCAGATC CAGTCGAGCG AAGGAAGCCG TTTCGTCGCT
CTTCCGATCA ACGGCCAGGG CTGA
 
Protein sequence
MLKNFNGRPS LATVLKASTV AGIAAAVLAT GVPLEITRSY AEAVKVQAPA VPSFANVVDA 
VSPAVVSVRV ENRVNPVSDN DGFSIEGRGF DDLPDDHPLK RFFKQFGGQD PSDQQGHQRR
FGQNGPGGQN GPGGKGRLRP VAQGSGFFIS EDGYIVTNNH VVSDGQAFVA VMNDGTELDA
KLIGKDPRTD LAVLKVDGKG KKFTYVNWAD DNNVRVGDWV VAVGNPFGLG GTVTAGIVSA
RGRDIGSGPY DDYLQVDAAV NRGNSGGPTF NLSGEVVGIN TAIFSPSGGS VGIAFAIPAS
TARDVVADLM KDGQVSRGWL GVQIQPVTKD IAESIGLSEP SGALVVAPQA GSPGDKAGMK
AGDVVTALNG ETIKDARDLS RRIGAMQPGS KVELSVWRAG KAQPLTVELG TLPADQKDAN
ADDNSQPQQP EAPASEKALA DLGLTVGPSD DGKGLAITGI DPDSDAADKG IKEGEKITSV
NNQEVSSPAD VVKVLNQAKK DGRTRALFQI QSSEGSRFVA LPINGQG