Gene Rleg2_0763 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_0763 
Symbol 
ID6979481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp778524 
End bp780029 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content66% 
IMG OID643395475 
Productprotease Do 
Protein accessionYP_002280284 
Protein GI209548367 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACACA TCCTCCGCAA ACATCGCACT GCTGCCCTCA TCGGGGCTGC CATCATCGCC 
GGCGCGGCAT GCCTGCCGTT TACCATTACC GCGGCGAACG CCGTTGCCTC GCCTGCGGAT
GCCGGCGGCA TTCTCGCCGC CAGCGGCTCT TTCGCCTCTA TCGTCGATGC CGACAAACCT
GCCGTCGTCA CCATCACCAC GACCATGAAG GCAACCGATG TCAGCGCCGA CCAGCAGCAG
TCGCCGATGG ACGAGCAGTT CCGCCAGTTT TTCGAGGATC AGGGCATCCC GCTGCCGCGC
CAGGCGCCGA AAAACCGGTC TTCGCAGCAG ACAATGGCGC TCGGCTCCGG CTTCATCATC
AGCCCAGACG GCGTGATCGT TACCAACAAC CATGTCATCG ACAATGCCGT CGACATCAAG
GTGACGCTGG ATGACGGCAC GGAACTGCCG GCCAAGCTGA TCGGCACCGA CCCGAAATCC
GATGTCGCCG TCGTGAAGAT AGAGGCGGGA AAGCCGCTGC AGACCATTGC CTGGGGCGAT
TCCGACAGGC TGAAGCTCGG CGACCAGATC CTGGCGATCG GCAACCCCTT CGGCATCGGC
ACCACGGTGA CGGCGGGCAT CGTCTCGGCG CGCGGCCGCG ACCTGCACAG CGGGCCCTAT
GACGATTTCA TCCAGATCGA CGCGCCGATC AACCATGGCA ACTCAGGCGG GCCGCTCGTC
GACCGCAGCG GCAATGTCGT CGGCATCAAC ACTGCGATCT ATTCGCCGAA CGGCGGCAGT
GTCGGCGTCG GCTTCGCCAT TCCGTCCGAC GAGGCCAAGG CGATCGTCGC CAAACTGCAG
AAGGACGGCT CGATCGATCA CGGTTATCTC GGCGTGCAGA TCCAGCCTGT GACGAAAGAC
GTCGCCGATG CCGTCGGCCT CGATAAAACA GGCGGCGCAC TGGTTGCCGC CGTCACCGCC
GATACGCCGG CGGCCCATGC CGGCGTGAAG CCGGGCGATA TCATCACCTC GGTCGGCGGC
GAGAGCGTCA AGACGCCGAA GGACCTGTCG CGGCTGGTCG CCGATCTTTC GCCAGGCGCC
AAAAAATCTC TCGGCATCTG GCGTGACGGC AAGACGATCG ATCTCAACGT CACCGTCGGC
GGCAATGAGG ACGGCCAGAA ACAGGCCGCC GCCGAAAGCT CGGACAGCAA GGGCGAGAGC
AGCGGCCAGC CGAGCCTCGG CATCGGCCTC GCCGACCTGA CGCCAGACGT GCGCGAGCAG
CTCACCCTGC CGCGCGCCGT CAGCGGCGCG GTGGTCGCCA GCGTCGATCC CGACAAGTCG
GCCGCGGCCG CCGGCATCCA GTCGGGCGAT GTCATCGTCT CGGTCAACGA CAGACCGGTC
CACAGCACCC GCGACGTCAA GACCGCGATT GCCGAGGCCG GCAAGGCTGG CCGCAAATCG
GTGCTACTGC TCGTCGAACG CGATGGCGGC AAGACCTTCG TCGCCGTGCC GTTCGGTGCG
GCCTGA
 
Protein sequence
MSHILRKHRT AALIGAAIIA GAACLPFTIT AANAVASPAD AGGILAASGS FASIVDADKP 
AVVTITTTMK ATDVSADQQQ SPMDEQFRQF FEDQGIPLPR QAPKNRSSQQ TMALGSGFII
SPDGVIVTNN HVIDNAVDIK VTLDDGTELP AKLIGTDPKS DVAVVKIEAG KPLQTIAWGD
SDRLKLGDQI LAIGNPFGIG TTVTAGIVSA RGRDLHSGPY DDFIQIDAPI NHGNSGGPLV
DRSGNVVGIN TAIYSPNGGS VGVGFAIPSD EAKAIVAKLQ KDGSIDHGYL GVQIQPVTKD
VADAVGLDKT GGALVAAVTA DTPAAHAGVK PGDIITSVGG ESVKTPKDLS RLVADLSPGA
KKSLGIWRDG KTIDLNVTVG GNEDGQKQAA AESSDSKGES SGQPSLGIGL ADLTPDVREQ
LTLPRAVSGA VVASVDPDKS AAAAGIQSGD VIVSVNDRPV HSTRDVKTAI AEAGKAGRKS
VLLLVERDGG KTFVAVPFGA A