Gene Rleg_0871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0871 
Symbol 
ID8012024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp861290 
End bp862792 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content65% 
IMG OID644823456 
Productprotease Do 
Protein accessionYP_002974707 
Protein GI241203611 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACACA TCCTCCGCAA CCATCGCACC GCAGCCCTTG TCGGGGCCGC CATCATCGCC 
GGCGCAGCCT GCCTGCCCCT CGCCCTCAAC GCCTCCAATG CCGTTGCTGC GCCTTCGGAT
AACGGCGGCA TTCTCGCCCC CAACGGCTCC TTCGCTTCCA TCGTCGAAGC CGACAAGCCT
GCGGTCGTCA CCATCACCAC GACGATGAAG GCGACCGATG TCAGCGCCGA CCAGGAATCG
CCGATGGACG AGCAGTTCCG CCAGTTCTTC GAGGATCAGG GCATCCCGCT GCCGCGCCAG
GCACCGCAAA AGCGGCCTTC GCAGCAGGCG ATGGCGCTCG GTTCCGGCTT CATCATCAGC
CGCGACGGGG TGATCGTCAC CAACAACCAT GTCATCGACA ATGCCGTCGA TATCAAGGTG
ACGCTGGATG ACGGCACGGA ACTGCCGGCC AAGCTGATCG GCACCGATCC GAAATCCGAT
GTCGCCGTGC TGAAGATAGA GGCGGGCAAG CCGCTGCAGA CTATCGCCTG GGGCGATTCC
GACAGGCTGA AGCTCGGCGA CCAGATTCTG GCGATCGGCA ACCCCTTCGG CATCGGCACC
ACGGTGACGG CAGGCATCGT CTCGGCGCGC GGCCGCGACC TGCACAGCGG GCCTTATGAC
GATTTCATCC AGATCGACGC GCCGATCAAC CATGGCAATT CCGGAGGACC GCTGGTCGAC
CGCAGCGGCA ATGTCGTCGG CATCAACACC GCCATCTATT CGCCGAACGG CGGCAGCGTC
GGCGTCGGTT TCGCCATTCC CTCCGACGAG GCCAAGGCGA TCGTCGCCAA GCTGCAGAAG
GACGGCTCGA TCGATCACGG CTATCTCGGC GTGCAGATCC AGCCGGTCAC CAAGGATGTC
GCCGATGCCG TCGGCCTCGA TAAGACCGGC GGCGCGCTGG TTGCCGCCGT CACCGCCGAT
ACGCCGGCCG CCCATGCCGG CCTGAAGCCC GGCGATATCG TCACGTCAGT CGGCGGCGAG
AGCGTCAAGA CGCCGAAAGA CCTGTCGCGC CTGGTCGCCG ACCTTTCGCC GGGCGCGAAA
AAATCCCTCA GCGTCTGGCG CGACGGCAAG ACGATCGATC TCAACGTCAC CGTCGGCACC
AATGAGGAAG GCCAGAAACA GGCGGCGGCC GAAAGCCCCG ACGCTCAAGA TCAGAGCTCC
GGCCAGCCGA GCCTCGGCAT CGGCCTCGCC GATCTGACGC CCGATGTGCG CCAGCAGCTC
AACCTGCCGC GCTCGATCAA CGGTGCGGTG GTCGCCAAGG TCGCCCCGGA CAAGTCAGCG
GCTGCCGCCG GCATCCAGTC CGGCGATGTC ATCGTCTCGG TGAATGACAG ACCTGTTCAT
AACGCCCGCG ACGTCAAGAC CGCAATTGCC GATGCCGGCA AGGCCGGCCG CAAGTCGGTG
CTGCTGCTCG TCGAACGCGA TGGCAACAAG ACCTTTGTCG CCGTGCCGTT TGGGGCGGCG
TGA
 
Protein sequence
MSHILRNHRT AALVGAAIIA GAACLPLALN ASNAVAAPSD NGGILAPNGS FASIVEADKP 
AVVTITTTMK ATDVSADQES PMDEQFRQFF EDQGIPLPRQ APQKRPSQQA MALGSGFIIS
RDGVIVTNNH VIDNAVDIKV TLDDGTELPA KLIGTDPKSD VAVLKIEAGK PLQTIAWGDS
DRLKLGDQIL AIGNPFGIGT TVTAGIVSAR GRDLHSGPYD DFIQIDAPIN HGNSGGPLVD
RSGNVVGINT AIYSPNGGSV GVGFAIPSDE AKAIVAKLQK DGSIDHGYLG VQIQPVTKDV
ADAVGLDKTG GALVAAVTAD TPAAHAGLKP GDIVTSVGGE SVKTPKDLSR LVADLSPGAK
KSLSVWRDGK TIDLNVTVGT NEEGQKQAAA ESPDAQDQSS GQPSLGIGLA DLTPDVRQQL
NLPRSINGAV VAKVAPDKSA AAAGIQSGDV IVSVNDRPVH NARDVKTAIA DAGKAGRKSV
LLLVERDGNK TFVAVPFGAA