Gene Rleg_1073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1073 
Symbol 
ID8012200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1046768 
End bp1048351 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content63% 
IMG OID644823656 
Productprotease Do 
Protein accessionYP_002974907 
Protein GI241203811 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.952362 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAAGA ATTTCAACGG ACGTCCGTCC CTCGCCACTG TGCTCAAGGC CTCTACCGTT 
GCCGGTATCG CAGCCGCTGT GCTCGCAACC GGCGTTCCGC TCGAAATCAC CCGGTCTTAT
GCCGAAGCTG TCAAGGTTCA GGCGCCCGCC GTTCCGAGCT TCGCCAATGT CGTCGACGCC
GTTTCGCCGG CCGTCGTTTC CGTCCGGGTC GAAAATCGCG TCAATCCCGT CTCCGACAAC
AACAATGACG GCTTCTCCTT CGATTTCAAC GGCCGCGGCT TCGACGACCT ACCCGACGAT
CATCCGCTGA AGCGGTTCTT CAAGCAGTTC GGCCAGGATC CGAATGATCA GCAGGGCCAT
TCCAGGCGCT TCGGCCAGAA CGGCCCGAAT GGTCCGGGCG GCAAGGGTCG CCTCCGCCCC
GTCGCCCAGG GCTCCGGCTT CTTCATCTCT GAGGACGGCT ACATCGTCAC CAACAATCAC
GTCGTTTCCG ATGGTCAGGC CTTCGTCGCC GTCATGAAAG ACGGCACCGA ACTCGATGCC
AAGCTGATCG GCAAGGATCC GCGCACCGAT CTCGCTGTGC TGAAGGTCGA CGGCAAGGGC
AAGAAGTTCA CCTACGTCAA CTGGGCCGAC GACAACAATG TCCGCGTCGG TGACTGGGTC
GTTGCCGTCG GCAATCCCTT CGGTCTCGGC GGCACGGTTA CAGCCGGCAT CGTCTCAGCT
CGCGGCCGTG ATATCGGTTC CGGTCCTTAT GACGATTATC TGCAGGTGGA TGCCGCCGTG
AACCGCGGCA ACTCCGGCGG TCCGACCTTC AACCTCAGCG GCGAAGTCGT CGGCATCAAC
ACCGCGATCT TCTCGCCGTC GGGCGGCAGC GTCGGCATCG CCTTCGCCAT TCCCGCCTCG
ACCGCCAAGG ACGTCGTCGC CGATCTGATG AAGGACGGCC AGGTTTCGCG CGGCTGGCTG
GGTGTCCAGA TCCAGCCGGT AACCAAGGAC ATCGCCGAAT CCATCGGCCT TTCCGAGCCG
AGCGGCGCCC TGGTCGTTGC CCCGCAGGCC GGGTCGCCGG GTGACAAGGC CGGCATGAAG
GCCGGCGACG TCGTCACCGC GCTGAATGGG GAGACGATCA AGGATGCCCG TGATCTCAGC
CGCCGTATCG GTGCGATGCA GCCGGGCAGC AAGGTTGAGC TTTCGGTCTG GCGCGCCGGC
AAGGCCCAGC CTCTCACCGT CGAACTCGGC ACGTTGCCGA TCGACCAGAA GGATGCGTCT
GCCGATGACA ACAGCCAGCC GCAGCAGCCT GAAGCACCGG CTTCCGAGAA GGCGCTTGCC
GATCTCGGCC TGACGGTCGG CCCGTCTGAC GACGGCAAGG GCCTGGCGAT AACAGACATC
GACCCGAACT CCGATGCCGC CGACAAGGGC ATTAAGGAAG GTGAGAAGAT CACCTCGGTC
AACAACCAGG AGGTCTCCAG CGCCGACGAC ATCGTCAAGG TGCTGAACCA AGCCAAGAAG
GACGGGCGCA CCCGCGCTCT CTTCCAGATC CAGTCCAGTG AGGGGAGCCG CTTCGTAGCG
CTTCCGATCA ACGGCCAGGG CTGA
 
Protein sequence
MLKNFNGRPS LATVLKASTV AGIAAAVLAT GVPLEITRSY AEAVKVQAPA VPSFANVVDA 
VSPAVVSVRV ENRVNPVSDN NNDGFSFDFN GRGFDDLPDD HPLKRFFKQF GQDPNDQQGH
SRRFGQNGPN GPGGKGRLRP VAQGSGFFIS EDGYIVTNNH VVSDGQAFVA VMKDGTELDA
KLIGKDPRTD LAVLKVDGKG KKFTYVNWAD DNNVRVGDWV VAVGNPFGLG GTVTAGIVSA
RGRDIGSGPY DDYLQVDAAV NRGNSGGPTF NLSGEVVGIN TAIFSPSGGS VGIAFAIPAS
TAKDVVADLM KDGQVSRGWL GVQIQPVTKD IAESIGLSEP SGALVVAPQA GSPGDKAGMK
AGDVVTALNG ETIKDARDLS RRIGAMQPGS KVELSVWRAG KAQPLTVELG TLPIDQKDAS
ADDNSQPQQP EAPASEKALA DLGLTVGPSD DGKGLAITDI DPNSDAADKG IKEGEKITSV
NNQEVSSADD IVKVLNQAKK DGRTRALFQI QSSEGSRFVA LPINGQG