Gene Rleg2_2539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2539 
Symbol 
ID6981281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2570194 
End bp2571930 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content64% 
IMG OID643397253 
Productprotease Do 
Protein accessionYP_002282038 
Protein GI209550121 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.874075 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCCCA CGAATCGCTC GCCCTTCAGA CGAACGCTCG CGCTTATGGC CAGCGCTGCA 
ATTCTTGCGC ATGCTGGCAT GAACGGGGTC GCCTATGCGC AAACCGCGCC GCAGACGACG
GCGCCCGGCG TTGCTACACC CGCTCCGGCT ACTCCGGAAA CGGCTGCTCC TGCGCCGACG
CCGCCTGCAA CGGCTGCACC GCAGCCGACC CCGCAAATGC AGGCCGCAAC CCCGAACAAC
GGTCCCGCTT CGGTCGCCGA TCTCGCCGAA GGGCTGCTCG ACGCCGTGGT CAACATCTCG
ACCTCGCAGA ATGTGAAGGA TGACGAGGGC GTCGGTCCGG CGCCGCGCGC GCCCGACGGC
TCGCCGTTCC AGGAGTTCTT CAACGACTTC TTCGACAAGA AGCAGGGCAA CAAGGGCCCG
AACCACAATG TCAGCTCGCT CGGCTCCGGC TTCGTCATCG ACCCGGCCGG CTATATCGTC
ACCAACAACC ATGTGATCGA GGGCGCCGAC GACATCGAGA TCAATTTCGC CAATGGTTCG
AAGCTCAAGG CGAAACTGAT CGGCACGGAT ACGAAGACCG ATCTTTCGGT GCTGAAGGTC
GAGCCGAAGG CACCGCTGAA ATCGGTGAAA TTCGGCGATT CCAGCACCAT GCGCATCGGC
GACTGGGTGA TGGCGATCGG CAATCCGTTC GGCTTCGGCG GTTCGGTGAC GGTCGGCATC
ATTTCCGGGC GCGGCCGCAA CATCAATGCT GGCCCCTATG ACAACTTCAT TCAGACGGAT
GCCGCGATCA ACAAGGGCAA TTCCGGCGGA CCGCTCTTCA ACATGAAGGG TGAGGTGATC
GGCATCAATA CGGCGATCAT TTCGCCGAGC GGCGGCTCGA TCGGCATCGG CTTCTCGGTG
CCTTCAGAGC TTGCCTCCGG CGTCGTCGAT CAATTGCGCG AATATGGTGA GACGCGGCGC
GGCTGGCTCG GTGTGCGCAT CCAGCCGGTC ACCGACGATA TCGCTGACAG TCTCGGGCTC
GACACTGCCA AGGGTGCTCT GGTCGCCGGC GTCATCAAGG GCGGCCCGGT CGACGACGGT
TCGATCAAGG CGGGTGACGT CATTTTGAAA TTCGACGGCA AGACCGTCAG CGAAATGCGC
GATCTGCCGC GCGTCGTGGC GGAAAGCTCG GTTGGCAAGG AAGTCGACGT GGTGGTGCTG
CGCGACGGCA AGGAGCAGAC CGTTAAGGTG AAACTCGGCC GGCTCGAAGA CAGCGACCAG
GCGGCAGCAT CCGGCGATGC GGCGCCCGAC GGTTCGCAGG ATGACGGCGT GATCACCCCG
GACCCCGGCG AGAACAACGA CATGGACGAG CCGGACTCCG GCGATCAGGC CCAGCCGGCA
CCAGGCGCGC CCACGCCGGA CCAACACCAG GGCCAGGTGT CACCGGATGC ATCAACACCG
AAGAACGTGC TCGGCCTGTC GCTGTCGCTT TTGAGCGCCG AGACGCGCAA GGCTTTCGGC
ATTGCCGAGA GCGTCGACGG TGTCGTCGTG ACGGAGGTGA CACCCGGCTC CGCCTCGGCC
GAAAAAGGGC TGAAGCCCGG CGACGTGATC GTGGAAGTGG CGCAGGAGTT TATGAAGTCG
CCGGACGCGG TCGCTGCCAA GGTGAAGTCG CTGAAGCAGG AAGGCCGCCG CAACGCCCAA
CTGATGATCG CATCGGCAAA TGGTGATCTG CGGTTTGTGG CGGTGCCAAT GGAGTAA
 
Protein sequence
MAPTNRSPFR RTLALMASAA ILAHAGMNGV AYAQTAPQTT APGVATPAPA TPETAAPAPT 
PPATAAPQPT PQMQAATPNN GPASVADLAE GLLDAVVNIS TSQNVKDDEG VGPAPRAPDG
SPFQEFFNDF FDKKQGNKGP NHNVSSLGSG FVIDPAGYIV TNNHVIEGAD DIEINFANGS
KLKAKLIGTD TKTDLSVLKV EPKAPLKSVK FGDSSTMRIG DWVMAIGNPF GFGGSVTVGI
ISGRGRNINA GPYDNFIQTD AAINKGNSGG PLFNMKGEVI GINTAIISPS GGSIGIGFSV
PSELASGVVD QLREYGETRR GWLGVRIQPV TDDIADSLGL DTAKGALVAG VIKGGPVDDG
SIKAGDVILK FDGKTVSEMR DLPRVVAESS VGKEVDVVVL RDGKEQTVKV KLGRLEDSDQ
AAASGDAAPD GSQDDGVITP DPGENNDMDE PDSGDQAQPA PGAPTPDQHQ GQVSPDASTP
KNVLGLSLSL LSAETRKAFG IAESVDGVVV TEVTPGSASA EKGLKPGDVI VEVAQEFMKS
PDAVAAKVKS LKQEGRRNAQ LMIASANGDL RFVAVPME