Gene Rleg2_3812 details

Gene Information

Locus tag: Rleg2_3812
Symbol:
ID: 6982575
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 3945413
End bp: 3946954
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 64%
IMG OID: 643398534
Product: peptidase M48 Ste24p
Protein accession: YP_002283300
Protein GI: 209551383
COG category: [R] General function prediction only
COG ID: [COG4784] Putative Zn-dependent protease
TIGRFAM ID:
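
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive (which matches the reported 1542 bp) and that the CDS includes its stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values copied from the record above.
START_BP, END_BP = 3945413, 3946954

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1542
print(protein_length(length_bp))  # 513
```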


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.685027
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGAGAA ACAGACTGGA GAGCTTGACG ACGTGGAAAT CGCCCGCGCT TTCCAGTGAT 
GCCCTCTTCG CGCCCCGGCG CTTCGCGCGT CGCCTGATGC TGCTTTCGGC CGTCGCACTG
ACGCTTAATG GCTGTCAAAC GCTGATCGAG CAATCCTATC AGCCGAGCGT CTCGCCTTCT
TCCAATCCGC AGATCGTCGA CGAGGTGCAG AAGAACGACC CGCGCGCGGC GATGGGCGCC
CGCGAACATC CGCGCATCGT CGCAAGCTAC GGCGGCGAAT ACAAGGACGC CAAGACCGAG
CGCCTCGTCG CCCGCATCGC CGGCGCGCTG ACGGCGGTGT CTGAGAATCC GAGCCAGTCC
TACCGCATCA CCATCCTGAA TTCACCGGCG ATCAACGCCT TCGCGCTGCC GGGTGGTTAT
CTCTACGTCA CCCGCGGCCT GCTCGCCCTT GCCAACGACG CCTCCGAAGT CGCCGCCGTG
CTGTCGCACG AGATGGGCCA TGTGACGGCA AACCACGGCA TCGAGCGGCA GAAGCGCGAA
GAGGCTGAGG TCATCGCCAG CCGCGTCGTC GCCGAGGTCC TTTCCAGCGA TATCGCCGGC
AAGCAGGCAC TGGCCCGCGG CAAACTGCGC CTTGCCGCCT TCTCCCGCCA GCAGGAACTA
CAGGCCGATG TCATCGGCGT ACGGATGCTC GGCGAAGCCG GCTACGACCC CTATTCCGCT
GCCCGCTTTC TCGATTCCAT GGCGGCGTAC AGCCGCTTCA TGTCGGTCGA TCCCGAAGCC
GACCAGAGCC TCGACTTCCT GTCGAGCCAT CCGAATTCCG CCCAGCGCAT CGAGCTTGCC
CGCACCCATG CGCGGGCCTT CGGCCAGGAA GGGTCAGTCG GCGACAAGGG CCGCGACTAT
TATCTCGACG GCATAGACGG TCTGCTCTAC GGCGATAGCC CTGAGGAAGG CTATGTGCGC
GGCCAGACCT TCCTGCATGG AGGCCTCGGC ATCCGCTTCG ACGTGCCGCC GGACTTCCAC
ATCGACAACA AGGTCGAAGC CGTGATGGCC ACGGGCCCGA ACGACATCGC CGTCCGCTTC
GACGGCGTCG CCGACAATCA GAACCAGAGC CTCACCAACT ATATTTCCAG CGGCTGGGTG
ACCGGCCTCG ACCCGTCGAC CATCCAGCCG GTTACCATCA ACGGCATGGA AGCAGCCACA
GCACGCGCAA GTGCGGATCG CTGGGATTTC GATGTCACCG TGATCCGCAA CAATTCGCAG
ATCTTCCGTT TCCTGACCGC CGTGCCGAAA GGCAGCGACG CCCTCGAGCC GACCGCCAAT
GTCCTGCGCG CAAGTTTCCG GCGCATGACG CCGGCCGAGG CAGCCTCCCT GAAGCCGCTG
CGCATCCGTG TCGTCACCGT CCGGCCGGGT GAAAACATCT CGACGCTCGC CGCCCGCATG
ATGGGCACCG ACCGCAAGCT CGATCTCTTC AAACTCATCA ATGCCTTGCC CACGGGTGCA
GCCGTTTCAC CCGGCGATCG CGTCAAGATC ATCGCTGAAT AA
 
Protein sequence
MRRNRLESLT TWKSPALSSD ALFAPRRFAR RLMLLSAVAL TLNGCQTLIE QSYQPSVSPS 
SNPQIVDEVQ KNDPRAAMGA REHPRIVASY GGEYKDAKTE RLVARIAGAL TAVSENPSQS
YRITILNSPA INAFALPGGY LYVTRGLLAL ANDASEVAAV LSHEMGHVTA NHGIERQKRE
EAEVIASRVV AEVLSSDIAG KQALARGKLR LAAFSRQQEL QADVIGVRML GEAGYDPYSA
ARFLDSMAAY SRFMSVDPEA DQSLDFLSSH PNSAQRIELA RTHARAFGQE GSVGDKGRDY
YLDGIDGLLY GDSPEEGYVR GQTFLHGGLG IRFDVPPDFH IDNKVEAVMA TGPNDIAVRF
DGVADNQNQS LTNYISSGWV TGLDPSTIQP VTINGMEAAT ARASADRWDF DVTVIRNNSQ
IFRFLTAVPK GSDALEPTAN VLRASFRRMT PAEAASLKPL RIRVVTVRPG ENISTLAARM
MGTDRKLDLF KLINALPTGA AVSPGDRVKI IAE
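
The gene and protein sequences above can be checked against each other, and against the GC content field, with a short script. This is a minimal sketch that assumes the sequences are pasted as whitespace-separated blocks, as shown above. The codon table uses the amino acid assignments of NCBI translation table 11, which match the standard code. Alternative start codons are not handled, which is fine here because this gene starts with ATG. All function names are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence above.
first_row = (
    "ATGCGGAGAA ACAGACTGGA GAGCTTGACG "
    "ACGTGGAAAT CGCCCGCGCT TTCCAGTGAT"
)
print(translate(first_row))   # MRRNRLESLTTWKSPALSSD
print(gc_content(first_row))
```

Running `translate` and `gc_content` on the full gene sequence should allow a comparison with the listed protein sequence and the reported 64% GC content.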