Gene Rleg_4141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4141 
Symbol 
ID8014935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4226031 
End bp4227479 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content64% 
IMG OID644826711 
Productpeptidase M48 Ste24p 
Protein accessionYP_002977921 
Protein GI241206825 
COG category[R] General function prediction only 
COG ID[COG4784] Putative Zn-dependent protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0703431 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.108462 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCTGC TTTCGGCCGT CGCAATGGCG CTGAATGGCT GCCAAACGCT GATCGACCAA 
TCCTATCAGC CGAGTGTTTC GCCGTCTTCC AATCCACAGA TCGTCGACGA GGTGCAGAAA
AACGACCCGC GCGCGGCGAT GGGCGCCCGC GAGCATCCGC GCATCGTGGC AAGCTACGGC
GGCGAATACA AGGACGCCAA AACCGAGCGC CTCGTCGCCC GCATCGCCGG CGCGCTGACG
GCGGTGTCGG AAAATCCGAG CCAGTCCTAC CGCATCACCA TCCTGAATTC GCCGGCGATC
AACGCCTTTG CGCTGCCGGG CGGTTATCTC TACGTCACCC GCGGCCTGCT CGCCCTTGCC
AACGACGCTT CGGAAGTTGC CGCCGTGCTG TCGCACGAAA TGGGCCATGT GACGGCGAAC
CACGGCATCG AGCGGCAGAA GCGCGAAGAG GCTGAGGTTA TCGCCAGCCG CGTCGTCGCC
GAAGTCCTTT CCAGCGACAT CGCCGGCAAG CAGGCGCTTG CCCGCGGCAA GCTGCGGCTC
GCCGCCTTCT CCCGCCAGCA GGAGCTGCAG GCCGATGTCA TCGGTGTGCG CATGCTCGGT
GAAGCTGGCT ATGATCCCTA TGCCGCTGCC CGTTTCCTCG ATTCGATGGC GGCTTACAGT
CGCTTCATGT CGGTTGATCC CGAAGCCGAC CAGAGCCTCG ACTTCCTGTC GAGCCATCCG
AATTCGGCTC AGCGCATAGA GCTCGCCCGC ACCCACGCCC GCGCCTTCGG CCAGGAAGGC
TCGGTCGGCG ACAAGGGCCG CGATTATTAT CTCGACGGCA TAGACGGACT GCTCTACGGC
GACAGCCCGG AAGAAGGCTA TGTGCGCGGC CAGACCTTCC TGCATGGCGG CCTCGGCATC
CGCTTCGACG TGCCGCCGGA TTTCCACATC GACAACAAGG TCGAGGCGGT GATGGCCACC
GGTCCGAACG ACATCGCCGT CCGCTTCGAC GGCGTCGCCG ACAATCAGAA CCAGAGCCTC
ACCAACTATA TCTCCAGCGG CTGGGTAACC GGCCTCGACC CGTCGACCAT CCAACCGATC
ACCATCAACG GCATGGAAGC AGCCACCGCG CGCGCCAGCG CCGACCGCTG GGATTTCGAT
GTCACCGTGA TCCGCAACAA TTCGCAGATC TTCCGTTTCC TGACCGCCGT GCCGAAAGGC
AGCGGCGCCC TTGAGCCAAC GGCGAATGTG CTGCGCGCGA GTTTCCGCCG CATGACGCCG
GCAGAGGCCG CCTCGCTGAA ACCGCTGCGC ATCCGCGTCG TCACCGTCCG GCCGGGTGAG
AACATCTCGA CGCTCGCCGC CCGCATGATG GGCACAGACC GCAAGCTCGA TCTCTTCAAG
CTCATCAATG CCCTGCCCAC GGGTGCAGCC GTTTCTATAG GCGATCGCGT CAAGATCATC
GCCGAATAA
 
Protein sequence
MMLLSAVAMA LNGCQTLIDQ SYQPSVSPSS NPQIVDEVQK NDPRAAMGAR EHPRIVASYG 
GEYKDAKTER LVARIAGALT AVSENPSQSY RITILNSPAI NAFALPGGYL YVTRGLLALA
NDASEVAAVL SHEMGHVTAN HGIERQKREE AEVIASRVVA EVLSSDIAGK QALARGKLRL
AAFSRQQELQ ADVIGVRMLG EAGYDPYAAA RFLDSMAAYS RFMSVDPEAD QSLDFLSSHP
NSAQRIELAR THARAFGQEG SVGDKGRDYY LDGIDGLLYG DSPEEGYVRG QTFLHGGLGI
RFDVPPDFHI DNKVEAVMAT GPNDIAVRFD GVADNQNQSL TNYISSGWVT GLDPSTIQPI
TINGMEAATA RASADRWDFD VTVIRNNSQI FRFLTAVPKG SGALEPTANV LRASFRRMTP
AEAASLKPLR IRVVTVRPGE NISTLAARMM GTDRKLDLFK LINALPTGAA VSIGDRVKII
AE