Gene Rleg_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1950 
Symbol 
ID8012989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1938519 
End bp1941365 
Gene Length2847 bp 
Protein Length948 aa 
Translation table11 
GC content61% 
IMG OID644824539 
Productpeptidase M16 domain protein 
Protein accessionYP_002975771 
Protein GI241204675 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCACA AAAAAATGGA ATGCTCCACG GCGTGGTGGT TTTTTGCGAC GGTCTCCCTT 
GTCGCAAACT TCGCGTTACC AGCCTATGCG GACACCTCGT CCGTGCCCTG GCCACAAACG
CAAAGCGACA TGCAAGCCGA GTCCGACGTG CATTTTGGCA CGCTCGCCAA CGGTATGCGG
TTTGCGATCA TGCGCAATGT CACGCCGCCC GGACAGGCAG CGATCCGCTT TCGCATTGGC
TCCGGTTCGC TCGACGAAAA CGACGACCAG CAGGGCCTGG CGCATGTTCT TGAGCACATG
GCCTTCAAGG GTTCGACACA TGTCGCCGAA GGGGAGATGA TCCGTATCTT GCAGCGCAAG
GGCTTGGCCT TTGGACCGGA CACCAATGCC CATACCTCCT ATGACGAGAC CGTCTATGCG
CTCGATCTGC CCGAGGTCGA TGCAGACACA ATTTCGACGG GCCTGATGCT GATGCTAGAA
ACGGCGAGCG AGCTGACCCT CGATGCCGGC GCCTTCGATC GCGAACGCGG TGTCATCCTG
TCGGAGGAGC GGCTGCGCGA CACGCCGCAG TATCGCGCGT CACTCGGAAT CATGAATTCG
CTGCTCGCCG GCCAGCGCGC GACCATGCGC GCGCCGATAG GTAAAGCCGA CATCATCAGC
AATGCGCCCG TGGACCTCGT CCGTGATTAT TACGGGGCCA ATTACCGACC CGATCGGGCA
ACGCTGATAG TGGTGGGCGA TATCGACCCC GCCGCCATGG AAGTCGAAAT CCGGCAGCGC
TTCGGCGACT GGAAGGCCGT GGGTCCGGCG CCGACAAAAG CGGATCTTGG CGCGCTGGAG
ACGAAAGGCG AAAGCGCCGA GGTCATCGTC GTTCCCGGCG GCATGACCAG CATACAGATC
GCCTGGACGC GTCCCTATGA CGCCGCGCCT GACACCTTCG CCAAGCGCCG CGCTGGGCTT
ATTGAGGATC TCGGTTTCCT GGTGCTCAAA CGTCGGGTGA GCGCCATCGC CAGCAAGGCG
GATGCCCCTT TCATCAGTGC GGACGTCGGC TCCCAGGATC TCCTCGATTC CGCCCATGTC
GTCCTGATCG CGGCGAACTC CGAGCCGGAC AAATGGCAGG CGGCGCTCAC GGCCATTGAC
CAGGAACAGC GCCGGATCCA GGAGTTCGGC GTTGCGCAGG CGGAGATCGA TCGCGAAATT
CGCGAATATC GCTCGGCCCT GCAAGCTGCT GCGGCCGGAG CCGCGACGCG GATGACGACC
GACATTGCTT CCATGCTGGC TCGCAGCGTC GATGACGATC AAGTCTTCAC CTCGCCCGCC
GAAGACCTCT CTATGTTCGA GACGATGACG AACGGCGTCA CGGCGGACGA GGTCAATGGG
GCCTTGCAGC GTGCTTTCTC CGGCAACGGT CCGCAGGTCG TGCTACAGGC GGACCAATCA
CCTGAGGGTG GAGCCGACAC GGTTCGGCAA GTCTATGACG CTTCAAATGC CATTGCCGTC
TCGGCACCAT CAGGTGCAAC TGATGTCGCC TGGCCTTACA CCCATTTCGG CGAACCGGGC
GCTGTGGTCG AACGCCGTGC GGTTGAAGAT CTCGGCTTGA CCATGGTGCG CTTTTCCAAC
GGCATTCTGC TTACCGTCAA GCCAACCAGG CTGCGTGCCA ACGAAGTGCT GGTACGCGAA
GATATCGGCC GCGGTCGGCT GGACCTGCCG CACGACCGTT CCGCTGCGAT CTGGGCATCT
CCGGCCGTCG TGCTGTCTGG CGTAAAGGCC ATGGATTACC AGGATATACA GAAAGCGCTG
ACCGCCAACA TTGTCGGCGT CGACTTCTCG GTCGGCGACA GTTCCTTCAG GTTCGACGGT
CGTACACGGA CTGAAGATCT TGCGACGCAG TTGCAGCTGA TGAGCGCATA CACCTCCGAT
CCAGCCTATC GCCCCGAGGC GTTCAAGCGC GTGCAGCAGG CCTATTTGAG CGGCCTCGAT
CAGTACAACG CGTCGCCCGG CGGCGTTTTC AGCCGCGATT TCGCAGGTCT CGTGCATTCC
GGCGACCCGC GCTGGACCTT CCCCGACCGC GCGCAGTTGT CCGCCGCCAA GCCAGACGAA
TTCGAGGCGC TGTTCCGGCC CATGGTTTCC AATGGCCCCA TCGACATCAC CATCGTCGGC
GACGTAACAG TGGACGACGC AATCCGCCTG ACGGCTGAAA CCTTTGGCGC TTTGCCGCCG
CGCCCGGAGA CGGCGTCAAG CAACGATCGG GACGACGTGC ATTTTCCGGC GACGACCGAG
AAGCCCGTTT TGCAGACCCA TAGCGGTAGG GCAGATAATG CCGCCGCCGC CGTAGGGGCT
TCCATCGGAG ATTTGCTCTC CGATCTGCCG CGGTCCTTCA CCGCCAATAT TGCCACCCAG
ATTTTCCAAA ACAGGTTGAT CGACCAGTTT CGCATTGCAG AAGGAGCAAG TTATGCCCTG
CAGGGCGATG TTGAGCTTTC AAGGGAAGTT CCCGGCTACG GCTACGCATA TTTCTACGTC
GAGACCGACC CGGCAAAGGT TGCGCGCTTC TATGAACTTG TCGACGAGAC CGCCAATGAT
CTGCGGTCGC ATGATGTCTC CGAAGACGAG CTCGCGCGCG CCCGGGGACC CATCATCGAG
ACATTGAAGC ATCAGCAGCA GAGCAACGAG TATTGGATCG AATACCTGCA CCACGCCCAA
GAGGATTCGC GTCGTTTAGA CCGGATACGC GATAGTCTCA GCGGCTACGG CAAGGTCACC
GCCGGGGATA TCCGCGCGTT TGCCGCGGCC TATTTGAGCC CGGAAAAATT CTGGAAATTC
GAAGTGCTGC CGGTGGTGGT ACGATAG
 
Protein sequence
MSHKKMECST AWWFFATVSL VANFALPAYA DTSSVPWPQT QSDMQAESDV HFGTLANGMR 
FAIMRNVTPP GQAAIRFRIG SGSLDENDDQ QGLAHVLEHM AFKGSTHVAE GEMIRILQRK
GLAFGPDTNA HTSYDETVYA LDLPEVDADT ISTGLMLMLE TASELTLDAG AFDRERGVIL
SEERLRDTPQ YRASLGIMNS LLAGQRATMR APIGKADIIS NAPVDLVRDY YGANYRPDRA
TLIVVGDIDP AAMEVEIRQR FGDWKAVGPA PTKADLGALE TKGESAEVIV VPGGMTSIQI
AWTRPYDAAP DTFAKRRAGL IEDLGFLVLK RRVSAIASKA DAPFISADVG SQDLLDSAHV
VLIAANSEPD KWQAALTAID QEQRRIQEFG VAQAEIDREI REYRSALQAA AAGAATRMTT
DIASMLARSV DDDQVFTSPA EDLSMFETMT NGVTADEVNG ALQRAFSGNG PQVVLQADQS
PEGGADTVRQ VYDASNAIAV SAPSGATDVA WPYTHFGEPG AVVERRAVED LGLTMVRFSN
GILLTVKPTR LRANEVLVRE DIGRGRLDLP HDRSAAIWAS PAVVLSGVKA MDYQDIQKAL
TANIVGVDFS VGDSSFRFDG RTRTEDLATQ LQLMSAYTSD PAYRPEAFKR VQQAYLSGLD
QYNASPGGVF SRDFAGLVHS GDPRWTFPDR AQLSAAKPDE FEALFRPMVS NGPIDITIVG
DVTVDDAIRL TAETFGALPP RPETASSNDR DDVHFPATTE KPVLQTHSGR ADNAAAAVGA
SIGDLLSDLP RSFTANIATQ IFQNRLIDQF RIAEGASYAL QGDVELSREV PGYGYAYFYV
ETDPAKVARF YELVDETAND LRSHDVSEDE LARARGPIIE TLKHQQQSNE YWIEYLHHAQ
EDSRRLDRIR DSLSGYGKVT AGDIRAFAAA YLSPEKFWKF EVLPVVVR