Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1950 |
Symbol | |
ID | 8012989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1938519 |
End bp | 1941365 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644824539 |
Product | peptidase M16 domain protein |
Protein accession | YP_002975771 |
Protein GI | 241204675 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCACA AAAAAATGGA ATGCTCCACG GCGTGGTGGT TTTTTGCGAC GGTCTCCCTT GTCGCAAACT TCGCGTTACC AGCCTATGCG GACACCTCGT CCGTGCCCTG GCCACAAACG CAAAGCGACA TGCAAGCCGA GTCCGACGTG CATTTTGGCA CGCTCGCCAA CGGTATGCGG TTTGCGATCA TGCGCAATGT CACGCCGCCC GGACAGGCAG CGATCCGCTT TCGCATTGGC TCCGGTTCGC TCGACGAAAA CGACGACCAG CAGGGCCTGG CGCATGTTCT TGAGCACATG GCCTTCAAGG GTTCGACACA TGTCGCCGAA GGGGAGATGA TCCGTATCTT GCAGCGCAAG GGCTTGGCCT TTGGACCGGA CACCAATGCC CATACCTCCT ATGACGAGAC CGTCTATGCG CTCGATCTGC CCGAGGTCGA TGCAGACACA ATTTCGACGG GCCTGATGCT GATGCTAGAA ACGGCGAGCG AGCTGACCCT CGATGCCGGC GCCTTCGATC GCGAACGCGG TGTCATCCTG TCGGAGGAGC GGCTGCGCGA CACGCCGCAG TATCGCGCGT CACTCGGAAT CATGAATTCG CTGCTCGCCG GCCAGCGCGC GACCATGCGC GCGCCGATAG GTAAAGCCGA CATCATCAGC AATGCGCCCG TGGACCTCGT CCGTGATTAT TACGGGGCCA ATTACCGACC CGATCGGGCA ACGCTGATAG TGGTGGGCGA TATCGACCCC GCCGCCATGG AAGTCGAAAT CCGGCAGCGC TTCGGCGACT GGAAGGCCGT GGGTCCGGCG CCGACAAAAG CGGATCTTGG CGCGCTGGAG ACGAAAGGCG AAAGCGCCGA GGTCATCGTC GTTCCCGGCG GCATGACCAG CATACAGATC GCCTGGACGC GTCCCTATGA CGCCGCGCCT GACACCTTCG CCAAGCGCCG CGCTGGGCTT ATTGAGGATC TCGGTTTCCT GGTGCTCAAA CGTCGGGTGA GCGCCATCGC CAGCAAGGCG GATGCCCCTT TCATCAGTGC GGACGTCGGC TCCCAGGATC TCCTCGATTC CGCCCATGTC GTCCTGATCG CGGCGAACTC CGAGCCGGAC AAATGGCAGG CGGCGCTCAC GGCCATTGAC CAGGAACAGC GCCGGATCCA GGAGTTCGGC GTTGCGCAGG CGGAGATCGA TCGCGAAATT CGCGAATATC GCTCGGCCCT GCAAGCTGCT GCGGCCGGAG CCGCGACGCG GATGACGACC GACATTGCTT CCATGCTGGC TCGCAGCGTC GATGACGATC AAGTCTTCAC CTCGCCCGCC GAAGACCTCT CTATGTTCGA GACGATGACG AACGGCGTCA CGGCGGACGA GGTCAATGGG GCCTTGCAGC GTGCTTTCTC CGGCAACGGT CCGCAGGTCG TGCTACAGGC GGACCAATCA CCTGAGGGTG GAGCCGACAC GGTTCGGCAA GTCTATGACG CTTCAAATGC CATTGCCGTC TCGGCACCAT CAGGTGCAAC TGATGTCGCC TGGCCTTACA CCCATTTCGG CGAACCGGGC GCTGTGGTCG AACGCCGTGC GGTTGAAGAT CTCGGCTTGA CCATGGTGCG CTTTTCCAAC GGCATTCTGC TTACCGTCAA GCCAACCAGG CTGCGTGCCA ACGAAGTGCT GGTACGCGAA GATATCGGCC GCGGTCGGCT GGACCTGCCG CACGACCGTT CCGCTGCGAT CTGGGCATCT CCGGCCGTCG TGCTGTCTGG CGTAAAGGCC ATGGATTACC AGGATATACA GAAAGCGCTG ACCGCCAACA TTGTCGGCGT CGACTTCTCG GTCGGCGACA GTTCCTTCAG GTTCGACGGT CGTACACGGA CTGAAGATCT TGCGACGCAG TTGCAGCTGA TGAGCGCATA CACCTCCGAT CCAGCCTATC GCCCCGAGGC GTTCAAGCGC GTGCAGCAGG CCTATTTGAG CGGCCTCGAT CAGTACAACG CGTCGCCCGG CGGCGTTTTC AGCCGCGATT TCGCAGGTCT CGTGCATTCC GGCGACCCGC GCTGGACCTT CCCCGACCGC GCGCAGTTGT CCGCCGCCAA GCCAGACGAA TTCGAGGCGC TGTTCCGGCC CATGGTTTCC AATGGCCCCA TCGACATCAC CATCGTCGGC GACGTAACAG TGGACGACGC AATCCGCCTG ACGGCTGAAA CCTTTGGCGC TTTGCCGCCG CGCCCGGAGA CGGCGTCAAG CAACGATCGG GACGACGTGC ATTTTCCGGC GACGACCGAG AAGCCCGTTT TGCAGACCCA TAGCGGTAGG GCAGATAATG CCGCCGCCGC CGTAGGGGCT TCCATCGGAG ATTTGCTCTC CGATCTGCCG CGGTCCTTCA CCGCCAATAT TGCCACCCAG ATTTTCCAAA ACAGGTTGAT CGACCAGTTT CGCATTGCAG AAGGAGCAAG TTATGCCCTG CAGGGCGATG TTGAGCTTTC AAGGGAAGTT CCCGGCTACG GCTACGCATA TTTCTACGTC GAGACCGACC CGGCAAAGGT TGCGCGCTTC TATGAACTTG TCGACGAGAC CGCCAATGAT CTGCGGTCGC ATGATGTCTC CGAAGACGAG CTCGCGCGCG CCCGGGGACC CATCATCGAG ACATTGAAGC ATCAGCAGCA GAGCAACGAG TATTGGATCG AATACCTGCA CCACGCCCAA GAGGATTCGC GTCGTTTAGA CCGGATACGC GATAGTCTCA GCGGCTACGG CAAGGTCACC GCCGGGGATA TCCGCGCGTT TGCCGCGGCC TATTTGAGCC CGGAAAAATT CTGGAAATTC GAAGTGCTGC CGGTGGTGGT ACGATAG
|
Protein sequence | MSHKKMECST AWWFFATVSL VANFALPAYA DTSSVPWPQT QSDMQAESDV HFGTLANGMR FAIMRNVTPP GQAAIRFRIG SGSLDENDDQ QGLAHVLEHM AFKGSTHVAE GEMIRILQRK GLAFGPDTNA HTSYDETVYA LDLPEVDADT ISTGLMLMLE TASELTLDAG AFDRERGVIL SEERLRDTPQ YRASLGIMNS LLAGQRATMR APIGKADIIS NAPVDLVRDY YGANYRPDRA TLIVVGDIDP AAMEVEIRQR FGDWKAVGPA PTKADLGALE TKGESAEVIV VPGGMTSIQI AWTRPYDAAP DTFAKRRAGL IEDLGFLVLK RRVSAIASKA DAPFISADVG SQDLLDSAHV VLIAANSEPD KWQAALTAID QEQRRIQEFG VAQAEIDREI REYRSALQAA AAGAATRMTT DIASMLARSV DDDQVFTSPA EDLSMFETMT NGVTADEVNG ALQRAFSGNG PQVVLQADQS PEGGADTVRQ VYDASNAIAV SAPSGATDVA WPYTHFGEPG AVVERRAVED LGLTMVRFSN GILLTVKPTR LRANEVLVRE DIGRGRLDLP HDRSAAIWAS PAVVLSGVKA MDYQDIQKAL TANIVGVDFS VGDSSFRFDG RTRTEDLATQ LQLMSAYTSD PAYRPEAFKR VQQAYLSGLD QYNASPGGVF SRDFAGLVHS GDPRWTFPDR AQLSAAKPDE FEALFRPMVS NGPIDITIVG DVTVDDAIRL TAETFGALPP RPETASSNDR DDVHFPATTE KPVLQTHSGR ADNAAAAVGA SIGDLLSDLP RSFTANIATQ IFQNRLIDQF RIAEGASYAL QGDVELSREV PGYGYAYFYV ETDPAKVARF YELVDETAND LRSHDVSEDE LARARGPIIE TLKHQQQSNE YWIEYLHHAQ EDSRRLDRIR DSLSGYGKVT AGDIRAFAAA YLSPEKFWKF EVLPVVVR
|
| |