Gene Information |
Locus tag | Rleg2_3520 |
Symbol | |
ID | 6982280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3646245 |
End bp | 3648167 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643398244 |
Product | Peptidase M23 |
Protein accession | YP_002283013 |
Protein GI | 209551096 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
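Note: the coordinates and lengths listed above agree with each other if Start bp and End bp are read as 1-based, inclusive positions and the final stop codon is not counted in the protein length. Both readings are assumptions about the coordinate convention; the record itself does not state them. A minimal Python sketch of that check follows (the function names are illustrative, not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(3646245, 3648167)
print(length_bp)                  # 1923
print(protein_length(length_bp))  # 640
```

Both results match the Gene Length (1923 bp) and Protein Length (640 aa) fields.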
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.259567 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGCT CGTTGGGCAA CGAGCCTCCG CTTCTGGCCG ACGGCAGGCG CGCGCCCGAC CGCCGCGAAG TGTCTCTCCG CTGGCTCTCG GGTACGTTCC TCACCGGCAT TACATCCTCG GTCCTGATGG GCGTCGCGCT TTTTGCCGCC CTTGACGGCC GCCAGCAACT GGCAATCCCC GCCGAAGCCT ATGCAAGTGC TGCGGCAGAC GCGCACGAAG ATACGACAGT GGTGCGCGGC GGAAGGCTGA TCGCGCCTGC CATCGCCGCA AAACCGTCCG ACCGGGCCAT CATGGAAGTT TCCACAGTCG TCCACGACGG TGAAAAGGAA GTCGTGCGCC GCCAGCCCTT CGCACATGTG AAGATGACGC TGGCCGCCAA CCATGTGGCG ACCGAGGACT ATCCAGACTT CGATCCCCTG GCGATCTTTT CCGCCGACGA GCCGCAGCCC GCCCCGCAAA GCCGCACCGG CGCGATTTAC GGCTCCGATG TCGAATCCGA AGTCAGCCTG AAGACCGTTT CCTTCCCGAC CGGCAAGACC AGCATGAAGA TGGCCTCCGG CCTGTCGCTC GAGGAGGTTG AAGAGAATGT GCGCTCCAAC GGCTCGGTGC TGACGGATGG CAACACCCAG CTTGCAGCCC TCTATTACGT CGATCCGCGC CGTTTCTCCA ACGAGGATGC CGATGTCGAT CTGACCGCCG GCCTCTCCGC GCGTGTGCTC GAACAGAACA TGACCGTTTC CGCATCGGAA TCGATCACGC CGCAGACCGA GGAATTCGCC GACGACATCC TGCCGGTCCG CGTCGACACG CCGATCGCTA AGGCGCTGAC GGATTCAGGC TATCCGCAGC AATATGCCGA TGGCATTGCC GGTTTCATCT CGCAGCAACT GGGTTCCACC GATCTCGACA AGGGCGACGT GCTGCGCATC GGCATCATTC AGAAGGGCGA ACAGGCAAAG ATCATCCGGG CCAGCGTCTA TCGCAGCACC CGCCATCTCG TCACCGTCGC CGTCGACGAC AAGGGCAGAT ACGTGCCCGG CAGCGAGCCG CCGATGCTGG ATGCCATCGC CACCGCCTTC GATGACAATT CCTTCGCGCC GCCGCCGGGC CAGAACCTGC CGCGCGTCTA TGACGGCATC TATCGCGCTG CCCTTTCCTA CGGCATGACC AAGGACATGA CGGCGCTGAT CATCAAGCTG CTCGCCAGCA ATGTCGATTT CCAGGCGCAG TTGAGGCCGA CCGACAGTCT CGAAGCCTTC TTCTCCGTCG CCGACAGCGC CGGCCAGGCG ACCGAGGATT CCGAGCTGCT CTACGTCAAC GCCCGTTTCG GCGATACGCA GACACGCTTC TACCGCTTCC AGGATCCGGA GGACGGCACG GTCGATTATT TCGACGAGAA CGGTAAAAGC ATCCGCCAGT TCCTGCTGCG CAACCCGGTC CCGAACGGCA TTTTCAAATC GGGCTTCGGC ATGCGCCGTC ACCCGATCCT CGGTTTCGCC CGCATGCACA CCGGCGTCGA TTGGGCGGCA CCCCGCGGCA CCGCAATCAT CGCCGCCGGC AACGGCACTG TGGAAAAAGC CGGCTGGGAT TCTGGCGGTT ATGGCAACCA GACGATTATC CGGCACGCCA ACGGCTATGA ATCCTCCTAC AATCACCAGA GCGCCATCGC CAAAGGCGTT ACCCCCGGCG CCAAGATCCG CCAGGGCCAG GTGATCGGCT GGGTCGGCAC GACGGGCGAG TCCACCGGGC CGCACCTGCA TTACGAGCTG ATCGTCAACG GCACAAAGGT CGATCCCCTG 
CGCATCCGCC TGCCGGGCGG CAAATCGCTG CAAGGCGAGG CGCTGGCGAA ATTCGAGGAC GAGCGCAAGC GTATCGATAC GCTGCTGAAC AACCAGACGT CCGACCAGGT GGCGAGCAAA TAA
|
Protein sequence | MIRSLGNEPP LLADGRRAPD RREVSLRWLS GTFLTGITSS VLMGVALFAA LDGRQQLAIP AEAYASAAAD AHEDTTVVRG GRLIAPAIAA KPSDRAIMEV STVVHDGEKE VVRRQPFAHV KMTLAANHVA TEDYPDFDPL AIFSADEPQP APQSRTGAIY GSDVESEVSL KTVSFPTGKT SMKMASGLSL EEVEENVRSN GSVLTDGNTQ LAALYYVDPR RFSNEDADVD LTAGLSARVL EQNMTVSASE SITPQTEEFA DDILPVRVDT PIAKALTDSG YPQQYADGIA GFISQQLGST DLDKGDVLRI GIIQKGEQAK IIRASVYRST RHLVTVAVDD KGRYVPGSEP PMLDAIATAF DDNSFAPPPG QNLPRVYDGI YRAALSYGMT KDMTALIIKL LASNVDFQAQ LRPTDSLEAF FSVADSAGQA TEDSELLYVN ARFGDTQTRF YRFQDPEDGT VDYFDENGKS IRQFLLRNPV PNGIFKSGFG MRRHPILGFA RMHTGVDWAA PRGTAIIAAG NGTVEKAGWD SGGYGNQTII RHANGYESSY NHQSAIAKGV TPGAKIRQGQ VIGWVGTTGE STGPHLHYEL IVNGTKVDPL RIRLPGGKSL QGEALAKFED ERKRIDTLLN NQTSDQVASK
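Note: the protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal Python illustration, not the pipeline that produced this record. It assumes the sense-codon assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which codons may act as start codons), and it does not treat alternative start codons specially. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGATCCGCT CGTTGGGCAA CGAGCCTCCG"))  # MIRSLGNEPP
```

The output matches the first ten residues of the protein sequence in this record.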