Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3812 |
Symbol | |
ID | 6982575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3945413 |
End bp | 3946954 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643398534 |
Product | peptidase M48 Ste24p |
Protein accession | YP_002283300 |
Protein GI | 209551383 |
COG category | [R] General function prediction only |
COG ID | [COG4784] Putative Zn-dependent protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.685027 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAGAA ACAGACTGGA GAGCTTGACG ACGTGGAAAT CGCCCGCGCT TTCCAGTGAT GCCCTCTTCG CGCCCCGGCG CTTCGCGCGT CGCCTGATGC TGCTTTCGGC CGTCGCACTG ACGCTTAATG GCTGTCAAAC GCTGATCGAG CAATCCTATC AGCCGAGCGT CTCGCCTTCT TCCAATCCGC AGATCGTCGA CGAGGTGCAG AAGAACGACC CGCGCGCGGC GATGGGCGCC CGCGAACATC CGCGCATCGT CGCAAGCTAC GGCGGCGAAT ACAAGGACGC CAAGACCGAG CGCCTCGTCG CCCGCATCGC CGGCGCGCTG ACGGCGGTGT CTGAGAATCC GAGCCAGTCC TACCGCATCA CCATCCTGAA TTCACCGGCG ATCAACGCCT TCGCGCTGCC GGGTGGTTAT CTCTACGTCA CCCGCGGCCT GCTCGCCCTT GCCAACGACG CCTCCGAAGT CGCCGCCGTG CTGTCGCACG AGATGGGCCA TGTGACGGCA AACCACGGCA TCGAGCGGCA GAAGCGCGAA GAGGCTGAGG TCATCGCCAG CCGCGTCGTC GCCGAGGTCC TTTCCAGCGA TATCGCCGGC AAGCAGGCAC TGGCCCGCGG CAAACTGCGC CTTGCCGCCT TCTCCCGCCA GCAGGAACTA CAGGCCGATG TCATCGGCGT ACGGATGCTC GGCGAAGCCG GCTACGACCC CTATTCCGCT GCCCGCTTTC TCGATTCCAT GGCGGCGTAC AGCCGCTTCA TGTCGGTCGA TCCCGAAGCC GACCAGAGCC TCGACTTCCT GTCGAGCCAT CCGAATTCCG CCCAGCGCAT CGAGCTTGCC CGCACCCATG CGCGGGCCTT CGGCCAGGAA GGGTCAGTCG GCGACAAGGG CCGCGACTAT TATCTCGACG GCATAGACGG TCTGCTCTAC GGCGATAGCC CTGAGGAAGG CTATGTGCGC GGCCAGACCT TCCTGCATGG AGGCCTCGGC ATCCGCTTCG ACGTGCCGCC GGACTTCCAC ATCGACAACA AGGTCGAAGC CGTGATGGCC ACGGGCCCGA ACGACATCGC CGTCCGCTTC GACGGCGTCG CCGACAATCA GAACCAGAGC CTCACCAACT ATATTTCCAG CGGCTGGGTG ACCGGCCTCG ACCCGTCGAC CATCCAGCCG GTTACCATCA ACGGCATGGA AGCAGCCACA GCACGCGCAA GTGCGGATCG CTGGGATTTC GATGTCACCG TGATCCGCAA CAATTCGCAG ATCTTCCGTT TCCTGACCGC CGTGCCGAAA GGCAGCGACG CCCTCGAGCC GACCGCCAAT GTCCTGCGCG CAAGTTTCCG GCGCATGACG CCGGCCGAGG CAGCCTCCCT GAAGCCGCTG CGCATCCGTG TCGTCACCGT CCGGCCGGGT GAAAACATCT CGACGCTCGC CGCCCGCATG ATGGGCACCG ACCGCAAGCT CGATCTCTTC AAACTCATCA ATGCCTTGCC CACGGGTGCA GCCGTTTCAC CCGGCGATCG CGTCAAGATC ATCGCTGAAT AA
|
Protein sequence | MRRNRLESLT TWKSPALSSD ALFAPRRFAR RLMLLSAVAL TLNGCQTLIE QSYQPSVSPS SNPQIVDEVQ KNDPRAAMGA REHPRIVASY GGEYKDAKTE RLVARIAGAL TAVSENPSQS YRITILNSPA INAFALPGGY LYVTRGLLAL ANDASEVAAV LSHEMGHVTA NHGIERQKRE EAEVIASRVV AEVLSSDIAG KQALARGKLR LAAFSRQQEL QADVIGVRML GEAGYDPYSA ARFLDSMAAY SRFMSVDPEA DQSLDFLSSH PNSAQRIELA RTHARAFGQE GSVGDKGRDY YLDGIDGLLY GDSPEEGYVR GQTFLHGGLG IRFDVPPDFH IDNKVEAVMA TGPNDIAVRF DGVADNQNQS LTNYISSGWV TGLDPSTIQP VTINGMEAAT ARASADRWDF DVTVIRNNSQ IFRFLTAVPK GSDALEPTAN VLRASFRRMT PAEAASLKPL RIRVVTVRPG ENISTLAARM MGTDRKLDLF KLINALPTGA AVSPGDRVKI IAE
|
| |