Gene Information |
Locus tag | Rleg_3597 |
Symbol | |
ID | 8014450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3630776 |
End bp | 3632629 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644826162 |
Product | oligoendopeptidase, pepF/M3 family |
Protein accession | YP_002977382 |
Protein GI | 241206286 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02290] oligoendopeptidase, pepF/M3 family |
|
|
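The coordinate, length, and protein-length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 3630776
end_bp = 3632629
gene_length_bp = 1854
protein_length_aa = 617

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not translated.
expected_protein_length = codons - 1

coords_match_length = span == gene_length_bp
frame_is_intact = remainder == 0
protein_matches = expected_protein_length == protein_length_aa

print(span, codons, expected_protein_length)
print(coords_match_length, frame_is_intact, protein_matches)
```

Under these assumptions the three fields agree with one another: the span equals the listed gene length, it divides evenly into codons, and the codon count minus the stop codon equals the listed protein length.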
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.794644 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.398139 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCA AGCTCCCGCA CGCCGGCCTT CTTCTATCGG CAGCAGCGCC AGCTGGGGCC GCCGATCCGG CGCTCGGCGT TCTGCCGGTC TGGAAGCTGC AGGATCTTTA TCCCTCCGCC ACCTCCACCG CCTTCGTCGC CGACATGGAA AAAGCCGGCA AGGCCGCGAT CGCCTTTGAA GAAAAGTGGA AGGGCACACT CACGGAGGCG ACGGCGAAGA CCGGCGCCGA GGGCATCGGC GCGGCCCTGA AGGAATATGA GGCGCTGGAC GACATCATCG GCCGCCTCGG CTCCTTTGCC GGCCTCACTT ATTTCTCCGA TACCACCAAC CCGACAAACG GCAAGCTCTA CGGCGACGTA CAGGCCAAGA TCACCGAATT TTCCGGTCAT CTCCTGTTCT TCGCGCTGGA ACTCAACCGC ATCGACGACG CGGTGATCGA CGCCTGCATG GCGAATGATC CCGCCGCCGG ACATTATCGC CCTTGGCTGC TCGACCTCAG GAAGGACAAG CCCTACCAGC TCGACGATAG GCTGGAACAG CTCTTCCTTG AGAAGTCGAT GACATCAGCC GCAGCCTTCA ACCGCCTCTT CGACGAAACC ATGGCGGAAC TTCGCTACGA GATCGATGGC GAGAAAGTGC CGCTCGAAGT GGCGCTGAAC AAGCTGCAGG AAAAGGATCC GGAAGTGCGC CGCAAGGCAG CCATGGCGCT CGCCGAAACC TTCAAGGCGA ATATCCGCAC CTTTACGCTG ATCACCAACA CGCTTGCCAA GGATAAGGAG ATCGCCGACC GCTGGCGCGG CTTCGAGGAC ATCGCCGACA GCCGACACCT GGCAAACCGC GTCGAGCGCG AGGTCGTCGA TGCGCTGGCC GCAGCCGTCC GCGAAGCCTA TCCCCGCCTT TCGCATCGCT ATTACAAGAT GAAGGCGAAA TGGCTTGGCA TGGAGCAGAT GAATTTCTGG GACCGCAACG CGCCGCTTCC GGAAACCTCC AGCGCCATCA TCTCGTGGCC GGAGGCGAAG GACACCGTGC TATCGGCCTA TGGCAATTTT TCCCCCGAGA TGGCTGATAT CGCCAGGCGG TTCTTCGACG AACAGTGGAT CGATGCTCCG GTTCGTGCCG GCAAGGCGCC CGGCGCCTTC GCGCATCCGA CGGTTCCCTC GGCCCATCCC TATGTGCTCG TCAATTATAT GGGCAAGCCG CGCGATGTAA TGACGCTTGC CCATGAACTC GGGCACGGCG TGCATCAGGT TCTCGCCGGC GCGCAGGGAG CGCTGATGTG CCAGACGCCG CTGACGCTTG CCGAAACCGC TTCCGTCTTC GGCGAGATGC TGACCTTCCG CGCGCTTCTG CAAAAGACCA CCGATACGCG CGAGCGCAAG GCGATGCTCG CCCAGAAGGT CGAGGATATG ATCAACACGG TCGTGCGCCA GATCGCCTTC TACGAATTCG AGCGCAAGCT CCACACCGCT CGTAAAGCTG GCGAACTCAC AGCTGACGAC ATCGGCGAAC TCTGGCTCTC CGTCCAGTCG GAAAGCCTCG GGCCGGCGAT CAGCATTTCT GAAGGGTACG AGACCTATTG GGCCTATATC CCCCATTTCA TCCACTCGCC CTTCTATGTC TACGCCTATG CCTTTGGCGA TTGCCTGGTA AATTCGCTCT ATGCCGTCTA CCAGAAAGCC GAGAAGGGCT TTCAGGAGAA GTATTTCGAA CTGCTGAGGG CCGGCGGCAC CAAGCATCAC TCGGAACTGC TGAAGCCTTT TGGCCTCGAC GCCACCGATC CGTCGTTCTG GAGCCAGGGC 
CTGTCGATGA TCGAAGGGCT GATCGATGAG TTGGAAGCGT TGGATAGGGG CTGA
|
Protein sequence | MKIKLPHAGL LLSAAAPAGA ADPALGVLPV WKLQDLYPSA TSTAFVADME KAGKAAIAFE EKWKGTLTEA TAKTGAEGIG AALKEYEALD DIIGRLGSFA GLTYFSDTTN PTNGKLYGDV QAKITEFSGH LLFFALELNR IDDAVIDACM ANDPAAGHYR PWLLDLRKDK PYQLDDRLEQ LFLEKSMTSA AAFNRLFDET MAELRYEIDG EKVPLEVALN KLQEKDPEVR RKAAMALAET FKANIRTFTL ITNTLAKDKE IADRWRGFED IADSRHLANR VEREVVDALA AAVREAYPRL SHRYYKMKAK WLGMEQMNFW DRNAPLPETS SAIISWPEAK DTVLSAYGNF SPEMADIARR FFDEQWIDAP VRAGKAPGAF AHPTVPSAHP YVLVNYMGKP RDVMTLAHEL GHGVHQVLAG AQGALMCQTP LTLAETASVF GEMLTFRALL QKTTDTRERK AMLAQKVEDM INTVVRQIAF YEFERKLHTA RKAGELTADD IGELWLSVQS ESLGPAISIS EGYETYWAYI PHFIHSPFYV YAYAFGDCLV NSLYAVYQKA EKGFQEKYFE LLRAGGTKHH SELLKPFGLD ATDPSFWSQG LSMIEGLIDE LEALDRG
|
| |
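As a quick sanity check that the gene and protein sequences above correspond, the first 30 bases can be translated and compared with the first 10 residues. This is a minimal sketch rather than a full translator: the codon dictionary is deliberately partial and covers only the codons that occur in this prefix, and the helper name `translate_prefix` is hypothetical. For these particular codons the assignments are the same in the standard code and in translation table 11.

```python
# First 30 bases of the gene sequence and first 10 residues of the
# protein sequence, copied from the Sequence table above.
gene_prefix = "ATGAAAATCA AGCTCCCGCA CGCCGGCCTT".replace(" ", "")
protein_prefix = "MKIKLPHAGL"

# Partial codon table: only the codons present in this prefix.
# A complete translation would need all 64 codons.
CODON_TO_AA = {
    "ATG": "M", "AAA": "K", "ATC": "I", "AAG": "K", "CTC": "L",
    "CCG": "P", "CAC": "H", "GCC": "A", "GGC": "G", "CTT": "L",
}


def translate_prefix(dna: str) -> str:
    """Translate complete codons from the start of dna using CODON_TO_AA."""
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, usable, 3))


translated = translate_prefix(gene_prefix)
print(translated)
print(translated == protein_prefix)
```

The same comparison could be extended to the whole sequence with a complete codon table, for example one taken from an established bioinformatics library.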