Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2754 |
Symbol | |
ID | 5831725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 3085993 |
End bp | 3087870 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641368554 |
Product | pepF/M3 family oligoendopeptidase |
Protein accession | YP_001640216 |
Protein GI | 163852173 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02290] oligoendopeptidase, pepF/M3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAGCC GAGCCGTTTT CGCCGCCCTG TCGCCGGACC TGCCGAGAGA GGCTGAGGAA CTCGCGGCGG TGCACGGCGC GATCCAGGCG GTCGATCTCG GCGTGCTGCC GGAATGGGAC CTCACCGACC TCTATCCGAG CATGGACGCG CCGGCCTTTC GCGAGGATCT CGACCGGGCC GAGGCCGAGA GCCGAGCCTT CGCCGAACGT TATGCGGGCC GGATCGCCGA GATCGCGGCG GGGCCGGATG CCTCGTCCGT CCTCGGCGAG GCGGTGCAGA CCTTCGAGCG GATCGAGGAT CTGATGGGGC GGCTGATGTC CTATGCCGGT CTCGTCTATT CCGGCGACAC GACGGACGAG ACCCGCGCCA AGTTCTACGG CGACACCCGC GAGCGGCTGA CCACGGCCTC GGGTGACCTG CTGTTCTTCG GCCTGGAATT GAACCGCGTC GAGGATGCGG TGCTCGACGC GGCGATGGCG GACGGGCCGC TGGCCCATTA CCGCCCCTGG ATCGAGGATC TTCGGCGCGA GAAGCCGCAC CAGCTCGACG ACCGGACGGA GAAGCTGTTC CTCGACAAGT CGGTGACCTC GAACGCCGCC TGGGACCGGC TGTTCAATGA GACGATCGCT TCGCTCCGCT TCTCGGTTCA GGGCGAGCGC CTGACGCTGG AGCCGACGCT CAACAAGCTC GTCGACTCGG ACGGGGCGGT GCGCCAGGAG GCGGCGCAGG CGCTCGGCGA GACCCTGCGG GCGAATCTGC GCATCTTCAC GCTGATCACC AACACGCTGG CCAAGGACAA GGAGATCTCC GACCGCTGGC GCGGCTTCAA GGATGTGGCC GATGCCCGCC ACCTCTCGAA TCGTGTTGAG CCGGAGGTCG TGGCCGCCAT GGTCGAGGCC GTGCGTGCGG CCTATCCACG GCTCTCGCAC CGCTATTACC GGCTCAAGGC CAAGTGGTTC GGCGTCGAGG CGCTGCCCTA TTGGGACCGC AACGCTCCGC TGCCGAAGGT CGAGCAGCGC ACGATTCCCT GGGCCCAGGC CCGTGACACG GTGCTGGAGG CCTACGACGC CTTCTCGCCG GATATGGCCG GCATCGCCAA AAAGTTCTTC GACGGCGGCT GGATCGATGC GCCGACCCGA CCGGGCAAGG CACCCGGCGC CTTCGCGCAT CCGACCGTGC CCTCGGCTCA CCCCTACGTG CTGGTGAATT ACCAGGGCAA GCCGCGCGAC GTAATGACTC TCGCGCACGA ACTCGGCCAC GGCGTGCATC AGGTGCTCGC CGCCCCCAAC GGCGCGCTGA TGGCGCCGAC CCCGCTGACG CTGGCCGAGA CCGCGAGCGT GTTCGGCGAG ATGCTGACCT TCCAGCGGCT GCTGGGCCAG ACCACGGACC CGACCCAGCG CCGGGCGATG CTCGCGGCCA AGGTCGAGGA CATGATCAAC ACGGTGGTGC GCCAGATCGC GTTCTATTCG TTCGAGCGGA AGGTCCACCT CGCCCGCGCC AAGGGCGAGC TGACAGCCGA GCAGATCAAC GAACTTTGGA TGTCGGTGCA GTCCGAGAGC CTCGGGCCGG CGATCACCCT CGATAAGGGC TACGAGCCGT TCTGGGCCTA CATCCCGCAC TTCATCCACT CGCCGTTCTA CGTCTACGCC TACGCGTTCG GCGATTGTCT CGTGAACTCG CTCTACGGCG TCTACGCTCG CGCCGAGCCC GGCTTCGTCG AGCGCTACTT CGCTCTGCTG TCAGCCGGCG GCTCGAAGCC CTACGGCGAG TTGCTGAAGC CGTTCGGACT CGACGCCAGC GATCCCGGCT TCTGGCAGAT CGGGCTGGGG ATGATCGAGG GGATGATCGC GGAGCTGGAG GCGATGGAAC ACGATTAA
|
Protein sequence | MSSRAVFAAL SPDLPREAEE LAAVHGAIQA VDLGVLPEWD LTDLYPSMDA PAFREDLDRA EAESRAFAER YAGRIAEIAA GPDASSVLGE AVQTFERIED LMGRLMSYAG LVYSGDTTDE TRAKFYGDTR ERLTTASGDL LFFGLELNRV EDAVLDAAMA DGPLAHYRPW IEDLRREKPH QLDDRTEKLF LDKSVTSNAA WDRLFNETIA SLRFSVQGER LTLEPTLNKL VDSDGAVRQE AAQALGETLR ANLRIFTLIT NTLAKDKEIS DRWRGFKDVA DARHLSNRVE PEVVAAMVEA VRAAYPRLSH RYYRLKAKWF GVEALPYWDR NAPLPKVEQR TIPWAQARDT VLEAYDAFSP DMAGIAKKFF DGGWIDAPTR PGKAPGAFAH PTVPSAHPYV LVNYQGKPRD VMTLAHELGH GVHQVLAAPN GALMAPTPLT LAETASVFGE MLTFQRLLGQ TTDPTQRRAM LAAKVEDMIN TVVRQIAFYS FERKVHLARA KGELTAEQIN ELWMSVQSES LGPAITLDKG YEPFWAYIPH FIHSPFYVYA YAFGDCLVNS LYGVYARAEP GFVERYFALL SAGGSKPYGE LLKPFGLDAS DPGFWQIGLG MIEGMIAELE AMEHD
|
| |