Gene Mext_2754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2754 
Symbol 
ID5831725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3085993 
End bp3087870 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content67% 
IMG OID641368554 
ProductpepF/M3 family oligoendopeptidase 
Protein accessionYP_001640216 
Protein GI163852173 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02290] oligoendopeptidase, pepF/M3 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAGCC GAGCCGTTTT CGCCGCCCTG TCGCCGGACC TGCCGAGAGA GGCTGAGGAA 
CTCGCGGCGG TGCACGGCGC GATCCAGGCG GTCGATCTCG GCGTGCTGCC GGAATGGGAC
CTCACCGACC TCTATCCGAG CATGGACGCG CCGGCCTTTC GCGAGGATCT CGACCGGGCC
GAGGCCGAGA GCCGAGCCTT CGCCGAACGT TATGCGGGCC GGATCGCCGA GATCGCGGCG
GGGCCGGATG CCTCGTCCGT CCTCGGCGAG GCGGTGCAGA CCTTCGAGCG GATCGAGGAT
CTGATGGGGC GGCTGATGTC CTATGCCGGT CTCGTCTATT CCGGCGACAC GACGGACGAG
ACCCGCGCCA AGTTCTACGG CGACACCCGC GAGCGGCTGA CCACGGCCTC GGGTGACCTG
CTGTTCTTCG GCCTGGAATT GAACCGCGTC GAGGATGCGG TGCTCGACGC GGCGATGGCG
GACGGGCCGC TGGCCCATTA CCGCCCCTGG ATCGAGGATC TTCGGCGCGA GAAGCCGCAC
CAGCTCGACG ACCGGACGGA GAAGCTGTTC CTCGACAAGT CGGTGACCTC GAACGCCGCC
TGGGACCGGC TGTTCAATGA GACGATCGCT TCGCTCCGCT TCTCGGTTCA GGGCGAGCGC
CTGACGCTGG AGCCGACGCT CAACAAGCTC GTCGACTCGG ACGGGGCGGT GCGCCAGGAG
GCGGCGCAGG CGCTCGGCGA GACCCTGCGG GCGAATCTGC GCATCTTCAC GCTGATCACC
AACACGCTGG CCAAGGACAA GGAGATCTCC GACCGCTGGC GCGGCTTCAA GGATGTGGCC
GATGCCCGCC ACCTCTCGAA TCGTGTTGAG CCGGAGGTCG TGGCCGCCAT GGTCGAGGCC
GTGCGTGCGG CCTATCCACG GCTCTCGCAC CGCTATTACC GGCTCAAGGC CAAGTGGTTC
GGCGTCGAGG CGCTGCCCTA TTGGGACCGC AACGCTCCGC TGCCGAAGGT CGAGCAGCGC
ACGATTCCCT GGGCCCAGGC CCGTGACACG GTGCTGGAGG CCTACGACGC CTTCTCGCCG
GATATGGCCG GCATCGCCAA AAAGTTCTTC GACGGCGGCT GGATCGATGC GCCGACCCGA
CCGGGCAAGG CACCCGGCGC CTTCGCGCAT CCGACCGTGC CCTCGGCTCA CCCCTACGTG
CTGGTGAATT ACCAGGGCAA GCCGCGCGAC GTAATGACTC TCGCGCACGA ACTCGGCCAC
GGCGTGCATC AGGTGCTCGC CGCCCCCAAC GGCGCGCTGA TGGCGCCGAC CCCGCTGACG
CTGGCCGAGA CCGCGAGCGT GTTCGGCGAG ATGCTGACCT TCCAGCGGCT GCTGGGCCAG
ACCACGGACC CGACCCAGCG CCGGGCGATG CTCGCGGCCA AGGTCGAGGA CATGATCAAC
ACGGTGGTGC GCCAGATCGC GTTCTATTCG TTCGAGCGGA AGGTCCACCT CGCCCGCGCC
AAGGGCGAGC TGACAGCCGA GCAGATCAAC GAACTTTGGA TGTCGGTGCA GTCCGAGAGC
CTCGGGCCGG CGATCACCCT CGATAAGGGC TACGAGCCGT TCTGGGCCTA CATCCCGCAC
TTCATCCACT CGCCGTTCTA CGTCTACGCC TACGCGTTCG GCGATTGTCT CGTGAACTCG
CTCTACGGCG TCTACGCTCG CGCCGAGCCC GGCTTCGTCG AGCGCTACTT CGCTCTGCTG
TCAGCCGGCG GCTCGAAGCC CTACGGCGAG TTGCTGAAGC CGTTCGGACT CGACGCCAGC
GATCCCGGCT TCTGGCAGAT CGGGCTGGGG ATGATCGAGG GGATGATCGC GGAGCTGGAG
GCGATGGAAC ACGATTAA
 
Protein sequence
MSSRAVFAAL SPDLPREAEE LAAVHGAIQA VDLGVLPEWD LTDLYPSMDA PAFREDLDRA 
EAESRAFAER YAGRIAEIAA GPDASSVLGE AVQTFERIED LMGRLMSYAG LVYSGDTTDE
TRAKFYGDTR ERLTTASGDL LFFGLELNRV EDAVLDAAMA DGPLAHYRPW IEDLRREKPH
QLDDRTEKLF LDKSVTSNAA WDRLFNETIA SLRFSVQGER LTLEPTLNKL VDSDGAVRQE
AAQALGETLR ANLRIFTLIT NTLAKDKEIS DRWRGFKDVA DARHLSNRVE PEVVAAMVEA
VRAAYPRLSH RYYRLKAKWF GVEALPYWDR NAPLPKVEQR TIPWAQARDT VLEAYDAFSP
DMAGIAKKFF DGGWIDAPTR PGKAPGAFAH PTVPSAHPYV LVNYQGKPRD VMTLAHELGH
GVHQVLAAPN GALMAPTPLT LAETASVFGE MLTFQRLLGQ TTDPTQRRAM LAAKVEDMIN
TVVRQIAFYS FERKVHLARA KGELTAEQIN ELWMSVQSES LGPAITLDKG YEPFWAYIPH
FIHSPFYVYA YAFGDCLVNS LYGVYARAEP GFVERYFALL SAGGSKPYGE LLKPFGLDAS
DPGFWQIGLG MIEGMIAELE AMEHD