Gene Mpe_A2968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2968 
Symbol 
ID4783572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3150204 
End bp3153893 
Gene Length3690 bp 
Protein Length1229 aa 
Translation table11 
GC content73% 
IMG OID640091539 
Product5-oxoprolinase (ATP-hydrolyzing) 
Protein accessionYP_001022156 
Protein GI124268152 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit
[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCAA GACGGTGGCA GTTCTGGATC GACCGGGGGG GCACCTTCAC CGACGTGATC 
GGCCGCGCAC CCGACGGTCG CCTGCACACC GCCAAGCTGC TCAGCGAGAA CCCCGGGCAA
TACGACGACG CGGTGGTCGA GGGTCTCCGC CGCCTGCTGG GCCTGCCGGC CGGCGCGGCC
ATCCCCGTCG CGCAGGTCGA GTGCGTGAAG ATGGGCACCA CGGTGGCGAC CAATGCGCTG
CTGGAGCGCA AGGGCGAACC GACGCTGCTG GTCACGACCC GCGGTTTCCG TGACGCACTG
CGCATCGCCT ACCAGGCGCG GCCGCGGCTG TTCGACCGCC GCATCGTGCT GCCCGAACTG
CTGTACCAGC GCGTGGTCGA GGCGGCCGAG CGCATGGGCG CGCACGGCGA GGTGGTCGAG
CCGCTCGACG AGGCCGCGCT GCGCGAGCGC CTGTGGGCGG CCCACGACGC CGGCCTGCGT
GCCTGCGCCA TCGTCTTCCT GCACGGCTAC CGCTACCCGC AGCACGAGGC GGCGGCGGCG
CGCATCGCGC GCGAGCTGGG TTTCACGCAG GTGTCGGTGT CGCACGAGGT CAGCCCGCTG
ATGAAGTTCG TGTCGCGTGG CGACACCACC GTGGTCGACG CCTACCTGTC GCCGATCCTG
CGCCGCTATG TCGAGCGCGT GGCCGCGCAG ATGCCCGGCG TGCCGCTGTA CTTCATGCAG
AGCTCGGGCG GGTTGGCGCA GGCCGAGAGC TTCCAGGGCA AGGACGCGGT GCTGTCCGGT
CCGGCCGGCG GCATCGTCGG CATGGTGCGC ACAGCACTCG AAGCCGGCCA CACGCGCGTG
ATCGGCTTCG ACATGGGCGG CACCTCGACC GACGTGTCGC ATTACGCAGG CGAGTTCGAG
CGCGCCTTCG AGACCCAGGT GGCCGGCGTG CGGCTGCGGG CGCCGATGAT GAGCATTCAC
ACGGTGGCCG CGGGCGGCGG CTCGATCGTG CGCTTCGACG GCGGCCGGCT GCGCGTCGGG
CCTGAGTCGG CCGGCGCCGA TCCCGGCCCG GCCTGCTACC GCCGCGGTGG GCCGTTGACC
ACCACCGACG CGAACCTGCT GCTCGGGCGC ATCCAGCCCG CATATTTCCC CGCGGTGTTC
GGGCCGCGCG GCGACGAGCC GCTCGACGCC GAGGGCGTGG TGCAGCGCTT CGGCGCGCTG
GCGCAGCAGC TCGCCGTGGC GACCGGCCGT GCCACGCTGC CGGAGGACGT GGCCGCCGGC
GCGCTGCAGA TCGCTGTGGC CAACATGGCC AATGCGATCA AGCGCATCTC GGTCGCGCGC
GGCCACGACG TGAGCGGCTA CACGCTGCAG TGTTTCGGCG GCGCCGGCGG GCAGCACGCC
TGCGCGGTCG CCGACGCACT GGGCATGACG CGCGTCTTCA TCCACCCGCT GGCCGGTGTG
CTGTCGGCCT ACGGCATGGG TCTGGCCGAC CAGACCGCGA TGCGCGAGGC GGCGGTCGAG
CGCCGCCTCG ATGCGGCAGG CCTGGCCGCC GCCGGCGAAC GGCTCGATGC GCTGGCGGCC
GAGGCGGAGG CCGCGCTGGC GGCGCAGGGC GTGGCTGCGT CGCGCATCGA GGTGCTGCGC
CGCGTGCATG TGCGCTACGA AGGCACGGAC ACGGCGCTGG TGCTGGCCGA CGGCGACGAG
GCGGCGCTGC GCGCGCGCTT CGACGCTGCC TACCGCCAGC GCTATGCCTT CCTGATGGCG
GGCCGGGCGC TGGTGATCGA GGCGGTGTCG GTGGAGGCGG TGGGTGCCGG CGAGCCGCTG
CCGACGCCTG CTGCGGCCGA AGGTGCCGAT CACATTGCGG CGCCGCTGGC CACGGTGCGC
CTGCACGGCG CGACCGACGG CGAGGCCGGC GCCGCCTGGC AGGACGCCGG CCTGTACCGG
CGCGAAGCGC TGCAGCCCGG CGCACGCATC GACGGGCCGG CCGTCATCGC CGAGCGCAAC
GCGACGACGG TGGTCGAGCC GGGCTGGCAG GCCCGCGTGA CGGCGCAGGG CGCGCTGGAG
CTGGCGCGCG TGCGACCGCG CGCGCTGCGC ACCGCGCTCG GCACCGCGGT GGACCCGGTG
CGCCTGGAGG TCTTCAACAA CCTCTTCATG AACATCGCCG AGCAGATGGG CCTGCGGCTG
CAGAACACCG CGCACTCGGT GAACATCAAG GAGCGGCTCG ACTTCTCCTG CGCGCTGTTC
GACGCTGCCG GCGAGCTGAT CGCCAATGCA CCGCACATGC CGGTGCACCT GGGTTCGATG
AGCGAGTCGA TCAAGACCGT GATCGCCCGC AACCCTCGGC TGCTGCCGGG CGACGTGTTC
GTGCTCAACG ACCCCTACCA CGGCGGCACG CACCTGCCGG ACATCACGGT GGTGACGCCG
GTGTTCCTGG CCCCTCACTC CAGCCGTCTC CCGGAGGGCG AGGGAGAAAG GCCGCTGTTC
TACGTGGCTT CGCGCGGCCA CCATGCCGAT GTCGGCGGCA TCACGCCGGG GTCGATGCCG
CCGTTCTCGC GCCGCATCGA CGACGAGGGC GTGCTGTTCG ACAATTTCCG GCTCGTCGAG
GGCGGTGCCA CGCCGCGGCT GCGCGAGGCC GAACTGCTGG CCGTGTTGGG CGCCGGGCCG
CACCCGGCGC GCAATCCGGC GCAGAACCTC GCCGACCTGC GGGCGCAGAT CGCCGCCAAC
GAGAAGGGCG CCCAGGAGCT GCGCGCGCTG GTGGCGCAGG TCGGCCTGGA GACGGTGCAG
GCCTACATGC AGCACGTGCA GGACAACGCC GAGGAGAGCG TGCGCCGCGT CGTCACGGCG
CTGGCCGCGA CGATCGGCGA CGGCCGCTAC ACGCTGCCGC TCGACAACGG CGCGCAGATC
GCGGTGCAGG TGAGCGTCGA TGCCACCGCG CGCAGCGCCT GCATCGACTT CAGCGGCAGC
AGCCCCCAGC AGGACGACGG CAACTTCAAC GCACCGAAGT CGGTCACGAT GGCGGCGGTG
CTGTACGTCT TCCGCACGCT GGTGGGCGAC GACATCCCGC TCAACGCCGG CTGCCTGAAG
CCGCTGCGGG TGGTGGTGCC CGAAGGCTCG CTGCTGAACC CGCGGCCGCC GGCGGCGGTG
GTGGCCGGCA ACGTCGAGAC CTCGATGTGC GTGACCAATG CGCTCTACGG CGCGCTCGGC
GTGATGGCGG CCAGCCAGTG CACGATGAAC AACTTCACCT TCGGCAACGA GCGCCACCAG
TACTACGAGA CGGTCGCCGG CGGCAGCGGC GCCGGCCCCG ACTTCGACGG CACGGCCGTC
GTGCAGACCC ACATGACCAA CTCCCGGCTC ACCGACCCGG AGGTGCTGGA GTTCCGCTTC
CCGGTGCGGC TGGACAGCTA CGCCATCCGC CACGGTTCGG GCGGTGCCGG CCGTCACCGG
GGCGGCGACG GCGGCGTGCG GCGCCTGCGC TTCCTGGAGC CGATGACGGC CAGCATCCTC
AGCAACGGCC GCCGCGTGCC GGCCTTCGGG CTGGCCGGCG GCGAGGCCGG CGCGCTGGGG
ATCAACCGGG TCGAGCGCGC GCCCGGCACG GACGGCCGGC GTGGCGCGAT CGAGGAGCTC
GGGCCGCTGG GATCGGTGGC GATGGAGCCG GGCGACGTGT TCGTGATCGA GACGCCCGGC
GGCGGCGGCT ACGGCTCGCC CGGGCGCTGA
 
Protein sequence
MDARRWQFWI DRGGTFTDVI GRAPDGRLHT AKLLSENPGQ YDDAVVEGLR RLLGLPAGAA 
IPVAQVECVK MGTTVATNAL LERKGEPTLL VTTRGFRDAL RIAYQARPRL FDRRIVLPEL
LYQRVVEAAE RMGAHGEVVE PLDEAALRER LWAAHDAGLR ACAIVFLHGY RYPQHEAAAA
RIARELGFTQ VSVSHEVSPL MKFVSRGDTT VVDAYLSPIL RRYVERVAAQ MPGVPLYFMQ
SSGGLAQAES FQGKDAVLSG PAGGIVGMVR TALEAGHTRV IGFDMGGTST DVSHYAGEFE
RAFETQVAGV RLRAPMMSIH TVAAGGGSIV RFDGGRLRVG PESAGADPGP ACYRRGGPLT
TTDANLLLGR IQPAYFPAVF GPRGDEPLDA EGVVQRFGAL AQQLAVATGR ATLPEDVAAG
ALQIAVANMA NAIKRISVAR GHDVSGYTLQ CFGGAGGQHA CAVADALGMT RVFIHPLAGV
LSAYGMGLAD QTAMREAAVE RRLDAAGLAA AGERLDALAA EAEAALAAQG VAASRIEVLR
RVHVRYEGTD TALVLADGDE AALRARFDAA YRQRYAFLMA GRALVIEAVS VEAVGAGEPL
PTPAAAEGAD HIAAPLATVR LHGATDGEAG AAWQDAGLYR REALQPGARI DGPAVIAERN
ATTVVEPGWQ ARVTAQGALE LARVRPRALR TALGTAVDPV RLEVFNNLFM NIAEQMGLRL
QNTAHSVNIK ERLDFSCALF DAAGELIANA PHMPVHLGSM SESIKTVIAR NPRLLPGDVF
VLNDPYHGGT HLPDITVVTP VFLAPHSSRL PEGEGERPLF YVASRGHHAD VGGITPGSMP
PFSRRIDDEG VLFDNFRLVE GGATPRLREA ELLAVLGAGP HPARNPAQNL ADLRAQIAAN
EKGAQELRAL VAQVGLETVQ AYMQHVQDNA EESVRRVVTA LAATIGDGRY TLPLDNGAQI
AVQVSVDATA RSACIDFSGS SPQQDDGNFN APKSVTMAAV LYVFRTLVGD DIPLNAGCLK
PLRVVVPEGS LLNPRPPAAV VAGNVETSMC VTNALYGALG VMAASQCTMN NFTFGNERHQ
YYETVAGGSG AGPDFDGTAV VQTHMTNSRL TDPEVLEFRF PVRLDSYAIR HGSGGAGRHR
GGDGGVRRLR FLEPMTASIL SNGRRVPAFG LAGGEAGALG INRVERAPGT DGRRGAIEEL
GPLGSVAMEP GDVFVIETPG GGGYGSPGR