Gene Mpe_A1431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1431 
Symbol 
ID4783713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1540264 
End bp1543275 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content70% 
IMG OID640089997 
Productglycine dehydrogenase 
Protein accessionYP_001020628 
Protein GI124266624 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID[TIGR00461] glycine dehydrogenase (decarboxylating) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.961766 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAGGC CCCTCGCCCT GCCCCTCGCC GCACTCGAAC ACACCACCGA CTTCGCCGGC 
CGTCACATCG GCATCGACCC CGCCGACGAA CAGCACATGC TGTCGGTGAT CGGCGCGGCT
TCGCGCCAGG CCATGATCGA GGCCATCGTC CCGCGCAGCA TCGCCCGCGC GCGCCCGATG
GTGCTGCCCG AACCCGTCGG CGAGGTCCAG GCGCTGGCCG AGCTGAAGGC GATCGCCGCG
AAGAACCGCG TGTACCGCAG CTACATCGGG CAGGGCTACC ACGGAACGCA CACGCCGGGC
GTCATCCTGC GCAACATCCT CGAGAACCCG GCCTGGTACA CCGCCTACAC GCCCTACCAG
GCCGAGATCT CGCAGGGTCG GCTCGAGGCG CTGGTGAATT TCCAGACCAT GGTGTGCGAC
CTGACCGGCA TGGCGATCGC CAACGCCTCG ATGCTCGACG AGGCCACCGC CGCCGCCGAG
GCCATGACAT TGGCCCGACG CAGCGTGAAG AGCAAGAGCG CCACGGTCAT CGTGGCTTCG
GATTGCCACC CGCAGACCAT CGAGGTGATC CGCACGCGCG CGCGGCCGCT GGGCATCGAG
GTGCTGGTGG GCCTGGTGCC GGAGCTGATG GCGCAGCACG AGTACTTCGC CGTGATCGCA
CAGTACCCGG CGACCGGCGG CATCGTGCAC GACATGAAGC CCTATGTGGA TGCGGCGCAT
GCCAGCGGTG CGGCCTTCGT CGTCGCCGCC GACCTGCTGG CGCTGACGCT GCTCACGCCG
CCCGGTGAAT GGGGTGCCGA TATCGCCATC GGCAGCACCC AGCGCTTCGG CATGCCGATG
GGCGCCGGCG GCCCGCACGC GGCCTTCCTG GCCTGCCGTG ACGAGTTCAA GCGCTCGATG
CCGGGACGCC TGGCCGGCGT GAGCGTCGAC GCGCACGGCC GGCCCGCCTA CCGGCTGGCG
TTGCAGACGC GCGAGCAGCA CATCCGCCGC GAGAAGGCCA CGTCGAACAT CTGCACCGCC
CAGGTACTGC CGGCGGTGGT GGCGAGCATG TATGCGGTCT ACCACGGACC TCAGGGGCTC
AAGCGCATCG CGCAGCGCGT GGCGAGCTAC ACCGCCATCC TGGCGCACGG CCTGCGCGCG
ATGGGCTTCG AGCTCGCCGG TGACAGCGCC TTCGACACGC TGTGCGTCGA GACCGGCATC
GAAACCAATG CCATCGCCCG CCGCGCCCGC GATGCCGGAG CCAACCTGCG ACGCCTGTCG
GCCACGCAGC TCGGCATCAC GCTGGACGAG ACCACCACGC GCGAAGATCT CACCGCGCTG
TGGTCATGGT TCGCGCGCTC CGAGGGCATG GTGCCCAACG TCGCTGCGTT CGCGCACGGC
GCCGACCCGC TGATCCCCAG CGAGCTGCGC CGCACCAGCG CCTACCTGAC CCACCCGGTC
TTCAACAGCC ACCACAGCGA GACCGAGATG CTGCGCTACC TGCGCAGCCT GGCCGACAAG
GACCTCGCGC TCGACCGCAC GATGATCCCG CTGGGTTCGT GCACCATGAA GCTCAACGCG
ACCAGCGAGA TGATCCCGAT CACCTGGCCC GAGTTCGCGC ACATCCACCC GTTCGCCCCG
GCCGAGCAGC TGGCCGGCTA TGCCGAACTC GACGCCCAGC TGCGCCAGTG GCTGTGCGAG
GCCACCGGCT ACGCCGGCAT CAGCCTGCAG CCCAACGCCG GCTCGCAGGG CGAATATGCC
GGCCTGCTGG CGATCCAGGC CTGGCATGCC TCGCGCGGCG ACGCGCAGCG CGACGTCTGC
CTGATCCCCG AGTCGGCGCA CGGCACCAAC CCGGCGAGCG CCCAGATGGC CGGCATGCGC
GTGGTGGTGA CGAAGTGCGA CGCCGAGGGC AACGTCGACC TCGACGACCT GCGCGCCAAG
TGCGAGCAGC ACAGCTCGCA ACTGGCCGCC GTGATGGTCA CCTACCCGTC GACCTACGGC
GTGTTCGACC CGCACATCAA GGCGCTGTGC GCACTGGTCC ACCAGCACGG CGGCCGGGTC
TACGTGGACG GCGCGAACAT GAACGCGCTG GTCGGCGTGG CCGCGCCGGG CGAGTTCGGC
GGCGACGTGA GCCACCTCAA CCTTCACAAG ACCTTCTGCA TTCCGCACGG CGGCGGCGGC
CCGGGCGTGG GCCCGGTGTG CGTGGTCGAG GACCTGGTGC CCTTCCTGCC GACGCACCGC
ACGGGCGCCT CGGCGAGCTC CGTGGATGCC TACGTCGGCA GCGCGGTCGA GCCGACGCCG
GGCCGCCCCA AGGTGGCGAG CGCCCTCTCG GGGGGCAGCG AAGTGCCCGC AGGGCCGAGC
GTGGGGGCCG TCAGTGCTTC GCCGCTGGGC AACGCGGCGG TGCTGCCGAT CAGCTGGATG
TACGTCCGCA TGATGGGTGC AGCCGGCCTG ACCGCTGCGA CCGAGACCGC GATCCTGAGC
GCGAACTACA TCGCCGCGCG GCTGGCCGAC CACTACGACA TCCACTTCAG CGGCGAGGTC
GACGGGCTCA AGGGCGGCGG TGTCGCCCAC GAGTGCATCC TCGACCTGCG CCCGCTGAAG
GACAGCAGCG GCGTCAGCGC CGAGGACGTG GCCAAGCGCC TGATCGACTA CGGCTTCCAC
GCGCCGACGC TGAGCTTCCC GGTCGCCGGC ACGCTGATGG TCGAGCCGAC CGAGAGCGAG
TCGCTGCACG AGCTCGACCG CTTCTGCGAC GCACTGATCG CGATCCGCGC CGAGATCGCC
CGGGTCGAGC AAGGCCACTG GCCGCAGGAC GACAACCCGC TCAAGCACGC GCCGCACACC
GCCGAGGCGC TGCTGAAGGC CGACTGGCCG CACCCCTACT CGCGTGAGGA GGCCGCCTAC
CCGGTGAGCA GCCTGCGGCG TCAGAAGTAC TGGGCACCGG TCGGTCGCGT CGACAACGTG
CACGGCGACC GCAACCTGTT CTGCAGCTGC GTGCCGCTGA GCGCCTACGC CGAAGCCGAC
AAGCAGGCAT GA
 
Protein sequence
MLRPLALPLA ALEHTTDFAG RHIGIDPADE QHMLSVIGAA SRQAMIEAIV PRSIARARPM 
VLPEPVGEVQ ALAELKAIAA KNRVYRSYIG QGYHGTHTPG VILRNILENP AWYTAYTPYQ
AEISQGRLEA LVNFQTMVCD LTGMAIANAS MLDEATAAAE AMTLARRSVK SKSATVIVAS
DCHPQTIEVI RTRARPLGIE VLVGLVPELM AQHEYFAVIA QYPATGGIVH DMKPYVDAAH
ASGAAFVVAA DLLALTLLTP PGEWGADIAI GSTQRFGMPM GAGGPHAAFL ACRDEFKRSM
PGRLAGVSVD AHGRPAYRLA LQTREQHIRR EKATSNICTA QVLPAVVASM YAVYHGPQGL
KRIAQRVASY TAILAHGLRA MGFELAGDSA FDTLCVETGI ETNAIARRAR DAGANLRRLS
ATQLGITLDE TTTREDLTAL WSWFARSEGM VPNVAAFAHG ADPLIPSELR RTSAYLTHPV
FNSHHSETEM LRYLRSLADK DLALDRTMIP LGSCTMKLNA TSEMIPITWP EFAHIHPFAP
AEQLAGYAEL DAQLRQWLCE ATGYAGISLQ PNAGSQGEYA GLLAIQAWHA SRGDAQRDVC
LIPESAHGTN PASAQMAGMR VVVTKCDAEG NVDLDDLRAK CEQHSSQLAA VMVTYPSTYG
VFDPHIKALC ALVHQHGGRV YVDGANMNAL VGVAAPGEFG GDVSHLNLHK TFCIPHGGGG
PGVGPVCVVE DLVPFLPTHR TGASASSVDA YVGSAVEPTP GRPKVASALS GGSEVPAGPS
VGAVSASPLG NAAVLPISWM YVRMMGAAGL TAATETAILS ANYIAARLAD HYDIHFSGEV
DGLKGGGVAH ECILDLRPLK DSSGVSAEDV AKRLIDYGFH APTLSFPVAG TLMVEPTESE
SLHELDRFCD ALIAIRAEIA RVEQGHWPQD DNPLKHAPHT AEALLKADWP HPYSREEAAY
PVSSLRRQKY WAPVGRVDNV HGDRNLFCSC VPLSAYAEAD KQA