Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1431 |
Symbol | |
ID | 4783713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 1540264 |
End bp | 1543275 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640089997 |
Product | glycine dehydrogenase |
Protein accession | YP_001020628 |
Protein GI | 124266624 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | [TIGR00461] glycine dehydrogenase (decarboxylating) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.961766 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCAGGC CCCTCGCCCT GCCCCTCGCC GCACTCGAAC ACACCACCGA CTTCGCCGGC CGTCACATCG GCATCGACCC CGCCGACGAA CAGCACATGC TGTCGGTGAT CGGCGCGGCT TCGCGCCAGG CCATGATCGA GGCCATCGTC CCGCGCAGCA TCGCCCGCGC GCGCCCGATG GTGCTGCCCG AACCCGTCGG CGAGGTCCAG GCGCTGGCCG AGCTGAAGGC GATCGCCGCG AAGAACCGCG TGTACCGCAG CTACATCGGG CAGGGCTACC ACGGAACGCA CACGCCGGGC GTCATCCTGC GCAACATCCT CGAGAACCCG GCCTGGTACA CCGCCTACAC GCCCTACCAG GCCGAGATCT CGCAGGGTCG GCTCGAGGCG CTGGTGAATT TCCAGACCAT GGTGTGCGAC CTGACCGGCA TGGCGATCGC CAACGCCTCG ATGCTCGACG AGGCCACCGC CGCCGCCGAG GCCATGACAT TGGCCCGACG CAGCGTGAAG AGCAAGAGCG CCACGGTCAT CGTGGCTTCG GATTGCCACC CGCAGACCAT CGAGGTGATC CGCACGCGCG CGCGGCCGCT GGGCATCGAG GTGCTGGTGG GCCTGGTGCC GGAGCTGATG GCGCAGCACG AGTACTTCGC CGTGATCGCA CAGTACCCGG CGACCGGCGG CATCGTGCAC GACATGAAGC CCTATGTGGA TGCGGCGCAT GCCAGCGGTG CGGCCTTCGT CGTCGCCGCC GACCTGCTGG CGCTGACGCT GCTCACGCCG CCCGGTGAAT GGGGTGCCGA TATCGCCATC GGCAGCACCC AGCGCTTCGG CATGCCGATG GGCGCCGGCG GCCCGCACGC GGCCTTCCTG GCCTGCCGTG ACGAGTTCAA GCGCTCGATG CCGGGACGCC TGGCCGGCGT GAGCGTCGAC GCGCACGGCC GGCCCGCCTA CCGGCTGGCG TTGCAGACGC GCGAGCAGCA CATCCGCCGC GAGAAGGCCA CGTCGAACAT CTGCACCGCC CAGGTACTGC CGGCGGTGGT GGCGAGCATG TATGCGGTCT ACCACGGACC TCAGGGGCTC AAGCGCATCG CGCAGCGCGT GGCGAGCTAC ACCGCCATCC TGGCGCACGG CCTGCGCGCG ATGGGCTTCG AGCTCGCCGG TGACAGCGCC TTCGACACGC TGTGCGTCGA GACCGGCATC GAAACCAATG CCATCGCCCG CCGCGCCCGC GATGCCGGAG CCAACCTGCG ACGCCTGTCG GCCACGCAGC TCGGCATCAC GCTGGACGAG ACCACCACGC GCGAAGATCT CACCGCGCTG TGGTCATGGT TCGCGCGCTC CGAGGGCATG GTGCCCAACG TCGCTGCGTT CGCGCACGGC GCCGACCCGC TGATCCCCAG CGAGCTGCGC CGCACCAGCG CCTACCTGAC CCACCCGGTC TTCAACAGCC ACCACAGCGA GACCGAGATG CTGCGCTACC TGCGCAGCCT GGCCGACAAG GACCTCGCGC TCGACCGCAC GATGATCCCG CTGGGTTCGT GCACCATGAA GCTCAACGCG ACCAGCGAGA TGATCCCGAT CACCTGGCCC GAGTTCGCGC ACATCCACCC GTTCGCCCCG GCCGAGCAGC TGGCCGGCTA TGCCGAACTC GACGCCCAGC TGCGCCAGTG GCTGTGCGAG GCCACCGGCT ACGCCGGCAT CAGCCTGCAG CCCAACGCCG GCTCGCAGGG CGAATATGCC GGCCTGCTGG CGATCCAGGC CTGGCATGCC TCGCGCGGCG ACGCGCAGCG CGACGTCTGC CTGATCCCCG AGTCGGCGCA CGGCACCAAC CCGGCGAGCG CCCAGATGGC CGGCATGCGC GTGGTGGTGA CGAAGTGCGA CGCCGAGGGC AACGTCGACC TCGACGACCT GCGCGCCAAG TGCGAGCAGC ACAGCTCGCA ACTGGCCGCC GTGATGGTCA CCTACCCGTC GACCTACGGC GTGTTCGACC CGCACATCAA GGCGCTGTGC GCACTGGTCC ACCAGCACGG CGGCCGGGTC TACGTGGACG GCGCGAACAT GAACGCGCTG GTCGGCGTGG CCGCGCCGGG CGAGTTCGGC GGCGACGTGA GCCACCTCAA CCTTCACAAG ACCTTCTGCA TTCCGCACGG CGGCGGCGGC CCGGGCGTGG GCCCGGTGTG CGTGGTCGAG GACCTGGTGC CCTTCCTGCC GACGCACCGC ACGGGCGCCT CGGCGAGCTC CGTGGATGCC TACGTCGGCA GCGCGGTCGA GCCGACGCCG GGCCGCCCCA AGGTGGCGAG CGCCCTCTCG GGGGGCAGCG AAGTGCCCGC AGGGCCGAGC GTGGGGGCCG TCAGTGCTTC GCCGCTGGGC AACGCGGCGG TGCTGCCGAT CAGCTGGATG TACGTCCGCA TGATGGGTGC AGCCGGCCTG ACCGCTGCGA CCGAGACCGC GATCCTGAGC GCGAACTACA TCGCCGCGCG GCTGGCCGAC CACTACGACA TCCACTTCAG CGGCGAGGTC GACGGGCTCA AGGGCGGCGG TGTCGCCCAC GAGTGCATCC TCGACCTGCG CCCGCTGAAG GACAGCAGCG GCGTCAGCGC CGAGGACGTG GCCAAGCGCC TGATCGACTA CGGCTTCCAC GCGCCGACGC TGAGCTTCCC GGTCGCCGGC ACGCTGATGG TCGAGCCGAC CGAGAGCGAG TCGCTGCACG AGCTCGACCG CTTCTGCGAC GCACTGATCG CGATCCGCGC CGAGATCGCC CGGGTCGAGC AAGGCCACTG GCCGCAGGAC GACAACCCGC TCAAGCACGC GCCGCACACC GCCGAGGCGC TGCTGAAGGC CGACTGGCCG CACCCCTACT CGCGTGAGGA GGCCGCCTAC CCGGTGAGCA GCCTGCGGCG TCAGAAGTAC TGGGCACCGG TCGGTCGCGT CGACAACGTG CACGGCGACC GCAACCTGTT CTGCAGCTGC GTGCCGCTGA GCGCCTACGC CGAAGCCGAC AAGCAGGCAT GA
|
Protein sequence | MLRPLALPLA ALEHTTDFAG RHIGIDPADE QHMLSVIGAA SRQAMIEAIV PRSIARARPM VLPEPVGEVQ ALAELKAIAA KNRVYRSYIG QGYHGTHTPG VILRNILENP AWYTAYTPYQ AEISQGRLEA LVNFQTMVCD LTGMAIANAS MLDEATAAAE AMTLARRSVK SKSATVIVAS DCHPQTIEVI RTRARPLGIE VLVGLVPELM AQHEYFAVIA QYPATGGIVH DMKPYVDAAH ASGAAFVVAA DLLALTLLTP PGEWGADIAI GSTQRFGMPM GAGGPHAAFL ACRDEFKRSM PGRLAGVSVD AHGRPAYRLA LQTREQHIRR EKATSNICTA QVLPAVVASM YAVYHGPQGL KRIAQRVASY TAILAHGLRA MGFELAGDSA FDTLCVETGI ETNAIARRAR DAGANLRRLS ATQLGITLDE TTTREDLTAL WSWFARSEGM VPNVAAFAHG ADPLIPSELR RTSAYLTHPV FNSHHSETEM LRYLRSLADK DLALDRTMIP LGSCTMKLNA TSEMIPITWP EFAHIHPFAP AEQLAGYAEL DAQLRQWLCE ATGYAGISLQ PNAGSQGEYA GLLAIQAWHA SRGDAQRDVC LIPESAHGTN PASAQMAGMR VVVTKCDAEG NVDLDDLRAK CEQHSSQLAA VMVTYPSTYG VFDPHIKALC ALVHQHGGRV YVDGANMNAL VGVAAPGEFG GDVSHLNLHK TFCIPHGGGG PGVGPVCVVE DLVPFLPTHR TGASASSVDA YVGSAVEPTP GRPKVASALS GGSEVPAGPS VGAVSASPLG NAAVLPISWM YVRMMGAAGL TAATETAILS ANYIAARLAD HYDIHFSGEV DGLKGGGVAH ECILDLRPLK DSSGVSAEDV AKRLIDYGFH APTLSFPVAG TLMVEPTESE SLHELDRFCD ALIAIRAEIA RVEQGHWPQD DNPLKHAPHT AEALLKADWP HPYSREEAAY PVSSLRRQKY WAPVGRVDNV HGDRNLFCSC VPLSAYAEAD KQA
|
| |