Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0599 |
Symbol | |
ID | 4785700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 629145 |
End bp | 630665 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640089158 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001019796 |
Protein GI | 124265792 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.213204 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTACG CCCCTCCCGG CTCCTCCGAT TCCAAGGTCG CCTTCAAGGC CCAGTACGAC AACTTCATCG GCGGCAAGTT CGTGGCCCCC GTGAAGGGAC AGTATTTCGA CGTGATCACG CCGATCACCG GCAAGACCTA CACCCGCGCC GCCCGTTCGG GCCCGGAGGA CATCGAGCTG GCGCTCGACG CCGCCCACGC CGCCGCCGAC AAGTGGGGCC GCACCTCGGC CGCCGAGCGC GCCAACGTGC TGCTGAAGAT CGCCGACCGC ATCGACGCCA ACGTGGAACT GCTGGCCTAC GCGGAATCGG TGGACAACGG CAAGCCGATG CGCGAAACGC TCAACGCCGA CATCCCGCTG AGCGCCGACC ACTTCCGCTA CTTCGCCGGC TGCGTGCGCG CGCAGGAAGG CGGCGTCAGC GAGATCGACG AAACCACGGT GGCCTACCAC TTCCACGAGC CGCTCGGCGT CGTCGGCCAG ATCATTCCCT GGAACTTCCC GATCCTGATG GCCGCCTGGA AGCTGGCGCC GGCCCTGGGC GCCGGTAACT GCGTGGTGCT GAAGCCGGCC GAGTCGACCC CGGTGAGCAT CCTGGTGCTG GCCGAGCTGA TCGCCGACCT GCTGCCGCCG GGCGTGCTGA ACATCGTCAA CGGCCTCGGT CGTGAAGCCG GCATGCCGCT GGCCACCAGC AAGCGCATCG CCAAGATCGC CTTCACCGGC TCCACCTCCA CCGGCCGCGT GATCGCGCAG GCCGCCGCCA ACAGCCTGAT CCCGGCCACG CTGGAGCTGG GCGGCAAGTC GCCGAACGTG TTCTTCGCCG ACGTGATGGA CAAGGACGAC GCCTTCCTCG ACAAGGCGAT CGAGGGCCTG GTGCTGTTCG CGTTCAACCA GGGCGAGGTC TGCACCTGCC CGTCGCGCGC GCTGATCCAC GAGTCGATCT ACGACAAGTT CATGGAGCGC GCCCTCAAGC GCGTGGCCGC GATCAAGCAG GGCAGCCCGC TCGACACCGA CACGATGATC GGCGCGCAGG CCTCGAAGGA ACAGCTGACC AAGATCCTGT CCTACCTCGA CCTGGGCAAG CAGGAAGGCG CGCAGGTGCT GATCGGCGGC CAGCAGGCCC AGATGAGCGG CGACCTCGAC GGCGGCTACT ACGTGCAGCC CACCATCTTC AAGGGCCACA ACAAGATGCG CATCTTCCAG GAGGAGATCT TCGGCCCGGT GCTGGCCGTG ACCACCTTCA AGGACGAGGC GGAAGCCCTC GCCATCGCCA ACGACACGCT GTACGGCCTG GGCGCCGGCG TGTGGAGCCG CAACGGCAAC GTGGCCTACC GTATGGGCCG CGCCATCAAG GCCGGTCGCG TGTGGACCAA CTGCTACCAC GCCTACCCCG CGCACGCCGC CTTCGGCGGC TACAAGGAGT CGGGCATCGG CCGCGAGAAC CACAAGGTCA TGCTCGACCA CTACCAGCAG ACCAAGAACC TGCTGGTCAG CTACAGCGAG AACAAGCTCG GCTTCTTCTG A
|
Protein sequence | MRYAPPGSSD SKVAFKAQYD NFIGGKFVAP VKGQYFDVIT PITGKTYTRA ARSGPEDIEL ALDAAHAAAD KWGRTSAAER ANVLLKIADR IDANVELLAY AESVDNGKPM RETLNADIPL SADHFRYFAG CVRAQEGGVS EIDETTVAYH FHEPLGVVGQ IIPWNFPILM AAWKLAPALG AGNCVVLKPA ESTPVSILVL AELIADLLPP GVLNIVNGLG REAGMPLATS KRIAKIAFTG STSTGRVIAQ AAANSLIPAT LELGGKSPNV FFADVMDKDD AFLDKAIEGL VLFAFNQGEV CTCPSRALIH ESIYDKFMER ALKRVAAIKQ GSPLDTDTMI GAQASKEQLT KILSYLDLGK QEGAQVLIGG QQAQMSGDLD GGYYVQPTIF KGHNKMRIFQ EEIFGPVLAV TTFKDEAEAL AIANDTLYGL GAGVWSRNGN VAYRMGRAIK AGRVWTNCYH AYPAHAAFGG YKESGIGREN HKVMLDHYQQ TKNLLVSYSE NKLGFF
|
| |