Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2634 |
Symbol | |
ID | 4785859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 2809717 |
End bp | 2810877 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640091205 |
Product | putative dioxygenase (alpha subunit) oxidoreductase protein |
Protein accession | YP_001021823 |
Protein GI | 124267819 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.456409 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGACC TGAGCATCAC CCGCGAAGTC CTCGGGCGCA GCCGCGCGCA GCTGCCCGTG TCGAGCTATT TCGACGAGGA TCTGTACCGC CGCGAGGAGG AGCTCATCAT CCGACCTGGC CCGCGCTACG TCGGGCATGC GCTCTCGGTG CCCGAGGTCG GCGACCACCA TGCGCTGCTG CAGGAAGGCG AAGGCCGGGC GCTGGTGCGC ACGCCGCAGG GCATCGAGCT GATCTCCAAC GTCTGCCGCC ACCGCCAGGC GGTGATGCTG CGCGGGCGCG GCAACACGAA GAACCACATC GTCTGCCCGC TGCACCGCTG GACCTACGAC CTGTCGGGGC AGCTGGTCGG CGCGCCGCAC TTCCCCGACG ACCCCTGCCT GCACCTGAAC AACTACCCGG TGCAGCAGTG GAACGGGCTG CTGTTCGAGG CGCCGCGGGA GGTGAACGGC CAACGGATCG GTCGCGACGT GCACGCCGAC CTGGCCGGCC TCGCGCTGCC GCCGGAGTTC GACTTCTCGG GTTATGTGTT CGACCGCGTG CACCTGCACC AGTGCGACTA CAACTGGAAG ACCTTCATCG AGGTCTACCT CGAGGACTAC CACGTCGGCC CCTTCCACCC CGGGCTCGGC AACTTCGTCG CCTGCGACGA CCTGCGCTGG CAGATGGGCC CCGAGCATTC GCTGCAGACC GTCGGCGTGG CGCGCCATGC CGGCGAGGCG CTGGGCAAGC CCGGCTCCGA CGTCTATGGC CGCTGGCACC AGGCGCTGCT GAACTACCGC CGCGACGCGC ACGACAGCGT GCCGCCGAAG CACGGTGCCA TCTGGCTGAC CTACCACCCG ACGGTGATGC TGGAGTGGTA CCCGCATGTG CTGGTGGTCT CGACCCTGGT GCCGCAGGGG CCGCGCAAGA CGCTCAACGT GGTCGAGTTC TTCTACCCCG AGGAGATCGC CGCGTTCGAG CGCGAGTTCG TCGAGGCGCA GCAGGCGGCC TACATGGAGA CCTGCGTGGA GGACGACGAG ATCGCGCTGC GCATGGACGC CGGGCGCGAG GCGCTGTGGC GACGTGGCGA CGACGAGTTC GGCCCCTACC AGAGCCCGAT GGAAGACGGC ATGCAGCACT TCCACGAGTG GTATCGGCGG CGCATCGCGC TGCCGCGCTG A
|
Protein sequence | MSDLSITREV LGRSRAQLPV SSYFDEDLYR REEELIIRPG PRYVGHALSV PEVGDHHALL QEGEGRALVR TPQGIELISN VCRHRQAVML RGRGNTKNHI VCPLHRWTYD LSGQLVGAPH FPDDPCLHLN NYPVQQWNGL LFEAPREVNG QRIGRDVHAD LAGLALPPEF DFSGYVFDRV HLHQCDYNWK TFIEVYLEDY HVGPFHPGLG NFVACDDLRW QMGPEHSLQT VGVARHAGEA LGKPGSDVYG RWHQALLNYR RDAHDSVPPK HGAIWLTYHP TVMLEWYPHV LVVSTLVPQG PRKTLNVVEF FYPEEIAAFE REFVEAQQAA YMETCVEDDE IALRMDAGRE ALWRRGDDEF GPYQSPMEDG MQHFHEWYRR RIALPR
|
| |