Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1004 |
Symbol | |
ID | 4787180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 1067383 |
End bp | 1068444 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640089566 |
Product | putative vanillate O-demethylase oxygenase subunit A |
Protein accession | YP_001020201 |
Protein GI | 124266197 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.413289 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCAGA AGATCGAATT GAAGCAGTCC TACCTGACCA ACGCCTGGTA CGTGGCGGCG CTGTCCACCG AGGTGGGAGC GCAGGCGCTG TTCCACCGCA AGATCCTGGA CACCTCGATC CTGATCTACC GCAAGCAGGA CGGCACGGCG GTGGCGCTGC ACGACCGCTG CCCGCACCGC TTCGCGCCGC TGCACCTGGG CAAGCGGATC GGCGACGAGG TGGCGTGCCT GTACCACGCG CTGCAGTTCG ACTGCACGGG GCAGTGCACG AAGAACCCGC ACGGCAACGG GCAGATCCCG AAGGCGGCGA AGGTGCGCAG CTTCCCGCTG GAGGAGCGCT ACGGATTCCT GTGGATCTGG ATGGGCGAGG ACGCGCCGGA CCTGGCACGG CTGCCGGACT TCGGCGAGCT CGACAAGGGC CCCGACACCG GCGTCGCCTT CACCTACATG CACATGAAGG CGAACTACGA GCTGATCATC GACAACGTGA TGGACCTGAG CCACGTCGAC CACGTGCACG GCGAGATCAT CACCACGCGC GGGCAGCTGT CGCCGCAGAT TCCGAAGCTG CGCGAGGGCA CCGAGTCGGT CGCGGCGCGC TGGGAATGGC AGCAGACGCC GCCGCTGCTG ATCCTGGCGG ACTTCCTGCC CGAACCGAAG ACGCAGGCCC GGCACTTCAT CGAGGTGAGC TGGAGCCCGC CGGCCAACAT CCAGCTCTCG GTCGGTGCGA CGCAGAACGA CGGCGCGCTC GACCTGGTCA ACTGCATCGG CCAGTACGAC CTGCACACCT GCACGCCGGA GACTGCGAAC ACCACGCACT ACTTCTTCGC CACACGGCGC AACCACGTCG TCGACGACGC GGGCTACAAC GCGGCCAAGA TCCAGGCCAT GCACACCGCC TTCGAGACCG AGGACGGCCC GATCATCCAG GCCATCCACA ACGAGATGGA CACCACCGAC TTCTTCGGCC TGAATCCGGT GCTGATGACG AACGACGTCG CGCCGGTGAA GGTGCGACGC CTGCTGCAGC GGCTGATCGA GCAGGAGGCC GGCGCGCGCT GA
|
Protein sequence | MEQKIELKQS YLTNAWYVAA LSTEVGAQAL FHRKILDTSI LIYRKQDGTA VALHDRCPHR FAPLHLGKRI GDEVACLYHA LQFDCTGQCT KNPHGNGQIP KAAKVRSFPL EERYGFLWIW MGEDAPDLAR LPDFGELDKG PDTGVAFTYM HMKANYELII DNVMDLSHVD HVHGEIITTR GQLSPQIPKL REGTESVAAR WEWQQTPPLL ILADFLPEPK TQARHFIEVS WSPPANIQLS VGATQNDGAL DLVNCIGQYD LHTCTPETAN TTHYFFATRR NHVVDDAGYN AAKIQAMHTA FETEDGPIIQ AIHNEMDTTD FFGLNPVLMT NDVAPVKVRR LLQRLIEQEA GAR
|
| |